COGS 300 — Fall 2013 Assignment 2

advertisement
COGS 300 — Fall 2013
Assignment 2
Solution
Question 1
Solution
(a) At first, cheat. At the second time, cheat if you didn’t get caught the first time. The expected utility is
81.69.
(b) When the probability of Watched is increased a small amount (e.g., from 0.3 to 0.5), the policy changes
to cheat at the first opportunity and don’t cheat at the second opportunity. When the probability of
increased more (e.g., to 0.8) the policy becomes to not cheat at either time.
(c) We can change P(Grade1|Cheat1) and P(Grade2|Cheat2).
If the probability are changed to P(Grade2 = F|Cheat2 = T) = 0.1 and P(Grade2 = A|Cheat2 =
T) = 0.4 (with the other probabilities for Cheat2 = T remaining at 0.25 each), the policy changes to
not cheat at the second opportunity.
If the probabilities of P(Grade1|Cheat1) are changed to be P(Grade2 = F|Cheat2 = T) = 0.1,
P(Grade2 = C|Cheat2 = T) = 0.25 P(Grade2 = B|Cheat2 = T) = 0.25 and P(Grade2 = A|Cheat2 =
T) = 0.4 the policy changes to not cheat at the first opportunity.
(d) See:
Watched1
Watched2
Caught 1
Punishment
Caught 2
Cheat 1
Utility
Cheat 2
Grade 1
Grade 2
Final Grade
If the probability of watched1 goes up, we can even get to the situation where the policy is to not cheat
at the first opportunity and to cheat at the second opportunity. See:
http://www.cs.ubc.ca/~poole/cogs300/2013/cheat_decision_watched2.xml
for such an example (which didn’t happen by changing the probabilities in the first model).
1
(e) At the moment cheating at time 1 only affects the Grade at time 2 by affecting whether the student
will cheat at time 2. We can add an arc from Cheat1 to Grade2. The model can them make it so that
cheating at time 1 reduces grade 2 for the case when the student does not cheat at time 2 (because the
agent doesn’t learn from their mistakes). See:
http://www.cs.ubc.ca/~poole/cogs300/2013/cheat_decision_watched2_grade2.xml
for an example.
(f) You need to say something sensible here.... The probabilities are way off; most students don’t cheat.
(However, it is not clear how we could acquire these probabilities, or why the students should believe
any probabilities the university provides — they are subjective beliefs after all). The utilities for
punishment need to be more extreme. There are more than two opportunities to cheat, but maybe we
could model all subsequent behaviour as a single variable.
(g) Making punishment more severe (suspension or recorded on transcript for first offences) makes cheating less good even if the probability of getting caught is 0.2. See e.g.,:
http://www.cs.ubc.ca/~poole/cogs300/2013/cheat_decision_honour.xml
(h) Write something sensible here. Present your argument for or against.
Question 2
Solution
(a) The decision network is:
Final
Grade
Difficulty
Pass
Midterm
Pass Final
Study
Midterm
Study
Final
Utility
Total Effort
(b) In the following the actual numbers are not significant. However,they have the following characteristics that is important: A is preferred to B which is preferred to C which is much preferred to F for all
efforts. Working little is much better than working a lot. The best outcome is to get an A with little
effort. The worst outcome is to work hard and get an F.
2
Grade Total Effort Utility
A
Lot
50
A
Little
100
B
Lot
40
B
Little
90
C
Lot
30
C
Little
80
Lot
0
F
F
Little
20
(c) For this student, getting an A is much better than getting a B, which is a little bit better than a C which
is a little bit better than F. Arguably they will feel better getting a low grade if they worked hard than
if they didn’t (although we won’t take marks off for the other way around). What is important is the
big gap between the A’s and the rest.
Grade Total Effort Utility
A
Lot
95
Little
100
A
B
Lot
40
B
Little
30
C
Lot
20
Little
10
C
F
Lot
10
F
Little
0
Question 3
Solution Chris is right, the fact that a user did not use a particular word is important information that should
not be ignored. For example, if people who want to know about printer problems almost always use the word
“print” (or a derivative), the fact that someone did not use the work “print” is evidence that it is not a printer
problem. This is different from now knowing whether they used the word “print”.
Question 4
Solution
(a) The words are independent of each other given the help page. (But they are dependent on each other
if the help page is not observed.)
(b) The words probably should not be independent of each other, there are some words that are likely to
go together (such as “I” and “am”) and some that are not likely to go together (such as “folder” and
“directory” — someone is likely to use only one of these depending on which system they use).
(c) The topics are independent of each other given no words (but are dependent given words). The words
are independent of each other given the topics (but are dependent of each other given no topics).
(d) If one topic is a superset of another (e.g., shirt and clothes) or are unlikely to go together (e.g., fashion
and science, even they are both associated with “model”) or are likely to go together (e.g., fashion and
shirts). There are many possible answers.
3
Download