hw2sol

advertisement
Solutions: Homework #2
1a)
P(A) = 5/9 = .56
P(B) = .2/9 = .22
P(A  B) = P(A) + P(B) - P(A  B) = .56 + .22 -.11 = .67
P(A  B) = P(A)*P(B|A) = .56*(.2) = .11
P(A|B) = .5
P(B|A) = .2
b)
A and B are not independent because P(A|B) is different from P(A). In other
words, the probability that a person is female changes if we know that their name
begins with the letter J.
2) Using Bayes Theorem (+ = Positive test; D = Drunk):
P(D|+) =
P(+|D)*P(D)
P(+|D)*P(D) + P(+|D`)*P(D`)
=
=
(.9)*(.2) / [(.9)*(.2) + (.05)*(.80)]
.18 / (.18 + .04)
=
.18 / .22
3)
Siblings
Frequen
cy
Percent
Valid
Cumulative
Percent
Percent
Valid 0
9
6.9
6.9
6.9
1
65
50.0
50.0
56.9
2
36
27.7
27.7
84.6
3
11
8.5
8.5
93.1
4
7
5.4
5.4
98.5
8
1
.8
.8
99.2
11
1
.8
.8
100.0
130
100.0
100.0
Total
Statistics
Siblings
N
Valid
Missing
Mean
130
0
1.67
Std. Deviation
1.372
Variance
1.882
.81
Above, you can find the SPSS output. I’m going to transform it a little to show how to
calculate E(x) and S using the discrete random variable formula.
P(x) rounded
x*p(x)
[x-E(x)]2 [(x-E(x))^2]*p(x)
x
frequency
(f/116)
0
9
.069
0.00
2.81
0.19
1
65
.500
0.50
0.46
0.23
2
36
.277
0.55
0.1
0.03
3
11
.085
0.26
1.75
0.15
4
7
.054
0.22
5.40
0.29
8
1
.008
0.06
39.98
0.32
11
1
.008
0.09
86.92
0.70
1.91
1.68
E(x)
Var
SD
Sum [x*p(x)]
Sum[x-E(x)^2*p(x)]
sqrt(var)
1.68
1.91
1.38
You will notice that the values for SPSS differ from those I calculated below. The reason
for that is the SPSS values are based on the actual number of observations that we have.
The discrete random variable calculation assumes we are dealing with an infinite number
of families/siblings. Please come see me if you would like more details about this.
4)
n = # of trials = 3
x = # of successes = 2
p = P(success) = .4
1-p = P(failure) = .6
Binomial formula: (n choose x) * px * (1-p)n-x
P(x=2) = (3 choose 2) * .42 * .61
=
3 * .16 * .6
5)
N
r
n
x
=
.288
# of marbles = 10
# of successes (blue marbles) = 4
total # of marbles picked = 3
# of blue marbles picked = 1
Hypergeometric Rule: (r choose x) * [(N-r) choose (n-x)] / (N choose n)
P(x=1)
=
(4 choose 1) * [(10-4) choose (3-1)] / (10 choose 3)
=
(4 choose 1) * (6 choose 2) / (10 choose 3)
=
4
*
15
/
120
=
60 / 120
=
.50
6)
Statistics
Height
N
Valid
Missing
130
0
Mean
67.84
Std. Deviation
4.426
Variance
z
7)
19.590
=
x - mean / s
=
63 – 67.8 / 4.4
=
-4.8 / 4.4
=
-1.09
According to the Z-table, .8621 of the standard normal curve lies above –1.09 (Body).
This suggests that the proportion of the AD student body that is over 5'3" is .8621.
x - mean / s
64 – 67.8 / 4.4
-3.8 / 4.4
-0.87




z
z
z
z




x - mean / s
72 – 67.8 / 4.4
4.2 / 4.4
0.94
According to the table, .8078 of the standard normal curve lives above –0.87 (Body),
and .1736 lies above 0.94 (Tail). Therefore, the area between these two points is
.8078 -.1736 = .6342. This suggests that the proportion of AC students that fall
between 5'4" and 6' is .6342. Alternatively, you could have calculated the area in the
tail below -0.87 (.1922) and the area in the body beneath 0.94 (.8264; .8264 -.1922 =
.6342. OR, you could have found the area between 0 and -0.87 (.3078) and added that
to the area between 0 and 0.94 (.3264). The sum of .3078 and .3264 is .6342. Golly,
there are a lot of ways home for this one.
Download