solutions

advertisement
Stat 250.3
November 10, 2003
SOLUTIONS
Activity 7: Confidence Intervals and Hypothesis Tests
Situation 1:
A sociologist wants to know how much more money on average a husband makes than his wife. The sociologist records the salaries
for 40 random couples (assume both husband and wife are employed).
A) What type of data is this? Paired
B) Suppose that the sample mean salary for husbands was $52,000 and the sample mean salary for women was $41,000. Differences
are taken for each pair and the husband’s salary is $11,000 more on average with a standard deviation of the differences equal to
$8000. Construct a 95% confidence interval for the mean difference in salaries between husbands and wives.
Parameter of Interest: μd: average difference of salaries in a couple (husband minus wife).
Statistic of Interest:
d
=$11,000
Conditions: n=40>30
Confidence interval calculations: df = 40-1=39. Looking in the Table A.2, for 30 df (next lowest for 39df) and Confidence level 95%,
we have that the multiplier t* = 2.04.
sd 8000
=
 1264.9 .
n
40
Thus, the 95% CI for μd is d ± t* s.e.( d
The s.e.(
d
)=
) = 11 000 ± (2.04) (1264.9) = ($8,414.58, $13,580.42)
Interpretation of the CI: We are 95% confident that on average the salary of a husband is from $8,414.58 to $13,580.42 higher
than the salary of his wife.
C) Test the hypothesis that on average the salaries of the husbands are higher than the ones of their wives.
H0: μd =0 vs Ha: μd≠0
t
d  0 11000

 8.69 , and the p-value=P(T> 8.9) > 0.05 (where T is a t random variable with 39 df.).
sd
1264.9
n
So the conclusion is that we reject the H0, and conclude the on average, the husband salary is higher than the one of his wife.
Using the CI above, without any further calculations we could have rejected H0 at significance level alpha=0.025. However,
after we got the p-value, we can see that H0 is rejected for alpha = 0.05 as well.
Situation 2:
A scientist wants to know how effective a new drug is at lowering blood pressure. He randomly assigns 70 patients to receive either the
blood pressure lowering drug, or a placebo (half get each). The average decrease in blood pressure for the drug group is 8 with a
standard deviation of 10. The average decrease for placebo was 2 with a standard deviation of 4.
A) Construct an appropriate 99% confidence interval for the difference in the mean decrease in blood pressure between the drug and
placebo.
Parameter of Interest: µ1-µ2 , where µ1 is the average decrease in blood pressure for people using the drug, and µ 2 is the
average decrease in blood pressure for people using the placebo.
Statistic of Interest:
x1  x2  8  4  4
Conditions: n1=n2=35 > 30, and the two samples are random and independent.
Confidence interval calculations: df = min(35-1, 35,1)= 34. Looking in the Table A.2, for 30 df (next lowest for 34df) and
Confidence level 99%, we have that the multiplier t* = 2.75.
s.e.( x1  x ) 
s1 2 s 2 2


n1 n2
10
2
35

4
2
35
 1.82
Thus the 99% CI for µ1-µ2 is 4 ± (2.75) (1.82) = (-1.005, 9.005).
B) Test the hypothesis that the drug is effective. (i.e. works better than the placebo) at alpha=0.01 significance level.
H0: µ1-µ2 = 0 vs Ha: µ1-µ2 ≠0
Since zero falls in the 99%CI for µ1-µ2 , we fail to reject the null hypothesis at alpha=0.01 significance level, and conclude that
there is no significant difference between the two means. Therefore, the drug is not significantly effective testing at
alpha=0.01 significance level.
Stat 250.3
November 10, 2003
Situation 3: 12.7 From the Text
A study was done by randomly assigning 200 volunteers with sore throats to either drink a cup of herbal tea or use a sore throat
lozenge to ease their pain. The percent reporting relief for the two methods were compared. What are the research question, the
population parameter, and the sample statistic for this study?
Research question: How much difference is there between the percent getting relief from sore throat symptoms if herbal tea
is used versus if throat lozenges are used?
Population parameter: p1  p 2 = difference in percents reporting relief for population that would use herbal tea and population
that would use throat lozenge.
Sample estimate:
volunteers.
pˆ 1  pˆ 2
= difference in observed percents reporting relief for the two different methods in the study of 200
Situation 4:
A random sample of students was surveyed on whether they smoked pot, and whether they ever tried a “harder” drug (cocaine, heroin,
etc.) The summary statistics for the two variables were as follows:
X = Number who Used a Harder Drug
Test and CI for Two Proportions
Sample
Pot
No Pot
X
N Sample p
34 140 0.242857
40 248 0.161290
Estimate for p(1) - p(2): 0.0815668
95% lower bound for p(1) - p(2): 0.0106494
Test for p(1) - p(2) = 0 (vs > 0): Z = 1.89 P-Value = 0.029
A) Create a 95% for the difference in proportion of students that smoked pot and that did not smoked pot that tried a “harder” drug.
Let p1 be the proportion of students that smoked pot, that tried a harder drug, and p2 be the proportion of students that
haven’t smoked pot, that tried a harder drug,
pˆ 1  pˆ 2  34 / 140  40 / 248  0.08156,
se(
pˆ 1  pˆ 2 ) 
pˆ 1(1  pˆ 1) pˆ 2(1  pˆ 2)

 0.04311
n1
n2
So, the 95% CI for p1 -p2 is 0.08156 ± 2(0.04311) = (-0.0046, 0.16779)
B) Is there enough evidence to conclude that a higher proportion of students who smoked pot than students who did not try pot have
tried a “harder” drug?
Since zero falls in the interval calculated in part (A), we fail to reject the null hypothesis that the two proportions are equal at
alpha=0.025 significance level.
However, if you go through the steps to test the hypothesis H0: p1 -p2 =0 vs Ha p1 -p2 >0, you will get z-stat = 1.89 p-value =
0.029. Using the p-value, we can conclude that we REJECT H0 at alpha=0.05.
Download