Stat 301 B – Fall 2014 – Midterm exam 1

2 October 2014
Questions packet
Instructions:
1. Put your name on the back of the last page of the questions packet. I don’t want to see your
name until I have finished grading.
2. Read each question carefully and completely. Ask if you don’t understand something.
3. Answer each question and show work in the space provided. Scratch paper is provided for your
use, but I will only read and evaluate what you put in the answer spaces.
4. Use the JMP output. You do not need to calculate something that JMP has already calculated. I
am happy to answer questions along the lines of ‘(You pointing to a number on the JMP output)
Is this the estimate of the difference in means?’.
5. The exam has a total of 100 pts. Each question is worth 3, 4, or 5 points.
6. You can keep the Information packet and Formula sheet. Turn in the Questions packet.
1. Dioxin in chemical plant workers.
a. 3 pts. Based on what you see in the histogram and/or box plot, what is the concentration of Dioxin
(approximately) in the typical worker at this chemical plant? Include units with your answer.
b. 4 pts. Is the distribution of Dioxin concentration symmetrical or skewed? Briefly explain your choice.
c. 4 pts. What is the standard error of the mean? (show your work, state what item in the JMP output
provided your answer, or state that you need additional information)
d. 3 pts. Describe, in a non-technical way (i.e., to someone who cares about chemical plant workers but
doesn’t know any statistics), how to interpret the standard error of the mean (i.e., explain what the se
“means”).
e. 4 pts. Dioxin is a nasty chemical. The company wants to be certain that the mean Dioxin
concentration in all 1550 workers is less than 500 ppt. Calculate the T statistic that tests the null
hypothesis that the mean Dioxin concentration equals 500 ppt. (show your work, you don’t need to
calculate a p-value)
f. 4 pts. The two-sided p-value for the test in part e is 0.0245. What should you tell management when
they ask you how the mean Dioxin in all 1550 workers compares to the standard of 500 ppt?
g. 5 pts. The 95% confidence interval for the mean is (328, 488). Is it reasonable to conclude that 95%
of the workers at this chemical plant have Dioxin concentrations between 328 ppt and 488 ppt? Briefly
explain why or why not.
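Illustration of the formulas behind parts c and e, as a minimal sketch. The numbers below are hypothetical and are not taken from the JMP output; the function name `one_sample_t` is also an assumption made for this example.

```python
import math

def one_sample_t(sample_mean, sample_sd, n, null_mean):
    """Return (standard error of the mean, t statistic) for H0: mean == null_mean."""
    se = sample_sd / math.sqrt(n)        # se of the mean = s / sqrt(n)
    t = (sample_mean - null_mean) / se   # distance from the null value, in se units
    return se, t

# Hypothetical summary statistics (NOT the Dioxin data)
se, t = one_sample_t(sample_mean=10.0, sample_sd=4.0, n=16, null_mean=12.0)
print(se, t)  # 1.0 -2.0
```

The T statistic is compared to a t distribution with n - 1 degrees of freedom.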
2. Oatmeal and blood pressure.
a. 4 pts. Calculate the pooled standard deviation, sp. (Show your work, state what item in the JMP
output provided your answer, or state that you need additional information)
b. 4 pts. How many degrees of freedom are associated with the pooled standard deviation, sp? (Show
your work, state what item in the JMP output provided your answer, or state that you need additional
information)
c. 5 pts. Is the pooled standard deviation an appropriate summary of the variability in each of the two
groups of men? Briefly explain why or why not.
d. 4 pts. The report of this investigation includes the following table of sample averages:
Group          Sample size (N)   Sample Average
eat oatmeal    20                117.067
hate oatmeal   42                126.213
Are the averages in this table appropriately reported? Briefly explain your answer.
e. 4 pts. What is the 95% confidence interval for the difference in means? (Show your work or state
what item in the JMP output provided your answer)
f. 4 pts. Based on what you see in the JMP output, what can you say about the p-value for the test of
the null hypothesis that the mean difference = 0? Briefly explain your answer.
g. 5 pts. Are these data paired or are they two independent samples? Briefly explain your choice.
h. 5 pts. Is it appropriate to conclude that eating oatmeal three times a week or more will reduce your
blood pressure? Briefly explain why or why not.
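Illustration of the pooled standard deviation used in parts a and b, as a minimal sketch. The group standard deviations and sample sizes below are hypothetical, not the oatmeal data, and the function name `pooled_sd` is an assumption made for this example.

```python
import math

def pooled_sd(sd1, n1, sd2, n2):
    """Return (pooled standard deviation, degrees of freedom) for two groups."""
    df = (n1 - 1) + (n2 - 1)  # each group contributes n - 1 degrees of freedom
    pooled_var = ((n1 - 1) * sd1**2 + (n2 - 1) * sd2**2) / df
    return math.sqrt(pooled_var), df

# Hypothetical group summaries (NOT the oatmeal data)
sp, df = pooled_sd(sd1=3.0, n1=5, sd2=4.0, n2=10)
print(round(sp, 3), df)
```

The pooled variance is an average of the two group variances, weighted by their degrees of freedom, so sp always falls between the two group standard deviations.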
3. PCB and egg shell thickness
a) 4 pts. What egg shell thickness do you expect to find if the PCB concentration in the egg = 0? (Show
your work, state what item in the JMP output provided your answer, or state that you need additional
information.)
b) 4 pts. Give a 95% interval that describes the average egg shell thickness in a large collection of eggs,
all with a PCB concentration = 0. (Show your work, state what item in the JMP output provided your
answer, or state that you need additional information.)
c) 4 pts. What is the p-value for the test of the null hypothesis that the regression slope = 0? (Show your
work, state what item in the JMP output provided your answer, or state that you need additional
information.)
d) 5 pts. Based on your result in question 3c, is it appropriate to conclude that “The regression slope for
Brown Penguins on Anacapa Island = 0”? Briefly explain your answer.
e) 4 pts. What is the 95% confidence interval for the difference in average shell thickness between eggs
with a PCB concentration of 225 ppb and eggs with a PCB concentration of 226 ppb? (Show your work,
state what item in the JMP output provided your answer, or state that you need additional information.)
f) 4 pts. If the shell of an egg is thin, it is more likely to get broken during incubation. What 95% interval
describes the uncertainty in predicted egg shell thickness for a single egg with a PCB concentration of
350 ppb? (Show your work, state what item in the JMP output provided your answer, or state that you
need additional information.)
g) 4 pts. The investigators made sure they had only one egg from each nest and only one measurement
of PCB concentration per egg. Would you be concerned if the data set had 3 measurements of PCB
concentration for each egg, i.e., a total of 195 observations? Briefly explain your concern or lack of
concern.
h) 3 pts. Do you have any concern about lack of fit of the regression line, i.e. the assumption that the
relationship between PCB concentration and mean egg thickness can be described by a straight line?
Briefly describe what in the output indicates a concern or lack of concern.
i) 3 pts. Do you have any concerns about the assumption of equal variances? Briefly describe what in
the output indicates a concern or lack of concern.
j) 3 pts. Do you have any concerns about the assumption that the errors are normally distributed?
Briefly describe what in the output indicates a concern or lack of concern.
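Illustration of the difference between the interval for a mean (part b) and the interval for a single predicted value (part f) in simple linear regression, as a minimal sketch. Every number below is hypothetical and none comes from the JMP output. The function name `regression_intervals` and its parameter names are assumptions made for this example, and the t quantile is passed in by hand rather than computed.

```python
import math

def regression_intervals(b0, b1, s, n, xbar, sxx, x0, t_crit):
    """Return (interval for the mean, interval for one new observation) at x0.

    s is the root mean square error, sxx is the sum of squared deviations
    of x from its mean, and t_crit is the t quantile with n - 2 df.
    """
    fit = b0 + b1 * x0
    leverage = 1.0 / n + (x0 - xbar) ** 2 / sxx
    se_mean = s * math.sqrt(leverage)        # uncertainty in the fitted mean
    se_pred = s * math.sqrt(1.0 + leverage)  # adds variability of a single egg
    mean_ci = (fit - t_crit * se_mean, fit + t_crit * se_mean)
    pred_int = (fit - t_crit * se_pred, fit + t_crit * se_pred)
    return mean_ci, pred_int

# Hypothetical regression summary (NOT the PCB data); 2.069 is the
# 0.975 quantile of a t distribution with 23 degrees of freedom.
mean_ci, pred_int = regression_intervals(
    b0=2.0, b1=-0.5, s=1.0, n=25, xbar=4.0, sxx=100.0, x0=4.0, t_crit=2.069
)
print(mean_ci, pred_int)
```

The prediction interval is always wider than the interval for the mean at the same x value, because it includes the variation of an individual observation around the line.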