STATISTICS 401B Fall 2014 Laboratory Assignment 6

advertisement
STATISTICS 401B
Fall 2014
Laboratory Assignment 6
1. PCBs have been in use since 1929, mainly in the electrical industry, but it was not until the 1960s that they
were found to be a major environmental contaminant. In 1998, researchers reported the following study in
an environmental journal: Thirteen study sites from the five Great Lakes were selected. At each site, 9 to
13 herring gull eggs were randomly collected each year for several years an the mean PCB content in the
eggs were determined. The following are data for the years 1982 and 1996:
Site
Year
1
2
3
4
5
6
7
8
9
10
11
12
13
1982 61.48 64.47 45.50 59.70 58.81 75.86 71.57 38.06 30.51 39.70 29.78 66.89 63.93
1996 13.99 18.26 11.28 10.02 21.00 17.36 28.20
7.30 12.80
9.41 12.63 16.83 22.74
Note: Do all computations for parts (b),(c), and (d) using hand calculation. Use Excel to compute
P
P
di , d2i , i = 1, . . . , 13 and thus
di , d2i and sd and sd¯. Use the output from a JMP analysis using the
data file gull egg PCB.jmp downloaded from the web page to provide answers to parts (a),(e) and (f).
Turn in the marked JMP output page and show work.
(a) Do the Normal probability plot, the boxplot, and the stem-and-leaf plots from the JMP output indicate that a paired t-test could be used? Explain.
(b) Legislation was passed in the 1970s restricting production and use of PCBs. Thus the active infusion
of PCBs from local sources has been severely curtailed. Compute the paired t-statistic to test the
research hypotheses that there has been a significant decrease in mean PCB content in herring gull
eggs. Use α = .01. State your hypotheses clearly. Define the symbols you use carefully.
(c) Explain what is meant by the p-value in words by defining it for the above test. Show how to find
bounds for the p-value of the above test using the t-table.
1
(d) Use a 98% confidence interval to estimate the decrease in the mean PCB content in herring gull eggs.
Test the hypothesis in part (b) using this interval stating the α level used for this test clearly. State
your conclusion.
(e) Extract the t-statistic from the JMP output for testing the hypothesis you stated in part (b). Circle
it in the JMP output. State the p-value associated with this test extracted from the JMP output.
(f) Extract the 98% confidence interval for the decrease in the mean PCB content in herring gull eggs
from the JMP output. Copy it here and circle it in the JMP output.
2. Twenty observations on etch uniformity on silicon wafers, y, are taken during a qualification experiment
for a plasma etcher. The data are: 5.34, 6.00, 5.97, 5.25, 6.65, 7.55, 7.35, 6.35, 4.76, 5.54, 5.44, 4.61, 5.98,
5.62, 4.39, 6.00, 7.25, 6.21, 4.98, 5.32. Use this data to provide answers to the following questions.
P
Note: Do all computations for parts (b), and (c), using hand calculation. For this data
y = 116.56,
P 2
y = 694.33. Use the output from a JMP analysis to provide answers to parts (a),(d) and (e). Turn in
the marked JMP output and show work.
(a) Do the box plot or the normal probability plot indicate any violation of the conditions underlying the
use of the chi-square procedures for constructing confidence intervals or testing hypotheses about σ?
2
(b) A quality control inspector might require the etch uniformity of the wafers to be reasonably consistent, in particular, that σ of more than 1 unit to be unsatisfactory. Is there sufficient evidence in the
data to determine that the standard deviation meets this criterion? State appropriate hypotheses and
perform a test with α = .05.
(c) Estimate the standard deviation of the etch uniformity of the wafers using a 95% confidence interval.
(d) Extract the statistic from the JMP output for testing the hypothesis you stated in part (b). Circle it
in the JMP output. State the p-value associated with this test extracted from the JMP output.
(e) Extract the 95% confidence interval for the standard deviation of the etch uniformity of the wafers
from the JMP output. Copy it here and circle it in the JMP output.
3
3. In a packing plant, a machine (A) packs cartons with jars. It is proposed to purchase a new machine (B)
that is advertized to pack faster on the average than the machine currently used. To compare the machines
the times it takes each machine to pack ten cartons are recorded. The results, in seconds, are shown in the
following table.
Machine
A
B
42.7
42.1
43.8
41.3
42.5
42.4
Packing Time(in sec.)
43.1 44.0 43.6 43.3
43.2 41.8 41.0 41.8
P
43.5
42.8
42.1
42.3
44.1
42.7
y
432.7
421.4
P 2
y
18,726.91
17,762.00
(a) Use the normal probability plots output from the JMP output to comment on whether the assumption
of normality of the two populations is reasonable. Does this plot support the assumption that the
population variances are the same for the two populations? Explain
(b) Compute the sample variances s2A and s2B , respectively, of the two samples using the above summary
statistics. (Must use hand computation)
(c) Compute a statistic to test the research hypothesis that the two population variances of the packing
times for the two machines are different using α = .05 by hand. State your hypotheses, the rejection
region and your conclusion. (Must use hand computation)
(d) Extract the statistic from the JMP analysis for testing the hypothesis you stated in part (b). Circle
it in the JMP output. State the p-value associated with this test extracted from the JMP output.
Due Thursday, Oct 30, 2014 (turn-in during the first 20 min.
4
of the lab )
Download