Statistics 401 Final Examination (200 points)

advertisement
Statistics 401
Final Examination (200 points)
Name
1. The price of a laptop of a particular make and configuration among dealers nationwide is assumed to have
a Normal distribution with mean µ = $1200 with standard deviation σ = $50.
(a) (10) What is the probability that a laptop of the same make, chosen randomly from a dealer, will cost
between $1136 and $1308 ? (Show your work).
(b) (10) What is the probability that the average price of 25 laptops of the same make, chosen randomly
from a dealer, will be greater than $1225? (Show your work).
2. To estimate the mean potency µ of an antibiotic in a production batch, a manufacturer takes potency
readings of 14 random samples, giving ȳ = 8.67 , and s2 = 0.56.
(a) (10) Estimate the mean potency for the batch using a 95% confidence interval.
(b) (10) According to the industry standard, the standard deviation of the potency for this drug,
σ, must not exceed 0.7. State and test the hypotheses that this batch does not meet the
industry standard. Use α = .05.
1
3. Brands of a type of electrical wire produced by two different manufacturers are being studied to compare
their resistivity characteristics. The values of resistivity measurements obtained by the experimenter suggests that they can be considered to be samples from normal distributions and are used to calculate the
following statistics:
Brand A
Brand B
2
n1 = 8 ȳ1. = 0.140 s1 = 0.000007 n2 = 9 ȳ2. = 0.145 s22 = 0.00002
Assuming that the population means are µ1 and µ2 and the population variances are σ12 and σ22 , respectively,
answer the following questions:
(a) (10) Test the hypothesis H0 : σ12 = σ22 vs. Ha : σ12 6= σ22 . Use α = .05 and show work.
(b) (10) Now assume σ12 = σ22 . Compute the standard error of the difference in the sample means ȳ1. − ȳ2. .
(c) (10) Use the standard error from part (b) to construct a statistic to test the research hypothesis that
Brand B wires have larger mean resistivity. State the null and alternative, the rejection region, and
your decision using α = .05. Show work.
(d) (10) Use the standard error from part (b) to compute a 90% confidence interval for µ1 − µ2 . Test the
hypothesis in part (c) using this interval, stating the α-level of the test.
2
4. An experiment was conducted to determine the relationship between the amount of warping in thin strips,
(y), made from a particular alloy and temperature, (x) . A simple linear regression model of the form
y = β0 + β1 x + was first fitted to the data.
Temperature
(◦ C ) (x)
Amount of
Warping (y)
ȳi.
15
11, 11, 14
12
20
14, 12, 13
13
25
14, 12, 16
14
30
18, 14, 16
16
35
21, 19, 17
19
40
24, 20, 22
22
45
24, 23, 28
25
50
31, 30, 32
31
The following statistics have also been computed from the above data:
Sxx = 3150.0
Syy = 980.0
Sxy = 1650.0
(a) (10) Complete the analysis of variance table for the regression below. Use the F-ratio from the anova
table to test H0 : β1 = 0 vs. Ha : β1 6= 0, at α = .05. Show work and state your conclusion.
Source
d.f.
SS
MS
F
Regression
Error
Total
23
(b) (10) Compute the t-statistic to test the hypothesis that H0 : β1 = 0 vs. Ha : β1 6= 0. Test this
hypothesis using α = .05. State the rejection region and your decision clearly. Show work.
3
(c) (10) Perform the lack of fit test by completing the following table and state your conclusion. Use
α = .05.
d.f.
Source
SS
MS
F
Lack of Fit
Pure Error
Total Error
(d) (10) Based on the above analysis and examining relevant residual plots, it was decided to fit
the polynomial model y = β0 + β1 x + β2 x2 + to this data. Using the JMP output of this
fit given below, test the hypothesis that the squared term is necessary in this model. Use
α = .05. State the null and alternative hypotheses, circle the test statistic and the
p-value, and state your conclusion.
Term
Intercept
x
x*x
Estimate
14.309524
-0.342857
0.0133333
Std Error t-Ratio
2.965769
4.82
0.197591
-1.74
0.003005
4.44
Prob>|t|
<.0001
0.0974
0.0002
5. In a laboratory experiment, the weight loss in milligrams of a certain machine part due to friction when
specimens were used in a wear-tester with four different lubricants SP51, L43, Z907, and Blu-Tek, were
measured. Lower the weight loss, the better the performance of the lubricant. The lubricants were assigned
to the test runs completely at random. The model yij = µi + ij , i = 1, . . . , 4; j = 1, . . . , 8 with ij ∼
N (0, σ2 ), was used to analyze the resulting data:
i
1
2
3
4
Lubricant
SP51
L43
Z907
Blu-Tek
12
8
10
8
11
10
5
5
Weight Loss (in milligrams)
7
10
9
10
5
7
10
9
7
8
9
6
8
7
5
6
12
7
7
6
7
8
6
7
ȳi.
9.75
8.0
7.25
6.5
(a) (10) Complete the analysis of variance table below by filling in the blanks. State the null and
alternative hypotheses you test using the F-ratio. Perform this test using α = .05.
Source
Lubricant
d.f.
SS
46.5
MS
Error
Total
123.5
4
F
(b) (10) Sp51 and L43 are petroleum distillate based lubricants while Z907 and Blu-Tek are synthetic
oil based lubricants. Is the average performance of the petroleum distillate based lubricants same as
the average performance of the synthetic oil based lubricants? To answer this question, calculate a
t-statistic to test the hypothesis H0 : (µ1 + µ2 )/2 = (µ3 + µ4 )/2 vs. Ha : (µ1 + µ2 )/2 6= (µ3 + µ4 )/2
State your conclusion using α = .05?
(c) (10) Sp51 and Z907 contain a standard silicon additive while L43 and Blu-Tek contain a new hi-tech
detergent additive. Is the new additive result in better performance in the detergents than when
using the standard additive? To answer this question, calculate a t-statistic to test the hypothesis
H0 : (µ1 + µ3 )/2 ≤ (µ2 + µ4 )/2 vs. Ha : (µ1 + µ3 )/2 > (µ2 + µ4 )/2
State your conclusion using α = .05?
(d) (10) Compute the LSD value at α = .05. Use the LSD multiple comparison procedure to compare the
4 lubricant means using the underlining method. Summarize the results of this procedure in words.
5
6. The data below, extracted from Consumer Reports, comprise of gasoline mileage, y, measured as gallons
per 100 miles (GP100M), and six aspects of automobile design and performance for 36 automobiles in a
certain model year. It is desired to develop a regression model to predict gasoline mileage of a vehicle using
6 explanatory variables measured: CYL=Number of Cylinders, DISP= Engine Size in cubic inches, HP=
Horsepower, DRAT=Drive Ratio, WT=Weight in lbs., ACCEL= Acceleration in sec.
Automobile
Ford Country Sq.
Chevy Malibu Wag
Chevette
Toyota Corona
Datsun 510
Dodge Omni
Audi 5000
Volvo 240 GL
Saab 99 GLE
Peugeot 694 SL
Buick Century Sp
Mercury Zephyr
Dodge Aspen
AMC Concord D/L
Chevy Caprice Cl
Ford LTD
Mercury Grand Ma
Dodge St Regis
Ford Mustang 4
Ford Mustang Ghi
Mazda GLC
Dodge Colt
AMC Spirit
VW Scirocco
Honda Accord LX
Buick Skylark
Chevy Citation
Olds Omega
Pontiac Phoenix
Plymouth Horizon
Datsun 210
Fiat Strada
VW Dasher
Datsun 810
BMW 320i
VW Rabbit
CYL
8
8
4
4
4
4
5
6
4
6
6
6
6
6
8
8
8
8
4
6
4
4
4
4
4
4
6
6
4
4
4
4
4
6
4
4
DISP
351
267
98
134
119
105
131
163
121
163
231
200
225
258
305
302
351
318
140
171
86
98
121
89
98
151
173
173
151
105
85
91
97
146
121
89
HP
142
125
68
95
97
75
103
125
115
133
105
85
110
120
130
129
138
135
88
109
65
80
80
71
68
90
115
115
90
70
65
69
78
97
110
71
DRAT
2.26
2.56
3.7
3.05
3.54
3.37
3.9
3.5
3.77
3.58
2.73
3.08
2.71
2.73
2.41
2.26
2.26
2.45
3.08
3.08
3.73
2.97
3.08
3.78
3.05
2.53
2.69
2.84
2.69
3.37
3.7
3.1
3.7
3.7
3.64
3.78
WT
4.054
3.605
2.155
2.56
2.3
2.23
2.83
3.14
2.795
3.41
3.38
3.07
3.62
3.41
3.84
3.725
3.955
3.83
2.585
2.91
1.975
1.915
2.67
1.99
2.135
2.67
2.595
2.7
2.556
2.2
2.02
2.13
2.19
2.815
2.6
1.925
ACCEL
14.3
15
16.5
14.2
14.7
14.5
15.9
13.6
15.7
15.8
15.8
16.7
18.7
15.1
15.4
13.4
13.2
15.2
14.4
16.6
15.2
14.4
15
14.9
16.6
16
11.3
12.9
13.2
13.2
19.2
14.7
14.1
14.5
12.8
14
GP100M
6.45
5.21
3.33
3.64
3.68
3.24
4.93
5.88
4.63
6.17
4.85
4.81
5.38
5.52
5.88
5.68
6.06
5.49
3.77
4.57
2.93
2.85
3.65
3.17
3.39
3.52
3.47
3.73
2.99
2.92
3.14
2.68
3.28
4.55
4.65
3.13
A JMP program was used to fit the model
GP 100M = β0 + β1 CYL + β2 DISP + β3 HP + β4 DRAT + β5 WT + β6 ACCEL + .
Using the JMP output attached, extract or calculate answers for the following:
(a) (10) Some statistics in the JMP output indicates that β coefficients in the full model are not accurately
estimated? What are these? Explain. What statistic in the JMP output indicates that the problem
may be caused by multicollinearity?
6
(b) (10) Part of your JMP output contains statistics computed for the reduced model GP 100M = β0 +
β1 CYL + β3 HP + β4 DRAT + β6 ACCEL + fitted to the above data. Calculate a 95% confidence interval
for β1 using information in this output. Argue why this model is better than the full model.
(c) (10) Using the statistics computed for both the full and reduced models, perform an F-test of H0 :
β2 = β5 = 0 vs. Ha : β2 and/or β5 6= 0 in model 1. Use α = .05
(d) (10) From the residual diagnostics of the 4-variable model fit, select one case each that you might
select as a possible x-outlier, a possible y-outlier, or an influential observation. Give a reason for each
of your choices.
Possible
x-outlier
Case No.
Reason you selected this case
y-outlier
Inluential Case
7
THIS PAGE IS INTENTIONALLY LEFT BLANK
1
Fit Full Model: GP100M= β 0 + β1CYL + β 2 DISP + β 3 HP + β 4 DRAT + β 5WT + β 6 ACCEL + ε
Summary of Fit
RSquare
RSquare Adj
Root Mean Square Error
Mean of Response
Observations (or Sum Wgts)
0.957794
0.949062
0.25755
4.256687
36
Analysis of Variance
Source
DF Sum of Squares
Model
6
43.653642
Error
29
1.923631
C. Total
35
45.577272
Parameter Estimates
Term
Estimate
Intercept
-4.629662
CYL
-0.002149
DISP
0.001849
HP
0.0128752
DRAT
0.8402421
WT
1.4406212
ACCEL
0.0449719
Mean Square
7.27561
0.06633
Std Error
0.691698
0.086783
0.002848
0.005284
0.165975
0.319137
0.03673
t Ratio
-6.69
-0.02
0.65
2.44
5.06
4.51
1.22
F Ratio
109.6846
Prob > F
<.0001*
Prob>|t|
<.0001*
0.9804
0.5213
0.0212*
<.0001*
<.0001*
0.2307
VIF
.
9.1682468
27.584196
8.5964282
3.8791678
22.881076
1.7623069
Fit Reduced Model: GP100M= β 0 + β1CYL + β 3 HP + β 4 DRAT + β 6 ACCEL + ε
Summary of Fit
RSquare
RSquare Adj
Root Mean Square Error
Mean of Response
Observations (or Sum Wgts)
0.899895
0.886978
0.383638
4.256687
36
Analysis of Variance
Source
DF Sum of Squares
Model
4
41.014747
Error
31
4.562525
C. Total
35
45.577272
Parameter Estimates
Term
Estimate
Intercept
-4.576246
CYL
0.2502098
HP
0.0366092
DRAT
0.4349388
ACCEL
0.1704049
Std Error
0.985124
0.090037
0.005121
0.17165
0.043532
Mean Square
10.2537
0.1472
t Ratio
-4.65
2.78
7.15
2.53
3.91
F Ratio
69.6685
Prob > F
<.0001*
Prob>|t|
<.0001*
0.0092*
<.0001*
0.0166*
0.0005*
VIF
.
4.4477831
3.6394897
1.8699163
1.1156751
Case Automobile
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
Ford Ctry Sq. W
Chevy Malibu W
Chevette
Toyota Corona
Datsun 510
Dodge Omni
Audi 5000
Volvo 240 GL
Saab 99 GLE
Peugeot 694 SL
Buick Century Sp..
Mercury Zephyr
Dodge Aspen
AMC Concord D/L
Chevy Caprice C
Ford LTD
Mercury Grnd Marq
Dodge St Regis
Ford Mustang 4
Ford Mustang Ghia
Mazda GLC
Dodge Colt
AMC Spirit
VW Scirocco
Honda Accord LX
Buick Skylark
Chevy Citation
Olds Omega
Pontiac Phoenix
Plymouth Horizon
Datsun 210
Fiat Strada
VW Dasher
Datsun 810
BMW 320i
VW Rabbit
CYL DISP HP DRAT
8
8
4
4
4
4
5
6
4
6
6
6
6
6
8
8
8
8
4
6
4
4
4
4
4
4
6
6
4
4
4
4
4
6
4
4
351
267
98
134
119
105
131
163
121
163
231
200
225
258
305
302
351
318
140
171
86
98
121
89
98
151
173
173
151
105
85
91
97
146
121
89
142
125
68
95
97
75
103
125
115
133
105
85
110
120
130
129
138
135
88
109
65
80
80
71
68
90
115
115
90
70
65
69
78
97
110
71
2.26
2.56
3.7
3.05
3.54
3.37
3.9
3.5
3.77
3.58
2.73
3.08
2.71
2.73
2.41
2.26
2.26
2.45
3.08
3.08
3.73
2.97
3.08
3.78
3.05
2.53
2.69
2.84
2.69
3.37
3.7
3.1
3.7
3.7
3.64
3.78
WT ACCEL GP100M Residual
4.054
3.605
2.155
2.56
2.3
2.23
2.83
3.14
2.795
3.41
3.38
3.07
3.62
3.41
3.84
3.725
3.955
3.83
2.585
2.91
1.975
1.915
2.67
1.99
2.135
2.67
2.595
2.7
2.556
2.2
2.02
2.13
2.19
2.815
2.6
1.925
14.3
15
16.5
14.2
14.7
14.5
15.9
13.6
15.7
15.8
15.8
16.7
18.7
15.1
15.4
13.4
13.2
15.2
14.4
16.6
15.2
14.4
15
14.9
16.6
16
11.3
12.9
13.2
13.2
19.2
14.7
14.1
14.5
12.8
14
6.45
5.21
3.33
3.64
3.68
3.24
4.93
5.88
4.63
6.17
4.85
4.81
5.38
5.52
5.88
5.68
6.06
5.49
3.77
4.57
2.93
2.85
3.65
3.17
3.39
3.52
3.47
3.73
2.99
2.92
3.14
2.68
3.28
4.55
4.65
3.13
0.4079
-0.4628
-0.0016
-0.0124
-0.3439
0.1293
0.0749
0.5414
-0.3201
0.1293
0.2056
0.5855
0.0591
0.4462
0.0253
0.2674
0.3508
-0.5289
0.3339
-0.5175
-0.0841
-0.2499
0.4006
-0.0323
0.3205
-0.0252
-0.7584
-0.8372
-0.1537
0.2216
-0.5406
-0.1229
-0.0134
-0.0108
0.4352
0.0812
Stud.
Resid
1.1458
-1.3013
-0.0045
-0.0341
-0.9376
0.3483
0.2113
1.5304
-0.9825
0.3844
0.5523
1.6503
0.1784
1.2074
0.0707
0.7616
0.9937
-1.4744
0.9047
-1.4023
-0.2339
-0.6821
1.0823
-0.0890
0.8946
-0.0757
-2.1899
-2.2745
-0.4410
0.6210
-1.6443
-0.3375
-0.0366
-0.0312
1.2764
0.2268
hats Cook's
D
0.1389 0.042
0.1407 0.055
0.1067 0.000
0.0996 0.000
0.0862 0.017
0.0630 0.002
0.1470 0.002
0.1497 0.082
0.2788 0.075
0.2311 0.009
0.0584 0.004
0.1448 0.092
0.2558 0.002
0.0719 0.023
0.1323 0.000
0.1624 0.022
0.1532 0.036
0.1255 0.062
0.0744 0.013
0.0745 0.032
0.1211 0.002
0.0878 0.009
0.0690 0.017
0.1019 0.000
0.1277 0.023
0.2479 0.000
0.1851 0.218
0.0795 0.089
0.1748 0.008
0.1343 0.012
0.2656 0.196
0.0984 0.002
0.0864 0.000
0.1868 0.000
0.2101 0.087
0.1287 0.002
Download