Chapter Twelve

advertisement
Chapter 12
Analysis of Variance
12-2 One-Way ANOVA
1. a. One-way analysis of variance is appropriate for these data because they represent three or
more populations categorized by a single characteristic that distinguishes the populations
from each other. The distinguishing characteristic in this case is epoch.
b. One-way analysis of variance tests the equality of two or more population means by
analyzing sample variances. It finds a difference in the population means if the variance
between the sample means is larger than can be expected considering the variance within the
samples.
3. We should reject the hypothesis that the three epochs have the same mean skull breadth. There
is sufficient evidence to conclude that at least one of the means is different from the others.
NOTE: When testing the hypothesis that three or more groups have the same mean, the test statistic
is F, where F is the ratio is the ratio of the variance between the groups to the variance within the
groups as defined in the text. This manual generally uses the generic notation F = s 2B /s 2p . As in
previous chapters, the superscripts and subscripts (the numerator df and denominator df) may be
used to identify which F distribution to look up in the tables.
5. Ho: μ1 = μ2 = μ3
H1: at least one μi is different
α = 0.05 and dfnum = 2, dfden = 33
C.V. F = Fα = F0.05 = 3.3158
calculations:
F = s 2B /s 2p
0.05
= 669.0011/70.6481
F
1
3.3158
= 9.4695 [TI-83/84+]
P-value = P( F332 >9.4695) = 0.0006 [TI-83/84+]
conclusion:
Reject Ho; there is sufficient evidence to reject the claim that μ1 = μ2 = μ3. There is
sufficient evidence to reject the claim that the three books have the same mean Flesch
Reading Ease score.
7. Ho: μ1 = μ2 = μ3 = μ4
H1: at least one μi is different
α = 0.05 and dfnum = 3, dfden = 156
C.V. F = Fα = F0.05 = 2.6626 [Excel]
calculations:
F = s 2B /s 2p
= 11.99995/34.08497
= 0.35206 [Excel]
3
P-value = P( F156
>0.3521) = 0.7877 [Excel]
2
33
0.05
1
2.6626
3
F 156
264
CHAPTER 12 Analysis of Variance
conclusion:
Do not reject Ho; there is not sufficient evidence to reject the claim that μ1 = μ2 = μ3 = μ4.
There is not sufficient evidence to reject the claim that the mean weight loss is the same for
all four diets. Given the four mean losses that range from 2.1 lbs to 3.3 lbs, it appears that
one year of following the diet does not result in a weight loss worth the effort. The effort
may be justified, however, when other potential benefits are considered – viz., maintaining
one’s weight (i.e., not gaining weight), enjoying a healthy lifestyle, etc.
9. Ho: μ1 = μ2 = μ3
H1: at least one μi is different
α = 0.05 and dfnum = 2, dfden = 32
C.V. F = Fα = F0.05 = 3.3158
calculations:
F = s 2B /s 2p
0.05
= 0.42400.933/7981.129
= 5.313 [SPSS]
F
1
3.3158
2
P-value = P( F32 >5.313) = 0.010 [SPSS]
conclusion:
Reject Ho; there is sufficient evidence to reject the claim that μ1 = μ2 = μ3. There is
sufficient evidence to reject the claim that PG, PG-13 and R movies have the same gross
amount.
2
32
NOTE FOR THE REMAINING EXERCISES IN THIS SECTION: This section is calculation-oriented.
Do not get so involved with the formulas that you miss the concepts. This manual arranges the
calculations to promote both computational efficiency and understanding of the underlying
principles. The following notation is used.
k = the number of groups
ni = the number of scores in group i (where i = 1,2,…,k)
x i = the mean of group i
s i2 = the variance of group i
x = the overall mean of all the scores in all the groups
= (Σni x i )/Σni = the (weighted) mean of the group means
= Σ x i /k = simplified form when each group has equal size n
2
B
s = the variance between the groups
= Σni( x i – x )2/(k-1)
= n Σ( x i – x )2/(k-1) = ns 2x = simplified form when each group has equal size n
s 2p = the variance within the groups
= (Σdfi s i2 )/Σdfi
= Σ s i2 /k = simplified form when each group has equal size n
numerator df = k-1
denominator df = Σdfi
= k(n-1) = simplified form when each group has equal size n
2 2
F = s B /s p = (variance between groups)/(variance within groups)
P-value = Fcdf(F, 99, numerator df, denominator df) from the TI-83/84+ calculator
One-Way ANOVA SECTION 12-2
265
As a crude check against errors, and to help get a feeling for the problem, always verify that the
following “overall” values are realistic in that
x is a value between the lowest and the highest of the x i values.
s 2p is a value between the lowest and the highest of the s i2 values.
11. Since each group has equal size n=10, use the simplified form of the calculations.
The following preliminary values are identified.
n
Σx
Σx2
x
s2
k=3
small
10
4315
1918833
431.5
6323.389
medium
10
3906
1620816
390.6
10570.267
x = Σxi /k
n = 10
s 2x = Σ(x i -x) 2 /(k-1)
= 630.723
large
10
3858
1764878
385.8
30717.956
= 402.633
s = Σsi /k
2
p
.
2
= 15870.537
Ho: μ1 = μ2 = μ3
H1: at least one μi is different
α = 0.05 and dfnum = 2, dfden = 27
C.V. F = Fα = F0.05 = 3.3541
0.05
calculations:
2 2
F = ns x /s p
F
1
3.3541
= 10(630.723)/15870.537
= 0.3974
P-value = Fcdf(0.3974,99,2,27) = 0.6759
conclusion:
Do not reject Ho; there is not sufficient evidence to reject the claim that μ1 = μ2 = μ3.
There is not sufficient evidence to reject the claim that the different car categories have the
same mean head injury values. No, these data do not suggest that larger cars are safer.
2
27
NOTE: Since subtracting the same value from each will not affect the differences between the
group means or any variances in the problem, that strategy may be used to reduce the
mathematical magnitude of each of the following problems. In Exercise 13, for example, one
could subtract 3 minutes from each time and deal only with the excess seconds over 3 minutes as
follows.
mile 1: 15 24 23 22 21
mile 2: 19 22 21 17 19
mile 3: 34 31 29 31 29
266
CHAPTER 12 Analysis of Variance
13. Since each group has equal size n=5, use the simplified form of the calculations.
The following preliminary values are identified (all measurements in seconds).
mile 1
5
1005
202055
201.0
12.50
n
Σx
Σx2
x
s2
k=3
mile 2
5
998
199216
199.6
3.80
x = Σxi /k
n=5
= 203.8
s = Σsi /k
s = Σ(x i -x) /(k-1)
2
x
mile 3
5
1054
222200
210.8
4.20
2
p
2
= 37.24
.
2
= 6.8333
Ho: μ1 = μ2 = μ3
H1: at least one μi is different
α = 0.05 and dfnum = 2, dfden = 12
0.05
C.V. F = Fα = F0.05 = 3.8853
F
1
3.8853
calculations:
2 2
F = ns x /s p
= 5(37.24)/6.8333
= 27.2488
P-value = Fcdf(27.2488,99,2,12) = 3.45E-5 = 0.00003
conclusion:
Reject Ho; there is sufficient evidence to reject the claim that μ1 = μ2 = μ3. There is
sufficient evidence to reject the claim that it takes the same time to ride each of the miles.
Yes; the data suggest that there may be a hill on mile 3.
2
12
15. The Minitab output is as follows
Level
king
menthol
filter
N
25
25
25
Mean
1.2560
0.8720
0.9160
StDev
0.2329
0.2424
0.2478
Source
Factor
Error
Total
DF
2
72
74
SS
2.2083
4.1856
6.3939
MS
1.1041
0.0581
F
18.99
P
0.000
Ho: μ1 = μ2 = μ3
H1: at least one μi is different
α = 0.05 and dfnum = 2, dfden = 72
C.V. F = Fα = F0.05 = 3.1504
calculations:
F = s 2B /s 2p
= 1.1041/0.0581
0.05
= 18.99 [Minitab]
F
1
3.1504
P-value = P( F722 >18.99) = 0.000 [Minitab]
conclusion:
Reject Ho; there is sufficient evidence to reject the claim that μ1 = μ2 = μ3. There is
sufficient evidence to reject the claim that the three types of cigarettes yield the mean
amount of nicotine. Yes; given that only the king size cigarettes are not filtered, it appears
(although not formally justifiable by the ANOVA) that the filters do make a difference.
2
72
17. The Tukey results indicate that the only significant difference (P-value = 0.021) is between μ1
the mean for the small cars) and μ3 (the mean for the large cars). The conclusion is the same as
that obtained with the Bonferroni results. The slightly larger Bonferroni P-value =0.024
indicates that, at least in this case, the Bonferroni method is more conservative (i.e., requires a
larger difference before declaring significance).
Two-Way ANOVA SECTION 12-3
267
12-3 Two-Way ANOVA
1. Two-way analysis of variance is appropriate for these data because they are categorized by two
characteristics that distinguish groups of data from each other. The distinguishing
characteristics in this case are gender and age bracket.
3. Because each cell contains the same number of observations (viz., 5), this is a balanced design.
5. Ho: there is no gender-age interaction
H1: there is gender-age interaction
α = 0.05 and dfnum = 1, dfden = 60
C.V. F = Fα = F0.05 = 4.0012
calculations:
F = MSInt/MSErr
0.05
= 1.995/8.275
= 0.24 [Minitab]
F
1
4.0012
P-value = P( F601 >0.24) = 0.625 [Minitab]
conclusion:
Do not reject Ho; there is not sufficient evidence to reject the claim that there is no
interaction between gender and age bracket. There is not sufficient evidence to reject the
claim that heights are not affected by an interaction between a person’s gender and that
person’s age bracket.
1
60
7. Ho: μ1 = μ2 [there is no age effect]
H1: at least one age μi is different
α = 0.05 and dfnum = 1, dfden = 36
C.V. F = Fα = F0.05 = 4.0012
calculations:
F = MSAge/MSErr
0.05
= 24.379/8.275
= 2.95 [Minitab]
F
1
4.0012
P-value = P( F601 >2.95) = 0.625 [Minitab]
conclusion:
Do not reject Ho; there is not sufficient evidence to support the claim that age bracket
has an effect on height.
1
60
9. Ho: μ1 = μ2 [there is no type effect]
H1: at least one type μi is different
α = 0.05 and dfnum = 1, dfden = 12
C.V. F = Fα = F0.05 = 4.7472
calculations:
F = MSType/MSErr
0.05
= 34060.5/15133.6
= 2.25 [Minitab]
F
1
4.7472
P-value = P( F121 >2.25) = 0.159 [Minitab]
conclusion:
Do not reject Ho; there is not sufficient evidence to support the claim that type has an effect
on head injury measurements. There is not enough evidence to support the claim that the
type of car (foreign or domestic) has an effect on head injury measurements.
1
12
268
CHAPTER 12 Analysis of Variance
NOTE: In exercises 11-13 the n=12 values in each of the six cells corresponds to df=11 for each of
the six cells and df = 6(11) for the pooled estimate of the mean square error.
11. Ho: there is no subject-target interaction
H1: there is subject-target interaction
α = 0.05 and dfnum = 2, dfden = 66
C.V. F = Fα = F0.05 = 3.1359 [Statdisk]
calculations:
F = MSInt/MSErr
0.05
= 3.375/[not given]
= 3.73 [Statdisk]
F
1
3.1359
2
P-value = P( F66 >3.73) = 0.0291 [Statdisk]
conclusion:
Reject Ho; there is sufficient evidence to reject the claim that there is no subject-target
interaction. There is sufficient evidence to reject the claim that perceived self-esteem in
other target people is unaffected by an interaction between the subject self-esteem and the
target’s self-esteem.
2
66
13. Ho: μ1 = μ2 = μ3 [there is no subject effect]
H1: at least one size μi is different
α = 0.05 and dfnum = 2, dfden = 66
C.V. F = Fα = F0.05 = 3.1359 [Statdisk]
calculations:
F = MSSubject/MSErr
0.05
= 1.4306/[not given]
F
1
3.1359
= 1.5824 [Statdisk]
2
P-value = P( F66 >1.5824) = 0.2132 [Statdisk]]
conclusion:
Do not reject Ho; there is not sufficient evidence to support the claim that there is a subject
effect. There is not sufficient evidence to support the claim that the perceived self-esteem
in other target people is affected by the subject’s own measure (low, medium or high) of
self-esteem.
2
66
NOTE FOR HAND CALCULATIONS: The formulas and principles in this section are logical
extensions of those in the previous section. In particular,
SSRow = Σni (x i - x) 2 for i = 1,2,3… [for each row]
SSCol = Σnj (x j - x)2 for j = 1,2,3… [for each column]
SSTot = Σ (x - x)2 for all the x’s
When there is only one observation per cell, the unexplained variation is
SSErr = SSTot – SSRow – SSCol, and there is not enough data to measure interaction.
When there is more than one observation per cell, the unexplained variation (i.e., the failure of
experimental units in the same cell to respond the same) is
SSErr = Σ (x - xij )2 = Σdfij sij2 [for each cell – i.e., for each i,j (row,col) combination]
and the interaction sum of squares is
SSInt = SSTot – SSRow – SSCol – SSErr.
Two-Way ANOVA SECTION 12-3
269
15. There are n=3 observations in each of the indicated whey-supplement cells.
WHEY
10%
20%
Σx = 13.9
Σx = 14.1
Σx2 = 64.45
Σx2 = 66.33
x = 4.6333
x = 4.7000
s2 = 0.0233
s2 = 0.0300
30%
Σx = 14.4
Σx2 = 69.26
x = 4.8000
s2 = 0.0700
E
Σx = 9.6
M
Σx2 = 30.74
E yes x = 3.2000
N
s2 = 0.0100
Σx = 11.1
Σx2 = 41.09
x = 3.7000
s2 = 0.0100
Σx = 15.1
Σx2 = 76.13
x = 5.0333
s2 = 0.0633
Σx = 16.3
Σx2 = 88.61
x = 5.4333
s2 = 0.0233
x = 3.8000
x = 4.1667
x = 4.8667
x = 5.1167
S
U
P
P
L
T
0%
Σx = 13.2
Σx2 = 58.10
no x = 4.4000
s2 = 0.0100
x = 4.6333
x = 4.3417
Σx = 107.7 x = 4.4875
Σx2 = 494.71 s2 = 0.4959
SSSupp = Σni (x i -x) 2 = 12(4.6333-4.4875)2 + 12(4.3417-4.4875)2 = 0.51041667
SSWhey = Σnj (x j -x)2 = 6(3.8-4.4875)2 + 6(4.1667-4.4875)2 + 6(4.8667-4.4875)2 + 6(6.1167-4.4875)2 =
SSTot = Σ (x -x)2 = df∙s2 = 23∙(0.4959) = 11.40625
2
SSErr = Σ (x -xij ) =
Σdfij ij2 =
s
6.69125
2(0.0100) + 2(0.0233) + 2(0.0300) + 2(0.0700)
2(0.0100) + 2(0.0100) + 2(0.0633) + 2(0.0233) = 0.4800
SSInt = SSTot – SSSupp – SSWhey – SSErr = 11.40625 – 0.51041 – 6.69125 – 0.48000 = 3.72459
This is the ANOVA table for the tests of hypotheses in this exercise.
Source
Supplement
Whey
Interaction
Error
Total
df
1
3
3
16
23
SS
0.51041
6.69125
3.72459
0.48000
11.40625
MS
0.51041
2.23042
1.24153
0.03000
F
17.0137
74.3472
41.3843
Ho: there is no whey-supplement interaction
H1: there is whey-supplement interaction
α = 0.05 [assumed] and dfnum = 3, dfden = 16
C.V. F = Fα = F0.05 = 3.2389
calculations:
F = MSInt/MSErr
0.05
= 1.24153/0.03000
= 41.3843
F
1
3.2389
P-value = Fcdf(41.3843,99,3,16)
= 9.11E-8 = 0.00000009
conclusion:
Reject Ho; there is sufficient evidence to reject the claim that there is no interaction
between the whey and the supplement. There is sufficient evidence to conclude that the
ratings are affected by an interaction between the use of the supplement and the amount of
whey.
Because there is interaction, further testing for whey or supplement effects are not appropriate.
The presence of interaction means that the effect of the supplement, for example, depends
upon the amount of whey and cannot be determined in a single test that combines the different
whey amounts.
3
16
270
CHAPTER 12 Analysis of Variance
17. The table and preliminary calculations are as follows.
WHEY
S
0%
U no 4.4
P yes 3.3
P
x = 3.85
10%
20%
4.5
5.0
x = 4.75
4.6
3.8
x = 4.20
2
30%
4.6
5.4
x = 5.00
x = 4.525
x = 4.375
Σx = 35.6
Σx2 = 161.42
x = 4.45
s2 = 0.4286
SSSupp = Σni (x i -x) = 4(4.525-4.45) + 4(4.375-4.45) = 0.0450
2
2
SSWhey = Σnj (x j -x)2 = 2(3.85-4.45)2 + 2(4.20-4.45)2 + 2(4.75-4.45)2 + 2(5.00-4.45)2 = 1.6300
SSTot = Σ (x -x)2 = df∙s2 = 7∙(0.4286) = 3.000
SSErr = SSTot – SSSupp – SSWhey = 3.0000 – 0.0450 – 1.6300 = 1.3250
This is the ANOVA table for the tests of hypotheses in this exercise.
Source
Supplement
Whey
Error
Total
df
1
3
3
7
SS
0.0450
1.6300
1.3250
3.0000
MS
0.04500
0.54333
0.44167
F
0.1019
1.2302
Ho: μ1 = μ2 [there is no supplement effect]
H1: at least one supplement μi is different
α = 0.05 [assumed] and dfnum = 1, dfden = 3
C.V. F = Fα = F0.05 = 10.128
calculations:
F = MSSupp/MSErr
0.05
= 0.04500/0.44167
= 0.1019
F
1
10.128
P-value = Fcdf(0.1019,99,1,3) = 0.7684
conclusion:
Do not reject Ho; there is not sufficient evidence to support the claim that the supplement
has an effect on the rating. There is not enough evidence to support the claim that use of
the supplement affects the ratings of the pancakes.
1
3
Ho: μ1 = μ2 = μ3 = μ4 [there is no whey effect]
H1: at least one age μi is different
α = 0.05 [assumed] and dfnum = 3, dfden = 3
C.V. F = Fα = F0.05 = 9.2766
calculations:
F = MSWhey/MSErr
0.05
= 0.54333/0.44167
= 1.2302
F
1
9.2766
P-value = Fcdf(1.2302,99,3,3) = 0.4327
conclusion:
Do not reject Ho; there is not sufficient evidence to support the claim that whey has an
effect on the rating. There is not enough evidence to support the claim that the amount of
whey used affects the rating of the pancakes.
3
3
Statistical Literacy and Critical Thinking
271
Statistical Literacy and Critical Thinking
1. Since two of the three samples are from the same power source, the three sets of sample data
are not independent. Since one-way analysis of variance requires independent sets of sample
data, the one-way analysis of variance should not be used. Even if the three data sets came
from three independent power sources, if all the measurements were taken at precisely the
same time the data would consist of matched triples and the one-way analysis of variance
(since it ignores the time characteristic) would not be the optimal technique to use.
2. One-way analysis of variance is used when the data are categorized according to one factor,
while two-way analysis of variance is used when the data are categorized according to two
factors.
3. While the factors used to categorize the data may originally be measured at any level, the data
to be analyzed must be quantitative data for which meaningful means and standard deviations
can be calculated. MPAA rating is not a quantitative variable. Running time could be
analyzed using the factors of budget bracket and MPAA rating, but MPAA rating could not be
analyzed using the factors of budget bracket and running time bracket.
4. No. Samples must be representative of the population to which the inference is to be made. If
only less expensive cars are used in the crash tests, then the results may be used to make
inferences only about the population of less expensive cars and not the population of all cars.
Chapter Quick Quiz
1. The one-way analysis of variance is used to test the null hypothesis that three or more samples
are from populations with equal means.
2. One-way analysis of variance tests are right tailed. The test statistic is the ratio of the variance
between groups to the variance within groups, and we reject the hypothesis that the groups
have equal means in favor of them having means that are not all equal only if the variance
between the groups is significantly larger than the variance between groups.
3. In one-way analysis of variance tests, the P-value is the probability of chance alone producing
a test statistic as large as or larger than the calculated test statistic. The larger the calculated
test statistic, therefore, the smaller the P-value.
4. The value of test statistic is F = 8.98.
5. The null hypothesis is that the three books (by Rowling, Clancy and Tolstoy) have the same
mean grade level reading scores. Since P-value = 0.001 < 0.05, we reject the null hypothesis
and conclude that the three books do not have the same mean grade level reading score.
6. The final conclusion is that at least one of the three books has a mean grade level reading score
that is different from the others.
7. One-way analysis of variance is used when data are categorized according to one factor, while
two-way analysis of variance is used when data are categorized according to two factors.
272
CHAPTER 12 Analysis of Variance
8. There is not sufficient evidence to reject the claim that a student’s estimate is unaffected by
interaction between that student’s gender and that student’s major.
9. No. Since P-value = 0.395 > 0.05 for testing for a gender effect, conclude that there is not
sufficient evidence to support a claim that students’ estimates of length are affected by their
genders.
10. No. Since P-value = 0.876 > 0.05 for testing for a major effect, conclude that there is not
sufficient evidence to support a claim that students’ estimates of length are affected by their
majors.
Review Exercises
1. a. The age data from the sample was categorized according to one variable, diet. Specifically,
it was noted whether the subject with each age in the age participated in the Atkins, Ornish,
Weight Watchers or Zones diet.
b. The data was analyzed using a one-way analysis of variance.
c. Since P-value = 0.41 > 0.05, we fail to reject the claim that the subjects come from
populations with the same mean age. It appears that the mean ages of the four treatment
groups are about the same.
d. A small P-value like 0.001 < 0.05 would lead to rejection of the claim that the subjects come
from populations with the same mean age. The conclusion would be that the four treatment
groups did not have the same mean age. If such were the case, it would not be possible to
declare whether any observed differences between the groups were due to the different
treatments or to the difference in ages.
2. Ho: μ1 = μ2 = μ3
H1: at least one μi is different
α = 0.05 and dfnum = 2, dfden = 29
C.V. F = Fα = F0.05 = 3.3277
calculations:
F = s 2B /s 2p
0.05
= 3083363/56373
F
1
3.3277
= 54.70 [Minitab]
2
P-value = P( F29 >54.70) = 0.000 [Minitab]
conclusion:
Reject Ho; there is sufficient evidence to reject the claim that μ1 = μ2 = μ3. There is
sufficient evidence to reject the claim that the different car categories (4 cylinder, 6
cylinder, 8 cylinder) have different weights. Yes, it does appear that cars with more
cylinders tend to weigh more – but a formal conclusion about such a relationship would
require additional statistical methodology.
2
29
Review Exercises
273
3. Ho: there is no type-size interaction
H1: there is type-size interaction
α = 0.05 and dfnum = 2, dfden = 12
C.V. F = Fα = F0.05 = 3.8853
calculations:
F = MSInt/MSErr
0.05
= 57385/155086
= 0.37 [Minitab]
F
1
3.8853
P-value = P( F122 >0.37) = 0.698 [Minitab]
conclusion:
Do not reject Ho; there is not sufficient evidence to reject the claim that there is no
interaction between type and size. There is not sufficient evidence to reject the claim that
left femur crash loads are unaffected by an interaction between a car’s type (foreign or
domestic) and its size (small, medium or large).
2
12
4. Ho: μ1 = μ2 [there is no type effect]
H1: at least one type μi is different
α = 0.05 and dfnum = 1, dfden = 12
C.V. F = Fα = F0.05 = 4.7472
calculations:
F = MSType/MSErr
0.05
= 282752/155086
= 1.82 [Minitab]
F
1
4.7472
1
P-value = P( F12 >1.82) = 0.202 [Minitab]
conclusion:
Do not reject Ho; there is not sufficient evidence to support the claim that type has an effect
on left femur crash loads. There is not enough evidence to support the claim that the
type of car (foreign or domestic) has an effect on left femur crash loads..
1
12
5. Ho: μ1 = μ2 = μ3 [there is no size effect]
H1: at least one size μi is different
α = 0.05 and dfnum = 2, dfden = 12
C.V. F = Fα = F0.05 = 3.8853
calculations:
F = MSSize/MSErr
= 74067/155086
0.05
= 0.48 [Minitab]
F
1
3.8853
P-value = P( F122 >0.48) = 0.632 [Minitab]
conclusion:
Do not reject Ho; there is not sufficient evidence to support the claim that size has an effect
on left femur crash loads. There is not sufficient evidence to support the claim that the
size of a car (small, medium or large) has an effect on left femur crash loads.
2
12
274
CHAPTER 12 Analysis of Variance
6. These are precisely the values from Data Set 4. The Minitab output is as follows.
Level
N
king
25
menthol 25
non-menth25
Mean
15.720
14.960
14.800
StDev
0.936
4.168
4.233
Source
Factor
Error
Total
DF
2
72
74
SS
12.1
868.0
880.1
MS
6.0
12.1
F
0.50
P
0.608
Ho: μ1 = μ2 = μ3
H1: at least one μi is different
α = 0.05 and dfnum = 2, dfden = 72
C.V. F = Fα = F0.05 = 3.1504
calculations:
F = s 2B /s 2p
0.05
= 6.0/12.1
= 0.50 [Minitab]
F
1
3.1504
P-value = P( F722 >0.50) = 0.608 [Minitab]
conclusion:
Do not reject Ho; there is not sufficient evidence to reject the claim that μ1 = μ2 = μ3. There
is not sufficient evidence to reject the claim that the three types of cigarettes yield the same
mean amount of carbon monoxide. No; it appears that filters do not make a difference in
the amount of carbon monoxide – but a formal test using only two groups (filtered and
non-filtered) would provide a better test for answering that question.
2
72
7. There are n=4 observations in each of the indicated gender-smoking cells.
GENDER
male
female
SMOKE?
yes
no
Σx = 394.8
Σx = 391.8
Σx2 =38967.44 Σx2 =38378.44
x = 98.700
x = 97.950
s2 = 0.22667
s2 = 0.54333
Σx = 393.9
Σx2 =38789.69
x = 98.475
s2 = 0.12917
Σx = 393.0
Σx2 =38613.34
x = 98.250
s2 = 0.36333
x = 98.5875
x = 98.1000
x = 98.3250
x = 98.3625
Σx = 1573.5
x = 98.34375
Σx2 = 154748.91 s2 = 0.334625
SSGen = Σni (x i -x) 2 = 8(98.3250-98.34375)2 + 8(98.3625-98.34375)2 = 0.005625
SSSmo = Σnj (x j -x)2 = 8(98.5875-98.34375)2 + 8(98.1000-98.34375)2 = 0.950625
SSTot = Σ (x -x)2 = df∙s2 = 15∙(0.334625) = 5.019375
SSErr = Σ (x -xij )2 = Σdfij sij2 = 3(0.22667) + 3(0.54333)
3(0.12917) + 3(0.36333) = 3.78750
SSInt = SSTot – SSSupp – SSWhey – SSErr = 5.019375 – 0.005625 – 0.950625 – 3.78750 = 0.275625
This is the ANOVA table for the tests of hypotheses in this exercise.
Source
Gender
Smoke
Interaction
Error
Total
df
1
1
1
12
15
SS
0.005625
0.950625
0.275625
3.787500
5.019375
MS
0.005625
0.950625
0.275625
0.315625
F
0.0178
3.0119
0.8733
Review Exercises
275
Ho: there is no gender-smoking interaction
H1: there is gender-smoking interaction
α = 0.05 and dfnum = 1, dfden = 12
C.V. F = Fα = F0.05 = 4.7472
calculations:
F = MSInt/MSErr
0.05
= 0.275625/0.315625
= 0.8733
F
1
4.7472
P-value = Fcdf(0.8733,99,1,12) = 0.3685
conclusion:
Do not reject Ho; there is not sufficient evidence to reject the claim that there is no
interaction between gender and smoking. There is not sufficient evidence to reject the
claim that a person’s body temperature is not affected by an interaction between gender
and smoking habits.
1
12
Ho: μ1 = μ2 [there is no gender effect]
H1: at least one μi is different
α = 0.05 and dfnum = 1, dfden = 12
C.V. F = Fα = F0.05 = 4.7472
calculations:
F = MSGen/MSErr
0.05
= 0.005625/0.315625
= 0.0178
F
1
4.7472
P-value = Fcdf(0.0178,99,1,12) = 0.8960
conclusion:
Do not reject Ho; there is not sufficient evidence to reject the claim that there is no gender
effect. There is not sufficient evidence to reject the claim that a person’s body temperature
is unaffected by that person’s gender.
1
12
Ho: μ1 = μ2 [there is no smoking effect]
H1: at least one μi is different
α = 0.05 and dfnum = 1, dfden = 12
C.V. F = Fα = F0.05 = 4.7472
calculations:
F = MSSmo/MSErr
0.05
= 0.950625/0.315625
= 3.0119
F
1
4.7472
P-value = Fcdf(3.0119,99,1,12) = 0.1082
conclusion:
Do not reject Ho; there is not sufficient evidence to reject the claim that there is no smoking
effect. There is not sufficient evidence to reject the claim that a person’s body temperature
is unaffected by that person’s smoking habits.
1
12
8. The following preliminary values are identified.
n
Σx
Σx2
x
s2
presidents
38
589
12627
15.500
94.5270
popes
24
315
5981
13.125
80.2880
monarchs
14
318
11722
22.714
346.0659
276
CHAPTER 12 Analysis of Variance
x = (Σni x i )/Σni = [38(15.500) + 24(13.125) + 14(22.714)]/[38+24+14] = 1222/76 = 16.079
Σni( x i – x )2 = 38(15.500-16.079)2 + 24(13.125-16.079)2 + 14(22.714-16.079)2 = 838.54
2
Σdfi s i = 37(94.5270) + 23(80.2880) + 13(346.0659) = 9842.98
s 2B = Σni( x i – x )2/(k-1) = 838.54/2 = 419.27
s 2p = (Σdfi s i2 )/Σdfi = 9842.98/73 = 134.84
Ho: μ1 = μ2 = μ3
H1: at least one μi is different
α = 0.05 [assumed] and dfnum = 2, dfden = 73
C.V. F = Fα = 3.1504
calculations:
F = s 2B /s 2p
0.05
= 419.27/134.84
F
1
3.1504
= 3.1095
P-value = Fcdf(3.1095,99,2,73) = 0.0506
conclusion:
Do not reject Ho; there is not sufficient evidence to reject the claim that μ1 = μ2 = μ3. There
is not sufficient evidence to reject the claim that the three groups have the same mean
longevity. No, the survival times for the three groups do not appear to differ significantly.
2
73
Cumulative Review Exercises
The following summary information applies to exercises 1-4.
n
Σx
Σx2
x
s2
presidents
38
589
12627
15.500
94.5270
popes
24
315
5981
13.125
80.2880
monarchs
14
318
11722
22.714
346.0659
1. a. The means for the three groups are also follows.
pres: 15.5 years
popes: 13.1 years
mon: 22.7 years
b. The standard deviations for the three groups are as follows.
pres: 94.5270 = 9.7 years popes: 80.2880 = 9.0 years mon: 346.0659 = 18.6 years
2. Let the presidents be group 1.
x1 -x 2 = 15.500 – 22.714 =
original claim: μ1 – μ2 ≠ 0 mg
Ho: μ1 – μ2 = 0 years
H1: μ1 – μ2 ≠ 0 years
α = 0.05 [assumed] and df = 13
C.V. t = ±tα/2 = t0.025 = ±2.160
calculations:
t x1 -x 2 = (x1 -x 2 - μ x1 -x 2 )/s x1 -x 2
= (-7.214 – 0)/ 94.5270/38 +346.0659/14
= -7.214/5.2160 = -1.383
P-value = 2∙tcdf(-99,-1.383,13) = 0.1899
0.025
0.025
-2.160
0
0
_ _
x 1- x2
2.160
t13
Cumulative Review Exercises
277
conclusion:
Do not reject Ho; there is not sufficient evidence to conclude that μ1 – μ2 ≠ 0. There
is not sufficient evidence to support a claim that there is a difference between the mean for
presidents and the mean for British monarchs.
3. Yes, the longevity times for presidents appear to come from a population that is approximately
normal. The histogram is approximately bell-shaped and the points on the normal probability
plot approximate a straight line. Because of the truncation of the times at 0, there is not the
opportunity for unusually low scores to symmetrically balance unusually high scores – and this
slight departure form normality is evident in both the histogram and the normal probability
plot.
U.S. Presidents
Normal Probability Plot
9
2.0
8
1.5
1.0
6
z score
Frequency
7
5
4
3
0
-0.5
-1.0
2
-1.5
1
0
0.5
-2.0
0
5
10
15
20
25
30
35
longevity following inauguration (years)
0
10
20
30
40
longevity following inauguration (years)
4. n = 39 and x = 15.000 and s = 94.5270 = 9.7224
σ unknown, n > 30: use t with df=37
α = 0.05, tdf, α/2 = t37,0.025 = 2.026
x  tα/2∙s/ n
15.500  2.026(9.7224)/ 38
15.500  3.195
12.3 < μ < 18.7 (years)
5. Listed in order, the n=9 values are: 96.5 97.3 97.6 97.6 98.0 98.2 98.3 98.4 98.9
Σx = 880.8
Σx2 = 86204.96
a. Temperatures are measurements at the interval level, because differences are meaningful but
ratios are not (since there is not a meaningful zero).
b. Body temperatures are continuous data. Even though they are typically reported to the
nearest tenth of a degree, they actually can take on any values on a continuum.
c. x = (Σx)/n = 880.8/9 = 97.8667, rounded to 97.87 °F
d. x = x5 = 98.0 °F
e. R = 98.9 – 96.5 = 2.4 °F
f. s2 = [n(Σx2) – (Σx)2]/[n(n-1)]
= [9(86204.96) – (880.8)2]/[9(8)] = 36.00/72 = 0.5000
rating
frequency
s = 0.5000 = 0.7071, rounded to 0.71 °F
1.0 – 1.9
1
2
2
g. s = 0.5000 (°F)
2.0 – 2.9
4
6. The requested frequency distribution is given
In the box at the right
3.0 – 3.9
4.0 – 4.9
5.0 – 5.9
6.0 – 6.9
7.0 – 7.9
8.0 – 8.9
0
1
4
9
9
7
35
278
CHAPTER 12 Analysis of Variance
7. The histogram is given below. The actual cut points for the bars are 0.95, 1.95, 2.95, etc.
9
8
Frequency
7
6
5
4
3
2
1
0
1.0
2.0
3.0
4.0
5.0
6.0
7.0
8.0
9.0
movie viewer rating
No; it does not appear that the ratings are from a population with a normal distribution. The
distribution appears to be negatively skewed, and not bell-shaped.
8. This exercise requires the following summary statistics.
n
Σx
Σx2
x
s2
PG
3
19.9
133.89
6.633
0.9433
PG-13
20
128.6
904.88
6.430
4.1043
R
12
74.1
510.05
6.175
4.7711
informal comparison of the three sample means:
The means of 6.63, 6.43, and 6.18 do not appear to be very different – especially
considering the fairly wide range of ratings that extend from 1.9 to 8.6.
x = (Σni x i )/Σni = [3(6.633) + 20(6.430) + 12(6.175)]/[3+20+12] = 222.6/35 = 6.360
Σni( x i – x )2 = 3(6.633-6.360)2 + 20(6.430-6.360)2 + 12(6.175-6.360)2 = 0.7328
2
Σdfi s i = 2(0.9433) + 19(4.1043) + 11(4.7711) = 132.3504
s 2B = Σni( x i – x )2/(k-1) = 0.7328/2 = 0.3664
s 2p = (Σdfi s i2 )/Σdfi = 132.3504/32 = 4.13595
Ho: μ1 = μ2 = μ3
H1: at least one μi is different
α = 0.05 and dfnum = 2, dfden = 32
C.V. F = Fα = F0.05 = 3.3158
calculations:
F = s 2B /s 2p
0.05
= 0.36644.13595
F
1
3.3158
= 0.0886
P-value = Fcdf(0.0886,99,2,32) = 0.9154
conclusion:
Do not reject Ho; there is not sufficient evidence to reject the claim that μ1 = μ2 = μ3. There
is not sufficient evidence to reject the claim that PG, PG-13 and R movies have the same
mean viewer rating.
2
32
9. These are the necessary summary statistics.
n=6
Σxy = 110.9247 n(Σxy) – (Σx)(Σy) = 5(110.9247) – (52.41)(10.40) = 9.5595
Σx = 52.41 Σx2 = 554.0767 n(Σx2) – (Σx)2 = 5(554.0767) – (52.41)2 = 23.5754
Σy = 10.40 Σy2 = 23.7642
n(Σy2) – (Σy)2 = 5(23.7642) – (10.40)2 = 10.6610
Cumulative Review Exercises
279
a. r = [n(Σxy) - (Σx)(Σy)]/[ n(Σx 2 ) - (Σx) 2 n(Σy 2 ) - (y) 2 ]
= 9.5595/ [ 23.5754 10.6610]
= 0.6030
Ho: ρ = 0
H1: ρ ≠ 0
α = 0.05 [assumed] and df = 3
C.V. t = ±tα/2 = ±t0.025 = ±3.182 [or r = ±0.878]
calculations:
tr = (r – μr)/sr
0.025
0.025
= (0.6030 – 0)/ [1 - (0.6030)2 ]/3
-0.878
0
0.878
r
= 0.6030/0.4605
-3.182
0
3.182
t
= 1.309
P-value = 2∙tcdf(1.309,99,3) = 0.2817
conclusion:
Do not reject Ho; there is not sufficient evidence to conclude that ρ ≠ 0. No; there is not
sufficient evidence to support the claim of a linear correlation between the amounts of
discarded paper and discarded plastic.
b. x = (Σx)/n = 52.41/5 = 10.482
y = (Σy)/n = 10.40/5 = 2.080
2
b1 = [n(Σxy) – (Σx)(Σy)]/[n(Σx ) – (Σx)2]
= 9.5595/23.5754 = 0.4055
bo = y – b1 x
= 2.080 – (0.4055)(10.482) = -2.1703
ŷ = bo + b1x
= -2.1703 + 0.4055x
c. No. Since the correlation in part (a) was not significant, the regression line in part (b)
should not be used for making predictions. The best predictive equation is simply ŷ = y
for all values of x.
3
10. Ho: gun type and circumstances are independent
H1: gun type and circumstances are related
α = 0.05 and df = 2
2
C.V. χ2 = χ 2 = χ 0.05
= 5.991
calculations:
CIRCUMSTANCES
TYPE
handgun
rifle
accident
31
(31.85)
13
(12.15)
44
self-inflict
35
(30.40)
7
(11.60)
42
assault
162
(165.75)
67
(63.25)
229
228
87
0.05
2
5.991
315
χ2 = Σ[(O – E)2/E]
= 0.023 + 0.696 + 0.085
0.059 + 1.824 + 0.223 = 2.909
P-value = χ2cdf (2.909,99,2) = 0.2335
conclusion:
Do not reject Ho; there is not sufficient evidence to reject the claim that gun type and
circumstances are independent. For firearm injuries, there is not sufficient evidence to
reject the claim that the type (handgun or rifle/shotgun) of gun involved and the
circumstances of the event (unintentional, self-inflicted or assault) are independent.
2
2
Download