Answers to Formative Exercise 4

advertisement
EDF 802
Dr. Jeffrey Oescher
Formative Exercise Topic 4 - Factorial ANOVA
Sample Responses
Descriptive Statistics
Very little data was given other than the gender and treatment received by each student.
Eight subjects participated in each of the three groups. Five males and three females were in the morning
and afternoon groups; five females and three males were in the noon group.
Table 1 describes the performance of the groups on the test. Scores for females were slightly higher than
those for males; both groups answered slightly more than three-fourths of the items correctly. Those
students in the morning group performed somewhat better than those in either the noon or afternoon
groups. On average the morning group answered about 90% of the items correctly while the noon and
afternoon groups answered approximately 70% of the items correctly. Variation across the scores of all
groups was moderate.
Please note I have chosen to discuss only the main effects. That is, I discussed differences between males
and females first followed by a discussion of morning, noon, and afternoon. I interpreted the scores from
both norm and criterion referenced perspectives.
The presentation of this data is always complicated. Usually I choose to use the factor with the fewest
number of levels as my first column. In this case there are only two levels of gender, so I have chosen it to
be the variable for my first column. Within each level of gender, I have reported the information for each
level of time as well as the total . The gender by time data is obviously the information for each time for
males as well as females. The Total within each level of gender represents the total for that level. For
example, the total listed under Sex 1 represents the scores for all females. The total rows in the bottom of
the table represent the total scores for each time. That is Total 1 is the total for the first time. Finally the
Total Total represents the grand total regardless of time or gender.
Table 1
Test Score Statistics by Sex and Time
Sex
1
2
Total
Time
1
2
3
Total
1
2
3
Total
1
2
3
Total
N
3
5
3
11
5
3
5
13
8
8
8
24
Mean
44.00
32.40
38.67
37.27
44.80
37.33
35.60
38.54
44.50
34.25
36.75
38.50
SD
4.58
5.73
4.16
6.84
4.32
3.21
4.16
5.74
4.11
5.31
4.17
6.23
Inferential Analyses
The hypotheses being tested in this analysis reflect a main effect for the time-of-day (Hypothesis 1), a main
effect for gender (Hypothesis 2), and an interaction effect for time-of-day by gender (Hypothesis 3).
-1-
1.
2.
3.
H0:µ1.= µ2. = μ3.
H1:µi.≠ µj.
H0:µ.1= µ.2
H1:µ.1≠ µ.2
H0:all αβ effects = 0
H1:all αβ effects ≠ 0
Alpha level was set at 0.05. This represents a reasonable level of control for Type I errors.
The sampling distributions for each hypothesis are F2,18 for time, F1,18 for sex, and F2,18 for the interaction of
time by sex. All inferential tests result in F-statistics.
The results of a factorial ANOVA are presented in Table 2. The assumptions underlying this analysis were
all met. The homogeneity of variance assumption was tested with Levene’s Statistic and found nonsignificant (F5,18 = 0.51, p=.764). The procedure is robust with respect to the violation of the assumption of
normality, and the assumption of independence of observations was assumed true.
Table 2
Factorial ANOVA Results
Source
SS
Time-of-day
372.07
Gender
4.44
Time-of-day*Gender
60.02
Error
372.53
df
2
1
2
18
MS
186.03
4.44
30.01
20.67
F
8.99
0.22
1.45
Sig
002
.649
..261
An examination of the information in Table 2 indicates a significant effect for time-of-day (F2,18 = 8.99,
p=.002) and non-significant results for the gender (F1,18 = 0.22, p=.649) or interaction (F2,18 = 1.45, p=.261)
effects. Sheffee post hoc analyses were used to identify which of the three pairs of means for the time-ofday effect were statistically different. The results indicated morning was statistically different from either
noon or afternoon (MD = 10.25, p = .001; MD = 7.75, p = .011 respectively). There was no statistical
difference between noon and afternoon (MD = -2.50, p = .557). Therefore, the null hypotheses for the
comparisons of morning to noon as well as morning to afternoon were rejected; the null hypothesis for the
comparison of noon to afternoon was accepted.
An examination of the mean scores indicated students performed statistically better in the morning than in
at noon or in the afternoon. This is likely due to the physical and psychological factors discussed earlier in
this report.
For a conceptual perspective, the analysis of the data began with the assumptions the null hypotheses were
true. This permitted the researcher to generate three sampling distributions of F, one for each of the three
hypotheses. One the actual observed F-statistics were calculated, they were mapped into the respective
sampling distribution. In the case of the time-of -day effect, the observed F-statistic was atypical of those in
the sampling distribution. It is reasonable to suggest the null hypothesis of equal means across the three
levels of time-of-day is false; there are differences. In the cases of the gender and interaction effects, the
observed statistics were typical of those in the respective sampling distributions. It is reasonable to suggest
the null hypotheses for the two levels of gender and the six levels of interaction are true.
-2-
Download