Turner Answer Key for Chapter Seven Using statistics in small

advertisement
Turner
Answer Key for Chapter Seven
Using statistics in small-scale language education research: Focus on non-parametric data
Chapter Seven
Practice Problems Answer Key
Study A: Re-analysis of Jen’s data
The data are presented in Table 6.2., but a downloadable dataset can be found on the Companion
Website (www.routledge.com/cw/turner). Go to the For Students tab and select Data Sets; you
can find the data for this problem from the Chapter Six menu (Box 6.5) or from the Chapter
Seven menu (Jen’s data). Click on the dataset and save it onto your computer using the csv
(DOS) format. Remember where you saved it so you can import it into R for analysis.
Remember from Chapter Six (pp. 141-154) that Jen wanted to determine whether learners’
perceived usefulness of authentic differed from their perceived usefulness of instructional
materials (see p. 142 for a detailed discussion of her dependent variable and data). Because the
assumption of normal distribution was not met (p. 144), we’ll now re-analyze the data using the
Wilcoxon Signed Rank statistic. The formal research hypotheses below are phrased for this nonparametric analysis.
Step 1. State hypotheses
H0: There is no statistically significant difference between the rankings of the learners’
perceptions of the usefulness of the authentic versus the instructional materials.
H1: There is a statistically significant positive difference in the rankings of learners’
perceptions of the usefulness of the authentic materials versus instructional materials.
H2: There is a statistically significant negative difference in the rankings of the learners’
perceptions of the usefulness of the authentic materials versus instructional materials.
Step 2. Set alpha
alpha = .01
Step 3. Identify the appropriate statistic for the analysis
I propose to analyze the data using the Wilcoxon Signed Rank statistic and converting the
observed value of the statistic to a z-score to interpret statistical significance because:
1) the independent variable is nominal and has two levels;
2) the comparison is of two sets of data for exactly the same set of participants;
3) there are 30 or more participants;
4) the dependent variable yields rankable data.
Step 4. Collect the data. (Retrieve from Companion Website at www.routledge.com/cw/turner.)
Step 5. Check the assumptions
1 & 2) the two levels of the independent variable are represented by the same people--there are two “perceived usefulness” scores for each person (one score is the perceived
usefulness of authentic materials and the other is the perceived usefulness score of
instructional materials);
3) there are 93 participants
1
Turner
Answer Key for Chapter Seven
Using statistics in small-scale language education research: Focus on non-parametric data
4) when I review the data, I see that they are rankable.
Step 6. Calculate the observed value of the statistic
I use R to calculate the observed value of the Wilcoxon Signed Rank statistic and the exact
probability of the observed value. (If you’d rather do the calculations with a calculator, see pages
203-206 for guidance.)
> jen.data =read.csv(file.choose (), header =T)
[import the Excel dataset in csv format; I
named the dataset “jen.data”]
> View (jen.data)
[view the dataset]
> summary(jen.data$totauth)
[calculate the mean and median]
Min. 1st Qu. Median Mean 3rd Qu. Max.
4.00 23.00 28.00 26.88 33.00 36.00
> summary(jen.data$totinst)
[calculate the mean and median]
Min. 1st Qu. Median Mean 3rd Qu. Max.
4.00 21.00 26.00 24.76 29.00 36.00
> sd(jen.data$totauth)
[1] 7.474376
[calculate standard deviation]
> sd(jen.data$totinst)
[1] 6.138612
[calculate standard deviation]
> par(mfrow = c(1,2))
[set up a space for side-by-side histograms]
> hist(jen.data$totauth, col = "tomato", breaks =10)
[make 1st histogram]
> hist(jen.data$totinst, col = "sea green 1", breaks =10)
[make 2nd histogram]
2
Turner
Answer Key for Chapter Seven
Using statistics in small-scale language education research: Focus on non-parametric data
> shapiro.test(jen.data$totauth)
[calculate Shapiro Wilk statistic—a
normal distribution isn’t required for this
analysis, but check it anyway!]
Shapiro-Wilk normality test
data: jen.data$totauth
W = 0.9195, p-value = 2.542e-05
> shapiro.test(jen.data$totinst)
[calculate Shapiro Wilk statistic—a
normal distribution isn’t required for this
analysis, but check it anyway!]
Shapiro-Wilk normality test
data: jen.data$totinst
W = 0.9519, p-value = 0.001797
> wilcox.test(jen.data$totauth, jen.data$totinst, paired = T) [calculate the observed value of the
Wilcoxon Signed Rank statistic;
perception for authentic materials
entered first]
Wilcoxon signed rank test with continuity correction
data: jen.data$totauth and jen.data$totinst
V = 2517.5, p-value = 0.005327
alternative hypothesis: true location shift is not equal to 0
3
Turner
Answer Key for Chapter Seven
Using statistics in small-scale language education research: Focus on non-parametric data
> wilcox.test(jen.data$totinst, jen.data$totauth, paired = T) [calculate the observed value of the
Wilcoxon Signed Rank statistic;
perception for instructional
materials entered first]
Wilcoxon signed rank test with continuity correction
data: jen.data$totinst and jen.data$totauth
V = 1223.5, p-value = 0.005327
alternative hypothesis: true location shift is not equal to 0
The observed value of the Wilcoxon Signed Rank statistic is the smaller of the two observed
values of V, V = 1223.5.
Step 7. Calculate the exact probability of the statistic
I simply retrieve the exact probability from the R output, so exact p = .005327.
Step 8. Compare the exact probability to alpha
The rules for interpreting exact probability are:
If exact probability ≥ alpha → accept the null hypothesis
If exact probability < alpha → reject the null hypothesis
The exact probability, p = .005227, is less than alpha, .01, so reject the null hypothesis. I accept
the first alternative hypothesis because I see that the mean perceived usefulness of the authentic
materials is higher than the mean perceived usefulness of the instructional materials.
H1: There is a statistically significant positive difference in the rankings of the learners’
perceptions of the usefulness of the authentic materials versus instructional materials.
Step 9. Make the probability statement
There’s 99% certainty the rankings of the learners’ perceptions of the usefulness of the authentic
materials are systematically higher than their perceptions of the usefulness of instructional
materials.
Step 10. Interpret the meaningfulness
There are two avenues for interpreting meaningfulness: 1) with reference to the research
question, and 2) by calculating effect size.
The research question that guided this part of Jen’s research was: What are language learners’
perceptions of the usefulness of authentic versus instructional materials?” (p. 142).
To calculate effect size, the observed value of the Wilcoxon Signed Rank statistic, V, is
converted to a z-score. The R commands for doing so are presented below—I use the command
with the values for “totinst” because it yields the lower value of V observed:
4
Turner
Answer Key for Chapter Seven
Using statistics in small-scale language education research: Focus on non-parametric data
> study.model = wilcox.test (jen.data$totinst, jen.data$totauth, paired =T)
> study.model
Wilcoxon signed rank test with continuity correction
data: jen.data$totinst and jen.data$totauth
V = 1223.5, p-value = 0.005327
alternative hypothesis: true location shift is not equal to 0
> z=qnorm(study.model$p.value/2)
[This command retrieves the value of z from
the calculation.]
>z
[1] -2.78657
[Type z to see the value of z.]
> effect.size = abs(z)/sqrt(93)
[Calculate the effect size using this formula;
93 is the number of participants.
> effect.size
[1] 0.2889538
[Type effect.size to see the effect size itself.]
Here is my interpretation of the meaningfulness of the findings:
For these 93 language learners, there is 99% certainty of a statistically significant difference in
their perception of the usefulness of authentic versus instructional materials, with the learners
reporting that authentic materials are more useful (Wilcoxon Signed rank value = 1223.5, p =
.005327; effect size = .29). The effect size is moderately strong. The ex post facto design does
not support causal interpretation of the findings, but indicates a need for further research on
learners’ perception of the usefulness of these two types of learning materials.
Study B:
1. The independent variable is +/- instruction in reading strategies.
2. The dependent variable is reading comprehension.
3. Student level of proficiency is a control variable—only high beginners are included in the
study. There is no explicit moderator variable.
4. The instruction on reading strategies is a new component added to the course; if the teacher
previously taught the course, she may have some difficulty fitting the new component into the
program. Perhaps the two teachers could meet regularly to discuss how their classes are
progressing and work together to find solutions to how to fit the new info into the existing
course.
5. External validity is threatened by the fact that only high beginners participated in the study.
This problem could be addressed by including intermediate level participants, too.
6. Descriptive statistics are presented in the chart below. The R commands are presented below
the chart.
5
Turner
Answer Key for Chapter Seven
Using statistics in small-scale language education research: Focus on non-parametric data
Experimental
Group
1
2
3
4
5
6
7
Nonexperimental
Group
47
42
39
37
37
36
32
mean
mode
median
sd
range
38.57
37.00
37.00
4.79
15 points
40.86
42.00
42.00
3.93
11 points
45
44
42
42
42
37
34
> non.exp = c(47, 42, 39, 37, 37, 36, 32)
[enter data for the non-experimental group]
> exp = c(45, 44, 42, 42, 42, 37, 34)
[enter data for the experimental group]
> summary (non.exp)
Min. 1st Qu. Median Mean 3rd Qu. Max. [calculate mean, median, and minimum &
maximum scores]
32.00 36.50 37.00 38.57 40.50 47.00
> summary (exp)
Min. 1st Qu. Median Mean 3rd Qu. Max. [calculate mean, median, and minimum &
maximum scores]
34.00 39.50 42.00 40.86 43.00 45.00
> subset(table(non.exp), (table(non.exp)==max(table(non.exp)))) [find mode]
37
2
> sd(non.exp)
[1] 4.790864
[calculate standard deviation]
> 47-32
[1] 15
[use minimum & maximum to find range]
6
Turner
Answer Key for Chapter Seven
Using statistics in small-scale language education research: Focus on non-parametric data
> subset(table(exp), (table(exp)==max(table(exp))))
42
3
[find mode]
> sd(exp)
[1] 3.933979
[calculate standard deviation]
> 45-34
[1] 11
[use minimum & maximum to find range]
> hist (non.exp, col = "dark sea green", breaks = 10)
> hist (exp, col = "medium spring green", breaks = 10)
7. Follow the steps in statistical logic....
Step 1. State hypotheses
H0: There is no statistically significant difference in the rankings of the reading
comprehension scores for the group that received reading strategies and the group that
did not.
H1: There is a statistically significant positive difference in the rankings of the reading
comprehension scores for the group that received reading strategies and the group that
did not.
7
Turner
Answer Key for Chapter Seven
Using statistics in small-scale language education research: Focus on non-parametric data
H2: There is a statistically significant negative difference in the rankings of the reading
comprehension scores for the group that received reading strategies and the group that
did not.
Step 2. Set alpha
I set alpha at .05.
Step 3. Identify the appropriate statistic for the analysis
I propose to the Wilcoxon Rank Sum statistic to analyze the data because:
1) the independent variable is nominal and has two levels;
2) the two groups include different participants;
3) the dependent variable yields rankable data.
Step 4. Collect the data
The data are presented in the table above.
Step 5. Check the assumptions
1 & 2) there are two levels of the independent variable and they are represented by
different people;
3) review of the data shows that the values are rankable.
Step 6. Calculate the observed value of the statistic.
The R commands are presented below:
> wilcox.test (non.exp, exp)
Wilcoxon rank sum test with continuity correction [The highlighted warning indicates that
“exact = FALSE” should be added so
the appropriate formula is used to
calculate the exact p-value.]
data: non.exp and exp
W = 16.5, p-value = 0.3304
alternative hypothesis: true location shift is not equal to 0
Warning message:
In wilcox.test.default(non.exp, exp) :
cannot compute exact p-value with ties
wilcox.test (non.exp, exp, exact =F)
[Calculate Wilcoxon Rank Sum value]
Wilcoxon rank sum test with continuity correction
data: non.exp and exp
W = 16.5, p-value = 0.3304
alternative hypothesis: true location shift is not equal to 0
8
Turner
Answer Key for Chapter Seven
Using statistics in small-scale language education research: Focus on non-parametric data
wilcox.test (exp, non.exp, exact =F)
[Reverse order of datasets to identify the
lower of the two observed values.
Wilcoxon rank sum test with continuity correction
data: exp and non.exp
W = 32.5, p-value = 0.3304
alternative hypothesis: true location shift is not equal to 0
Step 7. Retrieve the exact probability to consider the formal research hypotheses
Using the exact probability approach—
Retrieve the exact probability from the R output; p = .3304.
Step 8. Consider the formal research hypotheses
Using the exact probability approach—
Use the rules for comparing exact probability to alpha:
If exact probability ≥ alpha → accept the null hypothesis
If exact probability < alpha → reject the null hypothesis
The exact probability, p = .3304 is greater than alpha, .05, so accept the null hypothesis.
H0: There is no statistically significant difference in the rankings of the reading
comprehension scores for the group that received reading strategies and the group that
did not.
Step 9. Make the appropriate probability statement
There’s 95% certainty of no statistically significant difference in the rankings of the
reading comprehension scores for the group that received reading strategies and the group
that did not.
Step 10. Interpret meaningfulness
The researchers found that for this small group of participants there is no statistically significant
in the reading comprehension of learners who received instruction on reading strategies and
learners who did not receive instruction on these strategies (Wilcoxon Rank Sum W = 16.5, p =
.3304). Effect size cannot be calculated due to the small number of participants (Turner, 2014, p.
179).
Study C:
8. The independent variable is +/- study in the intermediate level Summer Intensive Arabic
Program
9. The dependent variable is Arabic oral abilities.
9
Turner
Answer Key for Chapter Seven
Using statistics in small-scale language education research: Focus on non-parametric data
10. The design is ex post facto, or if one considers participation in the Summer Intensive
Language Program to be a treatment, the study is pre-experimental (there is no legitimate
comparison group; there’s just the one group of students before and after participation in the
program).
11. Descriptive statistics & R commands
Posttest
80
60
50
48
44
26
30
R commands
1
2
3
4
5
6
7
Pretest
62
48
50
30
30
28
26
mean
39.14
48.29
> summary (pretest)
Min. 1st Qu. Median Mean 3rd Qu. Max.
26.00 29.00 30.00 39.14 49.00 62.00
> summary (posttest)
Min. 1st Qu. Median Mean 3rd Qu. Max.
26.00 37.00 48.00 48.29 55.00 80.00
> subset(table(pretest),
(table(pretest)==max(table(pretest))))
30
2
mode
> subset(table(posttest),
(table(posttest)==max(table(posttest))))
posttest
26 30 44 48 50 60 80
1 1 1 1 1 1 1
median
sd
range
30.00
14.04
36 points
48.00
18.24
54 points
See above (summary command for mean)
> sd (pretest)
[1] 14.04076
> sd(posttest)
[1] 18.23654
> 62-26
[1] 36
> 80-26
[1] 54
10
Turner
Answer Key for Chapter Seven
Using statistics in small-scale language education research: Focus on non-parametric data
12. Steps in statistical logic
Step 1. State hypotheses
H0: There is no statistically significant difference in the rankings of the Arabic oral skills
of the group before they received the SILP instruction and after they had completed the
SILP instruction.
H1: There is statistically significant positive difference in the rankings of the Arabic oral
skills of the group before they received the SILP instruction and after they had completed
the SILP instruction.
H2: There is statistically significant negative difference in the rankings of the Arabic oral
skills of the group before they received the SILP instruction and after they had completed
the SILP instruction.
Step 2. Set alpha
I set alpha at .05.
Step 3. Identify the appropriate statistic for the analysis
I propose to the Wilcoxon Signed Rank statistic to analyze the data because:
1) the independent variable is nominal and has two levels, represented by the same
people;
2) there are two sets of dependent variable data from the same group of people;
3) the dependent variable yields rankable data.
Step 4. Collect the data
The data are presented in the table above.
Step 5. Check the assumptions
1 & 2) there are two levels of the independent variable and they are represented by same
people;
3) review of the data shows that the values are rankable.
Step 6. Calculate the observed value of the statistic.
The R commands are presented below:
wilcox.test(pretest, posttest, paired=T, exact = F)
Wilcoxon signed rank test with continuity correction
data: pretest and posttest
V = 1, p-value = 0.05848
alternative hypothesis: true location shift is not equal to 0
> wilcox.test(posttest,pretest, paired =T, exact =F)
Wilcoxon signed rank test with continuity correction
11
Turner
Answer Key for Chapter Seven
Using statistics in small-scale language education research: Focus on non-parametric data
data: posttest and pretest
V = 20, p-value = 0.05848
alternative hypothesis: true location shift is not equal to 0
Step 7. Retrieve the exact probability to consider the formal research hypotheses
Using the exact probability approach—
Retrieve the exact probability from the R output; p = .05848
Step 8. Consider the formal research hypotheses
Using the exact probability approach—
Use the rules for comparing exact probability to alpha:
If exact probability ≥ alpha → accept the null hypothesis
If exact probability < alpha → reject the null hypothesis
The exact probability, p = .05848 is greater than alpha, .05, so accept the null hypothesis.
H0: There is no statistically significant difference in the rankings of the Arabic oral skills
of the group before they received the SILP instruction and after they had completed the
SILP instruction.
Step 9. Make the appropriate probability statement
There’s 95% certainty of no statistically significant difference in the rankings of the
group’s oral skills before and at the end of the SILP instruction.
Step 10. Interpret meaningfulness
The researchers found that for this small group of participants there is no statistically significant
in the oral skills of learners before and after they participated in the Summer Intensive Language
Program (fabricated data) (Wilcoxon Signed Rank V = 1, p = .05848). Effect size cannot be
calculated due to the small number of participants (Turner, 2014, p. 179).
12
Download