The most common application for this test is called a ‘repeated-measures design’ where two measures are obtained from the same group of subjects, and you are looking for a difference in the means of these measures.
Example: Suppose you wanted to test the drug that may affect
IQ, but this time you look for changes within each subject.
This is done by measuring the IQ for each subject before and after taking the drug. For 9 subjects, you might get a result like this:
6
7
8
9
4
5
Subject IQ before
(X)
1 117
2
3
109
101
105
90
125
101
127
104
IQ after
(Y)
132
105
105
125
96
136
107
130
101
The book discusses two ways of conducting t-tests for a repeated-measures design.
We will only cover the second (easy) way. So we’ll be skipping sections 15.1 and
15.2
The easy (‘alternative’) way is to treat each subject as a single measure by calculating the difference (D) for the scores within each subject, and conducting a ttest on these values of D.
D=Y-X Subject IQ before
(X)
1 117
2
3
4
5
6
109
101
105
90
125
7
8
9
101
127
104
IQ after
(Y)
132
105
105
125
96
136
107
130
101
15
-4
4
20
6
11
6
3
-3
Just perform a simple single sample t-test (like in Chapter 13) on the values of
D using n-1 degrees of freedom, where n is the number of pairs. t
D
s
D
All we are doing here is conducting a single-sample ttest on the differences.
Subject IQ before
(X)
1 117
2
3
4
5
6
109
101
105
90
125
7
8
9
101
127
104
IQ after
(Y)
132
105
105
125
96
136
107
130
101
D=Y-X
6
11
6
3
-3
15
-4
4
20
This is the same equation for a single sample t-test, but with D instead of X: t
D
s
D
Subject IQ before
(X)
1 117
2
3
4
5
6
109
101
105
90
125
7
8
9
101
127
104
IQ after
(Y)
132
105
105
125
96
136
107
130
101
D=Y-X
6
11
6
3
-3
15
-4
4
20
D
6 .
44 s
D
7 .
86 s
D
s
D n
7 .
86
9
2 .
62 t
6 .
44
0
2 .
62
2 .
46
Looking in Table D, df = 8, area in one tail: 0.025
t crit
= ±2.306
Decision: reject H
0 and conclude that the drug had a significant effect on IQ using a two-tailed test with a
= .05.
Using APA Format:
“There was a statistically significant change in IQ after taking the drug (M = 6.44, SD = 7.86), t(8)= 2.46, p<.05
The effect size is computed just as a single sample t-test. Again, just use ‘D’ for ‘X’: g
D s
D
6 .
44
7 .
86
0 .
819
Remember, the general convention is that g=.2 is small, g=.5 is medium and g=.8 is large, so this is a large effect size
What is the power of this test using this effect size?
We will use the power curve for one mean: (not two means, because we’re really just conducting a hypothesis test on the mean of the differences)
Here’s the power curve for a two-tailed test for one mean with a
=.05 from Chapter 13:
For an effect size of 0.819, sample size of 9, what is the power?
a
= 0.05, 2-tails, 1 mean
1
0.9
0.5
0.4
0.3
0.2
0.8
0.7
0.6
1000
500
250
150
100
75
50
40
30
25
20
15
12
10 n=8
0.1
0
0 0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1 1.1
1.2
1.3
1.4
Effect size (d)
Answer: about .525 or so.
Power with correlated groups is calculated the same way as for a single sample design.
Here’s the power curve for a two-tailed test with a
=.05 from Chapter 13:
For an effect size of 0.8, how many pairs (n) do we need for a power of 0.8?
a
= 0.05, 2-tails, 1 mean
1
0.9
0.8
0.7
0.6
0.5
0.4
0.3
0.2
1000
500
250
150
100
75
50
40
30
25
20
15
12
10 n=8
0.1
0
0 0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1 1.1
1.2
1.3
1.4
Effect size (d)
Answer: about 15 pairs (yellow curve)
Other examples (besides repeated measures) of testing difference of means between dependent (correlated) groups. The methods are the same as for repeated measures – calculate a t-test on the difference between each pair.
Matched pairs experiments: Where pairs of subjects are naturally grouped together. For example, in identical twin studies a measure is made from each twin in a pair. D is the difference in scores for each pair of twins.
Matched-subjects design: Where the experimenter matches pairs of subjects together based on some other variable such as age or visual acuity.
Example from the tutorial:
25) We measure the visual acuity of 31 republicans under two conditions: 'itchy' and 'tiny'.
You then subtract the visual acuity of the 'tiny' from the 'itchy' conditions for each republicans and obtain a mean pair-wise difference of -2.15 with a standard deviation is
7.37.
Using an alpha value of 0.01, is the visual acuity from the 'itchy' condition significantly different than from the 'tiny' condition?
What is the effect size?
Answer:
D̄ = -2.15
s
D̄ t obs t crit
= 1.32
= -1.63
= ± 2.75 (df = 30)
We fail to reject H
0
.
The visual acuity of itchy republicans is not significantly different than the visual acuity of tiny republicans (M=-2.15, SD = 7.37), t(30) = -1.6288, p = 0.1138.
Effect size: g = 0.29 (small)