Chapter 14 Hypothesis Tests: Two Independent Samples 2 Chapter 14 Two Independent Samples •Major Points • An example • Distribution of differences between means • Heterogeneity of Variance • Nonnormality • Confidence limits Cont. 3 Chapter 14 Two Independent Samples •Major Points--cont • SPSS printout • Review questions •Violent Videos Again Chapter 14 Two Independent Samples • Bushman (1998) Violent videos and aggressive behavior • Doing the study Bushman’s way— almost (previously we looked at a one group example, comparing group mean to population mean) • Bushman actually had two independent groups • Violent video versus non-violent video • We want to compare mean number of aggressive associations between groups 4 5 Chapter 14 Two Independent Samples •The Data Condition Mean St. Dev. Variance n Violent video 7.10 4.40 19.36 100 NonViolent video 5.65 3.20 10.24 100 Chapter 14 Two Independent Samples 6 •Analysis • These are Bushman’s data, though he had more dimensions. • We still have means of 7.10 and 5.65, but both of these are sample means. • We want to test differences between sample means. • Not between a sample and a population mean • Not between related samples • We call these Independent samples Cont. 7 Chapter 14 Two Independent Samples •Analysis--cont. • Q: How are sample means distributed if H0 is true? • Need sampling distribution of differences between means • Same idea as before, except statistic is (mean1 - mean2) • Distribution of Differences between means over repeated samplings X 8 Chapter 14 Two Independent Samples •Sampling Distribution of Mean Differences • Mean of sampling distribution = 1 - 2 (remember that the mean of the sampling distribution should equal the population value, here the pop. mean difference) • Standard deviation of sampling distribution (standard error of mean differences) = sX 1 X 2 s12 s22 n1 n2 Note. The variance of the differences between 2 samples is equal to the sum of their variances Cont. Chapter 14 Two Independent Samples •Sampling Distribution--cont. • Distribution approaches normal as n increases. • Later we will modify this to “pool” variances. 9 10 Chapter 14 Two Independent Samples •Analysis--cont. • Same basic formula as before, but with accommodation to 2 groups. X1 X 2 X1 X 2 t sX 1 X 2 s12 s22 n1 n2 • Note parallels with earlier t, p 291 11 Chapter 14 Two Independent Samples •Our Data X1 X 2 7.10 5.65 t 2 2 19.36 10.24 s1 s2 100 100 n1 n2 1.45 1.45 2.66 .296 .544 12 Chapter 14 Two Independent Samples •Degrees of Freedom • Need to know the df to look up the critical t value for rejecting null • Each group has 100 subjects. • Each group has n - 1 = 100 - 1 = 99 df • Total df = n1 - 1 + n2 - 1 = n1 + n2 - 2 100 + 100 - 2 = 198 df • t.025(198) = +1.97 (approx.) 13 Chapter 14 Two Independent Samples •Conclusions • Since 2.66 > 1.97, reject H0. • Conclude that those who watch violent videos produce more aggressive associates than those who watch nonviolent videos. Chapter 14 Two Independent Samples •Assumptions • Two major assumptions • Both groups are sampled from populations with the same variance • “homogeneity of variance” • Both groups are sampled from normal populations • Assumption of normality • Frequently violated with little harm. • Sample sizes greater than 30, Central Limit theorem says approaches normal 14 Chapter 14 Two Independent Samples 15 •Pooling Variances • If we assume both population variances equal, then a weighted average of sample variances would be better estimate (weighted by each 2 or n size). 2 samples df, (n 1) s n 1s s 2 p 1 1 2 2 n1 n2 2 99(19.36) 9910.24 29304 14.8 100 100 2 198 Cont. Chapter 14 Two Independent Samples •Pooling Variances--cont. • Substitute sp2 in place of separate variances in formula for t. • Will not change result if sample sizes equal • Do not pool if one variance more than 4 times the other AND unequal group sizes, or if the test of unequal variances (Levene’s test) is significant. 16 Chapter 14 Two Independent Samples •Heterogeneous Variances • Refers to case of unequal population variances (one variance is more than 4 times the other or test is significant) • We don’t pool the sample variances. • We adjust df and look t up in tables for adjusted df, this is a more conservative test (need larger t to be significant). • One option is to use the Minimum df = smaller group n - 1. • Most software calculates optimal df, slightly larger than smaller n-1. 17 18 Chapter 14 Two Independent Samples •Confidence Limits on the mean difference CI .95 X 1 X 2 t .025 s X 1 X 2 (7.1 5.65) 1.97 .544 1.45 1.97 .544 1.45 1.07 0.38 diff 2.53 Cont. Chapter 14 Two Independent Samples •Confidence Limits--cont. • p = .95 that interval formed in this way includes the true value of 1 - 2. • Not probability that 1 - 2 falls in the interval, but probability that interval includes 1 - 2. • Reference is to “interval formed in this way,” not “this interval.” Minor difference in wording. 19 20 Chapter 14 Two Independent Samples •Computer Example • SPSS reproduces the t value and the confidence limits. • All other statistics are the same as well. • Note different df depending on homogeneity of variance. • Don’t pool if heterogeneity • and modify degrees of freedom Chapter 14 Two Independent Compare Samples the two means Group Statistics AGGRESS Mean Std. Error Mean N 1.00 100 7.1000 4.4000 .4400 2.00 100 5.6500 3.2000 .3200 Quick check for normality, does mean + and – 2 SD equal range? If not, probably skewed, non-normal Independent Samples Test Levene’s test is decision making point, null is that there is no diff in variances, if it is sig (p < .05) assumption violoated, move down AGGRESS Std. Deviation GROUP Each group has 100 people 21 Estimate of sampling error, SD of sampling disctribution Equal variances assumed Equal variances not assumed Levene's T est for Equality of Variances 95% confident that the diff in group means in population falls between – and -, notice that 0 is not in interval This is the test of difference between group means, Rule of thumb, t > 2 reject t-test for Equality of Means Mean gr1 – mean gr2 95% Confidence Interval of the Difference Std. Error Differe nce Lower Upper Sig. (2-tailed) Mean Differen ce 198 .008 1.4500 .5441 .3771 2.5229 181 .008 1.4500 .5441 .3765 2.5235 F Sig. t df 8.8 .003 2.665 2.665 Optimal df computed, not pooled Sig or p </= alpha .05, reject null, there was a stat sig diff SE Diff in group means 22 Chapter 14 Two Independent Samples Interpretation • Have the assumptions been met? – Normality: histograms or mean + and – 2 SD’s – Equal variances: Interpret Levene’s test • Is the test of Ho statistically significant? – T and sig level • What is the effect size? Is the mean difference large? Is is about a 1 sd difference, 2 sd’s? How LARGE is it? – Diff in means/pooled SD (quick way is to average SD for 2 groups) 23 Chapter 14 Two Independent Samples •Review Questions • Why do you suppose it makes a difference whether we compare a sample and a population mean or two sample means? • What do we know about the sampling distribution of differences between means? Cont. Chapter 14 Two Independent Samples 24 •Review Questions--cont. • What do we mean by “the standard error of differences between means?” • How do we calculate the degrees of freedom with two groups? • What are the underlying assumptions? • What does it mean to “pool” the variances? Cont. Chapter 14 Two Independent Samples •Review Questions--cont. • When would we not pool the variances? • What is the difference between how we calculate confidence limits with one or with two means? • What does the text mean by modifying the df when we have heterogeneity of variance? 25