Author(s): Brenda Gunderson, Ph.D., 2011 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution–Non-commercial–Share Alike 3.0 License: http://creativecommons.org/licenses/by-nc-sa/3.0/ We have reviewed this material in accordance with U.S. Copyright Law and have tried to maximize your ability to use, share, and adapt it. The citation key on the following slide provides information about how you may share and adapt this material. Copyright holders of content included in this material should contact open.michigan@umich.edu with any questions, corrections, or clarification regarding the use of content. For more information about how to cite these materials visit http://open.umich.edu/education/about/terms-of-use. Any medical information in this material is intended to inform and educate and is not a tool for self-diagnosis or a replacement for medical evaluation, advice, diagnosis or treatment by a healthcare professional. Please speak to your physician if you have questions about your medical condition. Viewer discretion is advised: Some medical content is graphic and may not be suitable for all viewers. Some material may be sourced from: Mind on Statistics Utts/Heckard, 3rd Edition, Duxbury, 2006 Text Only: ISBN 0495667161 Bundled version: ISBN 1111978301 Material from this publication used with permission. Attribution Key for more information see: http://open.umich.edu/wiki/AttributionPolicy Use + Share + Adapt { Content the copyright holder, author, or law permits you to use, share and adapt. } Public Domain – Government: Works that are produced by the U.S. Government. (17 USC § 105) Public Domain – Expired: Works that are no longer protected due to an expired copyright term. Public Domain – Self Dedicated: Works that a copyright holder has dedicated to the public domain. Creative Commons – Zero Waiver Creative Commons – Attribution License Creative Commons – Attribution Share Alike License Creative Commons – Attribution Noncommercial License Creative Commons – Attribution Noncommercial Share Alike License GNU – Free Documentation License Make Your Own Assessment { Content Open.Michigan believes can be used, shared, and adapted because it is ineligible for copyright. } Public Domain – Ineligible: Works that are ineligible for copyright protection in the U.S. (17 USC § 102(b)) *laws in your jurisdiction may differ { Content Open.Michigan has used under a Fair Use determination. } Fair Use: Use of works that is determined to be Fair consistent with the U.S. Copyright Act. (17 USC § 107) *laws in your jurisdiction may differ Our determination DOES NOT mean that all uses of this 3rd-party content are Fair Uses and we DO NOT guarantee that your use of the content is Fair. To use this content you should do your own independent analysis to determine whether or not your use will be Fair. Module 7: Independent Samples t Procedures Objectives: In this module you will learn an important statistical technique that will allow you to compare two populations with respect to their means, that is, learn about 1 2 . The objective is to help you understand the ideas behind confidence intervals, tests of significance, and the statistical language involved in the comparison of two population means. Overview: The two independent samples t procedures (sometimes called Student's t procedures) are used when you want to compare the means of two populations that are not related or matched in any way. For example you might want to test if the G.P.A. of English majors is different than that of Math majors. The idea is to use the two sample means to estimate the corresponding population means. We may want to construct a confidence interval to estimate the difference between the two population means. Or we may wish to test if the difference in the two population means is equal to a value specified in a null hypothesis, the hypothesized value generally being 0. There are 2 versions of independent samples t procedures: pooled and unpooled (also known as general). For a pooled independent t procedure, we assume equal population variances for the two populations. Details on determining which version to use are below. We will assume that we have a random sample from each of the two populations and that the variable being measured has a normal model for each population, with possible different means. In addition, the two random samples are assumed to be independent of each other. Whether or not we can assume the two populations have equal variances will determine which of the two possible independent samples t procedures we use. There are several ways to check the equal population variances assumption required in order to perform the pooled t test instead of the general version. You can construct side-by-side boxplots and see if the IQRs are comparable or compare the two sample standard deviations. A third method is to perform a Levene’s test for homogeneity of variances. Levene’s test is, in fact, a hypothesis test, but it is a bit different from hypothesis tests you have seen previously since it is a test about variances rather than means. The null hypothesis is that the populations of interest do have equal variances and the alternative is that they do not have equal variances. So if the null hypothesis from the Levene’s test is NOT rejected, it would be reasonable to perform the pooled t-test for two population means. We will use an alpha of 10% for Levene’s tests. If the null hypothesis from Levene’s test IS rejected, then the assumption of equal population variances is unreasonable, and the unpooled (general) version should be used instead of the pooled. Be careful not to confuse the Levene’s test p-value with the p-value from the two independent samples t-test, since your output will contain both. Note: A summary of inference procedures for population means (one sample, paired, and independent samples) and the corresponding assumption checks can be found in Supplement 7. Supplement 6 is also quite helpful, giving examples of common interpretations. 68 Formula Card: Activity: Do Men and Women differ in their SSHA scores? Background: A total of 38 college freshmen were administered the Survey of Study Habits and Attitudes (SSHA), a psychological test designed to measure motivation and attitude towards study habits in college students, at a private college. The 38 students were a simple random sample, of which 18 were female and 20 were male. It is known that scores on the SSHA may explain success in college. Scores on the test range from a low of 0 to a high of 200. School administrators are interested in whether or not there is a difference between the mean scores for males and females. The scores for females (Group 1) and males (Group 2) are listed in the SSHA.sav data set (Source: Moore and McCabe (1999), pg 563). Task: Perform a two-independent samples t-test to assess if there is a difference between the mean score for women on the test and the mean score for men on the test. Recall: Write out the Five Steps for conducting a test of hypotheses (Reference page 51). 1. 2. 3. 4. 5. 69 Before conducting any test, here are a set of questions to ask yourself: How many populations are there? One Two More than two How many variables are there? One Two What is the response variable? What type of variable is the response? Categorical Quantitative What type of parameter would be useful for summarizing this response? Proportion Mean Other (see Supplement 3) Based on the answers to these questions, you should be able to identify the appropriate inference procedure. You may refer back to Supplement 3 – Name that Scenario for assistance. The appropriate inference procedure for this scenario is ______________________________ and the specific parameter of interest is ___________________ . Note: Why is this not a paired t procedure? 1. State the hypotheses: H0: __________ = __________ Ha: _______________________ where _____ represents Your parameter definition should always be a statement about the population(s) under study. 2. Assumption Checks and Computing the Test Statistic: Assumptions: NOTE: There are TWO versions of this test – pooled and unpooled (general). Both versions are on your formula card. The pooled test has an additional assumption that the unpooled test does not have; the pooled test also assumes that the POPULATIONS have equal variances. a. For this scenario, we need to assume that the two samples are ________________ from each other. b. We need to assume that each sample is a ___________ sample. To check this assumption, we would make a __________ plot (if there was time order) for each sample and look for ________________________________. c. Each sample needs to come from a normally distributed _________________ . To check this assumption, we would make a _______ plot for each __________. d. Finally, for the pooled test, we also need to assume that both populations have equal ____________________. 70 Some guidelines: Determining General vs. Pooled (i.e. checking the assumption of equal population variances) There are three ways to check this assumption depending on the information provided. o Examine the sample standard deviations. If they are similar, then the assumption is valid. (This is because variance is standard deviation squared). ‘Similar’ here roughly means one should not be more than two times the other. o Examine side-by-side boxplots of the sample data. If the IQRs are similar, then the assumption is valid (i.e. the lengths or sizes of the BOXES should be similar; but the two boxes do not need to line up right next to each other). o Use Levene’s test. Levene’s test is a test of assumptions done BEFORE the hypothesis test. SPSS will provide this output. The Levene’s test null hypothesis is that the populations have equal variances. Hence, this is a hypothesis test where you WANT to keep the null hypothesis if you want to do a POOLED test. The Levene’s test statistic is an F, and the p-value can be found under Sig. in the Levene’s section of the output. Using Levene’s test: o If the Levene’s test p-value is greater than 0.10 (or the specified significance level), the assumption of equal population variances appears to hold, and you can use a POOLED test. o If the Levene’s test p-value is less than or equal to 0.10 (or the specified significance level), the assumption of equal population variances does not appear to be met, and you would use the GENERAL test. e. Do the Assumptions Appear Valid? Comment on each assumption below, using graphs and output when appropriate. Are the two samples independent? Are the samples random samples? Note there is no time order for this data. If there was time order, since you need EACH sample to be a random sample, how many time plots would you need to make to check this assumption? ________ time plot(s) 71 Does it appear that the assumption that each sample comes from a normally distributed population is met? Why? Note: The equal population variances assumption will be considered after the t-test output is generated. The t-test output will include results from Levene’s test and allow us to decide between the pooled versus general two independent samples t procedure. 72 Test-statistic: e. Generate the t-test output. Use Analyze>Compare Means>Independent-Samples T-Test. f. The test value is _____ (this is the null value from the null hypothesis). g. The output provides both the pooled (equal variances assumed) and unpooled (equal variances not assumed) versions of the test. Use the output and the provided side-by-side boxplots to determine which version of the test can be used. i. The standard deviation for females is ______ and the standard deviation for males is _______. The standard deviations are: similar not similar. ii. Side-by-side boxplots show that the IQRS are: similar not similar. iii. Write the null and alternate hypothesis for Levene’s test using appropriate notation H0:__________________________ Ha:__________________________ The Levene’s test statistic is ____________ and the Levene’s p-value is __________. Therefore, we can cannot reject the null hypothesis that the population variances are equal. o Hence, for the above reasons, we can say the assumption of equal population variances is is not valid. o What is the symbol of the estimate of the common population standard deviation AND its value? Symbol: Estimate: h. Based on which version (pooled or general) of the test you decided to conduct in part g, what is the value of the test statistic? Note that there are two lines of output in your table. The first line corresponds to using a pooled procedure (equal variances assumed), whereas the second line corresponds to what you report if you were using the unpooled (equal variances not assumed). i. What is the distribution of the test statistic if the null hypothesis is true? This is the same as asking what model you use to find the p-value. 73 3. Calculate the p-value: a. What is the SPSS reported p-value? _____________. Is it the p-value we want? _____ b. Draw a picture of the p-value we want. c. So, our p-value is _____________________ d. Provide an interpretation of the p-value (see supplement 6). 4. Decision: What is your decision at a 5% significance level? Reject H0 Fail to reject H0 Remember: Reject H0 Fail to reject H0 Results statistically significant Results not statistically significant 5. Conclusion: What is your conclusion in context of the problem? Conclusions should not be too strong -- say you have sufficient evidence or equivalent, do NOT say we have proven. Conclusions should always include a reference to the population parameter of interest. 6. Confidence Intervals (CI): a. What is the 95% confidence interval given in the output for the difference in the two population mean scores? b. Based on the confidence interval, would you reject the null hypothesis at the 5% significance level? Circle one: Yes No Explain. Did your conclusions on 4 and 6b match? 74 c. Interpret the 95% confidence interval. d. Provide an interpretation of what the 95% confidence level means. Check Your Understanding: 1. T F The test statistic of 2.032 in the activity implies that the sample means differed by 2.032 pooled standard errors. 2. T F The additional assumption required by the pooled test is that the samples have equal variances. 3. Consider the following three sets of boxplots of scores between two age groups. Circle the sets that indicate a pooled test is appropriate. Set 1 Set 2 Set 3 For which set are you most likely to reject the null hypothesis that the population mean scores are equal? Set 1 Set 2 75 Set 3 Check Your Understanding: Paired or Independent Samples? Consider the following studies in which two sets of quantitative measurements were obtained. Determine whether the two samples are independent or paired. a. The effectiveness of a new headache medicine is tested by measuring the intensity of a headache in patients before and after drug treatment. The data consist of before and after intensities for each patient. independent samples paired samples b. The effectiveness of a new headache medicine is tested by measuring the amount of time before the headache is cured for patients who use the new medicine and for another group of patients who use a placebo drug. independent samples paired samples c. The accuracy of verbal responses is tested in an experiment in which individuals are first asked to report their height and then their actual height is measured. The data consist of the reported height and the measured height for each individual. independent samples paired samples 76 Example Exam Question on Two-Sample t-Test In an animal-learning experiment, a researcher wanted to assess if a particular drug speeds learning. One group of 5 rats (Group 1 = control group) is required to learn to run a maze without use of the drug, whereas a second independent group of 8 rats (Group 2 = experimental group) is administered the drug. The running times (time to complete the maze) for the rats in each group were entered into SPSS. Group Statistics Time to complete maze Group Control Experimental N 5 8 Mean 46.80 38.38 Std. Deviation 3.42 4.78 Std. Error Mean 1.53 1.69 Independent Samples Test Levene's Tes t for Equality of Variances Run Time Equal variances as sumed Equal variances not ass umed F 1.09 Sig. .32 t-tes t for Equality of Means t 3.41 3.70 df 11 10.653 Sig. (2-tailed) .006 .004 Mean Difference 8.42 8.42 Std. Error Difference 2.47 2.28 Conduct the two independent samples t-test to address the theory of the researcher (state the null and alternate hypotheses, report the test statistic, p-value, and state your decision and conclusion at the 5% level of significance). H0: _______________________ Ha: _______________________ Based on the above output, which results should you report? pooled or un-pooled? Explain. Test statistic: ______________ Decision: (circle one) Fail to reject H0 p-value: ____________________ Reject H0 Thus … Would the corresponding 95% CI for the difference in population means contain 0? Is it appropriate to use a 95% CI to perform the hypothesis test above? Why or why not? 77