Quiz CHAPTER 13 Name:____________________ 1. Suppose you observe a relationship between two categorical variables in a sample based on 22,000 people. If this observed relationship were based on a sample of 600 people rather than 22,000, how would this affect the believability of the results (if at all)? Explain your answer. ANSWER: AN OBSERVED RELATIONSHIP IS MUCH MORE BELIEVABLE IF IT IS BASED ON A MUCH LARGER SAMPLE SIZE. SAMPLE RESULTS FROM LARGER SAMPLES ARE LESS LIKELY TO VARY FROM SAMPLE TO SAMPLE AND ARE MORE LIKELY TO REPRESENT THE TRUTH ABOUT THE POPULATION. 2. Which of these does not apply to a ‘statistically significant’ relationship between two categorical variables? a. The relationship between the two variables is very important from a practical standpoint. b. The relationship observed in the sample was unlikely to have occurred unless there really is a relationship in the population. c. The notion that this relationship could have happened by chance is deemed to be implausible. d. All of the above apply to a ‘statistically significant’ relationship. ANSWER: A 3. Which of the following differences in percentages for the two categories of an explanatory variable would be considered to be statistically significant? a. 50% - 49 % = 1% b. 50% - 45% = 5% c. 50% - 40% = 10% d. Not enough information to tell. ANSWER: D 4. When researchers look for a relationship between two categorical variables for individuals in the __________, they measure those categorical variables on individuals in the __________. ANSWERS (RESPECTIVELY): POPULATION; SAMPLE 5. The question researchers are really asking when they are looking for a relationship between two variables is whether or not that relationship is present in the __________. ANSWER: POPULATION 6. The first two basic steps for conducting a hypothesis test are to: 1) determine the null and alternative hypotheses; and 2) collect the data and summarize it with a ‘test statistic.’ What is the third step in the process that allows you to then make a decision? ANSWER: DETERMINE HOW UNLIKELY THE TEST STATISTIC WOULD BE IF THE NULL HYPOTHESIS WERE TRUE. 7. Suppose you want to investigate whether there is a relationship between the gender of college students and whether or not they wear hats in school. What would be your null hypothesis and your alternative hypothesis (in words)? Be sure to label clearly which hypothesis is which. ANSWER: NULL HYPOTHESIS: NO RELATIONSHIP BETWEEN GENDER AND HAT WEARING IN COLLEGE STUDENTS; ALTERNATIVE HYPOTHESIS: THERE IS A RELATIONSHIP. . 8. Suppose you find a statistically significant relationship between two categorical variables (with no other supporting evidence available). When can such results correctly lead you conclude a cause and effect relationship? a. Never. b. Only when the data were from a randomized experiment. c. Only when the data were from a random sample. d. Always; a statistically significant relationship wouldn’t be significant unless there is a cause and effect relationship. ANSWER: B 9. Another name for the alternative hypothesis is the __________ hypothesis. ANSWER: RESEARCH 10. The __________ hypothesis is usually written to express the fact that ‘nothing is happening.’ ANSWER: NULL Narrative: Aspirin and polyps A well-designed experiment by the Ohio State University Medical Center studied whether or not taking an aspirin a day would affect colon cancer patients’ chances of getting subsequent colon polyps. In the study, 635 patients with colon cancer participated; 317 of them were randomly assigned to the aspirin group, and the other 318 patients were assigned to a placebo (non-aspirin) group. The results showed that 17% of the patients in the aspirin group developed subsequent polyps, compared to 27% of the patients in the non-aspirin group. The results are summarized in the table below. ASPIRIN PLACEBO TOTAL POLYPS 54 86 140 NO POLYPS 263 232 495 TOTAL 317 318 635 11. {Aspiring and polyps narrative} Write the expected counts in each of the four cells of the table in parentheses next to each of the observed counts. ANSWER: SEE ANSWERS IN PARENTHESES IN THE TABLE BELOW. ASPIRIN PLACEBO TOTAL POLYPS 54 (69.89) 86 (70.11) 140 NO POLYPS 263 (247.11) 232 (247.89) 495 TOTAL 317 318 635 12. {Aspirin and polyps narrative} Compute the test statistic for the data. ANSWER: 9.25 13. {Aspirin and polyps narrative} Is there a statistically significant relationship between taking aspirin and developing subsequent polyps in colon cancer patients? If so, what is the relationship? Justify your answer. ANSWER: THE CHI-SQUARE STATISTIC IS 9.25 > 3.84 (P-VALUE < .05), THE RELATIONSHIP IS STATISTICALLY SIGNIFICANT. SINCE THE ASPIRIN GROUP EXPERIENCED FEWER POLYPS (17% COMPARED TO 27%), IT APPEARS THAT ASPIRIN REDUCES THE CHANCES OF DEVELOPING SUBSEQUENT POLYPS IN COLON CANCER PATIENTS. 14. Suppose a researcher wanted to find out which of two chemicals works best to kill poison ivy. He conducted a controlled experiment involving 200 poison ivy plants, randomly assigned half of them to each treatment (chemical #1 or chemical #2). Five days after applying the chemicals, he recorded the data below. Is there a statistically significant relationship between type of chemical used and the results when applied to poison ivy? Conduct a hypothesis test to find out. CHEMICAL #1 CHEMICAL #2 TOTAL DIED 78 89 167 LIVED 22 11 33 TOTAL 100 100 200 ANSWER: YES; THE EXPECTED CELL COUNTS ARE IN PARENTHESES. THE CHISQUARE TEST STATISTIC IS 4.39 > 3.84, SO THE RELATIONSHIP IS STATISTICALLY SIGNIFICANT. NOTE THAT CHEMICAL #2 SEEMS TO DO BETTER SINCE IT KILLED A HIGHER PERCENTAGE OF PLANTS IN THE SAMPLE (89% COMPARED TO 78%). CHEMICAL #1 CHEMICAL #2 TOTAL DIED 78 (83.5) 89 (83.5) 167 LIVED 22 (16.5) 11 (16.5) 33 TOTAL 100 100 200 15. Which of the following statements is true about chi-square tests? a. A large chi-square test statistic results in a large p-value. b. A large p-value means that there is a good chance that the relationship is statistically significant. c. If the two variables are not related in the population, then less than 5% of the samples you could ever take would give you a test statistic of 3.84 or larger. d. All of the above. ANSWER: C 16. Which of the following statements is not true about the chi-square statistic for a 2 x 2 contingency table? a. If it is greater than 3.84, reject the null hypothesis and accept the alternative hypothesis. b. If it is greater than 3.84, the relationship in the table is considered to be statistically significant. c. 95% of the tables for sample data from populations in which there is no relationship will have a chi-square statistic of 3.84 or greater. d. All of the above are true. ANSWER: C 17. A __________ test is a statistical procedure that is used to determine whether or not there is a relationship between two categorical variables. ANSWER: CHI-SQUARE 18. If two variables have a statistically significant relationship that does not necessarily mean the two variables have a relationship of practical importance. Describe how this could happen. ANSWER: IT DEPENDS ON THE SAMPLE SIZE; A MINOR RELATIONSHIP OBSERVED IN A SAMPLE BASED ON A LARGE ENOUGH NUMBER OF OBSERVATIONS CAN EASILY ACHIEVE STATISTICAL SIGNIFICANCE. 19. Is it possible for an important relationship in a population to fail to achieve statistical significance in the sample and therefore go undetected, even if the sample is selected properly and the data are collected correctly? Explain your answer. ANSWER: YES; WHY: 1) THERE MAY HAVE ONLY BEEN A FEW OBSERVATIONS; OR 2) THE SAMPLE IS NOT REPRESENTATIVE OF THE POPULATION, JUST BY CHANCE. Narrative: Teenage voting Suppose you take a random sample of teenagers aged 18-19, and you find that 53% of the males in your sample are Democrats, and 54% of the females in your sample are Democrats, respectively. 20. {Teenage voting narrative} Should these results be considered significant from a practical standpoint? Why or why not? ANSWER: NO; THEY ARE NOT SIGNIFICANTLY DIFFERENT FROM A PRACTICAL STANDPOINT. THE DIFFERENCE IS TOO SMALL. 21. {Teenage voting narrative} Could these results ever be statistically significant? Why or why not? ANSWER: YES, IF THE SAMPLE SIZE IS LARGE ENOUGH, ANY NON-ZERO DIFFERENCE CAN BECOME STATISTICALLY SIGNIFICANT. 22. Suppose researchers say they ‘failed to find a relationship’ between two variables that they thought might have been related. What does this say to you, as an educated consumer of statistical information? a. You should check to make sure the study was not based on a small number of individuals. b. The researchers must not have observed any relationship in their sample. c. There must be no relationship between these two variables in the population. d. All of the above. ANSWER: A 23. Suppose there is not enough evidence to conclude that the relationship in the population is real. Which of the following is not an equivalent way of saying this? a. The relationship is not statistically significant. b. We cannot reject the null hypothesis. c. We accept the null hypothesis. d. All of the above. ANSWER: C 24. A minor relationship that is found to be __________ significant is not necessarily __________ significant. ANSWER: STATISTICALLY; PRACTICALLY 25. An interesting relationship in the population may fail to achieve __________ significance if there are too _____ observations. ANSWER: STATISTICAL; FEW 26. A minor relationship in the population may achieve __________ significance but fail to achieve __________ significance if there are an extremely large number of observations. ANSWERS (RESPECTIVELY): STATISTICAL; PRACTICAL