Chapter 11 Study Guide -----------------Chi-Square THE ANALYSIS OF CATEGORICAL DATA AND GOODNESS-OF-FlT TESTS This section will explore inference for categorical data using chisquare test procedures. These procedures are used with univariate as well as bivariate data sets if the variables are categorical. OBJECTIVES 3 Carry out a chi-square goodness-of-fit test. » Carry out chi-square tests for homogeneity of proportions and for independence. CHI-SQUARE TESTS FOR UNIVARIATE DATA (Introduction to Statistics & Data Analysis 3rd ed. pages 647-€55/4th ed. pages 700-708) In this section, we extend techniques for analyzing univariate categorical data sets. Here we can consider questions about variables that involve two or more categories. A chi-square test (y2} for univariate data allows us to test hypotheses about the proportions falling into the different categories for a categorical variable. We do this by looking at the frequencies observed in each of the categories and comparing them to what would be expected if a null hypothesis were true. The hypothesized proportions can be equal or they can be different for each category. EXAMPLE A new donut shop plans to sell plain, strawberry, blueberry, and cinnamon donuts. They wonder if there is a preference for one of 262 * Chapter 11 these types of donuts or if each type is preferred by the same proportion of customers. If we let pp = proportion ps = proportion pb = proportion p_ = proportion of customers preferring plain donuts of customers preferring strawberry donuts of customers preferring blueberry donuts of customers preferring cinnamon donuts We are interested in knowing if pp = ps = pb = pc = 0.25 or if there is evidence that these proportions are not all the same. To answer this question, a random sample of 60 customers is surveyed, and each person in the sample is asked which of the four donut types they prefer. Instead of running a separate two-sample proportions test for each of the possible pairs of proportions, a chi-square test will allow us to decide if the proportions we have observed in our sample are significantly different from the hypothesized proportions. The chisquare goodness-of-fit test will allow us to test the following hypotheses: Ha •. not all of the proportions are equal to 0.25 (For chi-square tests, it is acceptable to state hypotheses in words on the AP Exam.) To make the decision to reject or fail to reject the null hypothesis, the chi-square test will compare the number of responses observed in each category to what we would expect to see in each category if the null hypothesis is true. If the difference is too large, we reject the null hypothesis. Otherwise, we fail to reject, the null hypothesis. Data on a single categorical variable is usually summarized using a one-way frequency table. Returning to the previous example, suppose that the sample of 60 customers resulted in the data summarized in the table below. The table entries are observed frequencies or counts. Types of Ponut Preferred Observed Count Plain Strawberry Blueberry Cinnamon 13 12 16 19 These counts represent the number of times a person in the sample selected each particular donut type. Expected counts are calculated using the hypothesized proportions from the null hypothesis. In this example, all four hypothesized proportions are equal to 0.25, so if the null hypothesis is true, we would expect to see the same number preferring each type. We calculate expected counts by multiplying the sample size by each hypothesized proportion: The Analysis of Categorical Data and Goodness-of-Fit Tests * 263 npp = 60(0.25) = 15 nps = 60(0.25) = 15 npb = 60(0.25) = 15 npc = 60(0.25) = 15 The expected counts can now be entered into the table as shown here. Types of Donuts Sold Plain Strawberry Blueberry Cinnamon Observed Count 13 12 16 19 Expected Count 60(0.25) = 15 60(0.25) =15 60(0.25) = 15 60(0.25) = 15 Now we are ready to calculate the value of the test statistic. This test statistic is called the goodness-of-fit statistic because it considers how well the observed values fit to what we expected to see. Calculating the value of a chi-square (%2} test statistic is relatively easy. _ (observed cell count - expected cell count)2 expected cell count 2 which is often abbreviated ,(0 - E)2 For the example problem, the value of this test statistic is 2 (13-15)2 1 (12-15)2 (16-15)2 15 15 15 (-2) 2 ( C3)2 (I)2 (4)2 15 15 15 15 4 9 1 16 ~15 + 15 + 15 15 30 15 =2 _ (19-15)2 15 To find the associated P-value, we use a chi-square distribution with df = k - 1, where k is the number of categories of a categorical variable. The P-value is the area to the right of the computed test statistic value (the chi-square goodness-of-fit test is an upper tail test). This area is found by referring to the chi-square table or by using a graphing calculator or other technology. 264 * Chapter 11 Probabilityp The chi-square table is similar to the t table. You locate the df in the leftmost column then read across the row to find the value of the %2 test statistic. If the value is between two columns, then the P-value is between the two corresponding tail probabilities. Chi-Square Tail Probability (%2) df 0.25 0.20 0.15 0.10 0.05 0.025 1 1.32 1.64 2.07 2.71 3.84 5.02 2 2.77 3.22 3.79 4.61 5.99 7.38 3 < 3Lll 4.64 5.32 6.25 7.81 9.35 5.99 6.74 7.78 9.49 11.14 Z 5.39 1 4 : I^> 1 Since our xz = 2.00 and this value is smaller than 4.11, we know the tail probability is greater than 0.25. Using any reasonable « level will result in a failure to reject the null hypothesis. The only things that we have not yet discussed are the overall assumptions required for the chi-square test. The first assumption is that we have a random sample of observations of a categorical variable. In addition, the sample size must be large enough that the following conditions are met: 1. No expected counts are < 1, 2. All of our expected counts should be > 5 and if they are not, 3. No more than 20% of the expected counts are < 5. Returning to the donut example: HYPOTHESIS H0 : pp = 0.25, ps = 0.25, pb = 0.25, pc = 0.25 Ha : H0 is not true; in other words, not all of the proportions are equal We will use a significance level of 0.05 for this test. The Analysis of Categorical Data and Goodness-of-Fit Tests TEST AND ASSUMPTIONS Test: Chi-square goodness-of-fit test Assumptions: 1. The sample was a random sample of customers 2. All expected counts are > 5, so the sample size is large enough. TEST STATISTIC As seen earner, the calculated test statistic for this test would be 2 _(13-15) 2 [ (12-15)2 | (16-15)2 15 15 15 _(-2) 2 [ (-3)2 (I)2 (4)2 15 15 15 15 (19-15)2 + 15 30 ~15 =2 = with df = 3 Based on df = 3 and %2 = 2, the P-value is greater than 0.25. CONCLUSION Since P-value > a, there is insufficient evidence to reject the null hypothesis. In other words, there is not convincing evidence that the four types of donuts are not equally preferred. AP Tip For all chi-square tests, make sure to provide the expected counts and verify that they are large enough. SAMPLE PROBLEM 1 A music producer is interested in marketing a new artist via ads in movie theaters. The target age group is teenagers from 14-18 years of age. The company developing the advertisement has offered to run the ad in conjunction with the following types of movies: 20% of the time with comedies, 50% with dramas, and 30% with action films. However, the movie producer is not sure these percentages reflect the types of movies that teens attend. To investigate, each person in a random sample of teens was asked what type of movie they had seen most recently, resulting in the following data: Movie Most Recently Watched By 1OO Teens Comedy Observed Count 41 Drama 35 Action 24 265 266 * Chapter 11 Do these data provide convincing evidence that the proportions of teens watching the different types of movies are different than the proportions proposed by the company developing the ad? Use a significance level of 0.05. SOLUTION TO PROBLEM 1 HYPOTHESIS pc= proportion of teens who watched a comedy pd= proportion of teens who watched a drama pa= proportion of teens who watched an action films H0:pc=0.2; pd = 0.5; pa = 0.3 Ha : the null hypothesis is not true (at least one of the proportions of teens watching the various film types is not equal to the hypothesized proportion). a=0.05 ASSUMPTIONS The problem states the teens were a random sample. All the expected counts are greater than or equal to 5 (see table below). movie most Recently watcnea B f luu Teens Expected Counts Comedy Drama Action 100(0.2) = 20 100(0.5) = 50 100(0.3) = 30 TEST STATISTIC %z Goodness-of-fit test: -20)2 (35-50)2 _ (24-3Q)2 20 (21)2 20 - -| 50 (-15)2 50 30 i 1- - 30 = 22.05 + 4.5 + 1.2 = 27.75 with df = 2 CONCLUSION The P-value associated with %2= 27.72 and df = 2 is approximately 0.000. Since 0.000 < 0.05, there is strong evidence to reject the null hypothesis. In other words, based on this sample there is strong evidence that at least one of the proportions is not equal to the proportion stated by the ad company. Just as with other hypothesis tests, it is important to address all parts of the test. Clearly conveying your understanding of the procedure will include » explaining the notation used, a addressing all assumptions that are required for the test, • demonstrating correct mechanics, a and finally writing a conclusion in context. The Analysis of Categorical Data and Goodness-of-Fit Tests * As with other tests, the mechanics can be done using a graphing calculator, but it is strongly recommended that you still show the initial set-up. This lets the AP Reader know you understand how to compute the value of the test statistic. TESTS FOR HOMOGENEITY AND INDEPENDENCE IN A TWO-WAY TABLE (Introduction to Statistics & Data Analysis 3rd ed. pages 660-671/4th ed. pages 711-722) A chi-square test procedure can also be used with bivariate categorical data, which is usually displayed in a two-way frequency table. There are two different types of investigations that arise from this type of data. One type involves inferences about association between two different categorical variables being observed on a single sample. The other type involves comparing two or more populations or treatments when a single categorical variable is observed. The calculation procedure is the same for both types of investigation, but the primary question of interest is different. TEST FOR HOMOGENEITY In a chi-square test for homogeneity, we are interested in whether the proportions falling into each of the possible categories of a categorical variable are the same for all of the treatments or populations studied. In this case, the null hypothesis is that the distribution of the categorical variable is the same for each population or treatment. EXAMPLE A snack manufacturer produces three types of chips in two different locations (Location A and Location B). Sometimes bags of chips are damaged in the packaging process. Each bag in a random sample of 45 bags of chips packaged at Location A and in a random sample of 30 bags of chips packaged at Location B was classified into one of three categories: no damage, minimal damage, and severe damage. The resulting data is summarized in the table below. The manufacturer was interested in determining if there was sufficient evidence to conclude that the proportions falling into each of the three damage categories is not the same for the two locations. No Damage Minimal Damage Severe Damage Row Totals Location A 15 18 12 45 Location B 8 12 10 30 23 30 22 75 Column Totals 267 268 •:• Chapter 11 To answer the question of interest the hypotheses would be H0: There is no difference in the proportions falling into each damage category for the two locations Ha: The proportions falling into each damage category are not the same for the two locations The assumptions that need to be are (1) the sample must be a random sample and (2) all expected counts must be at least 5. The expected counts for a chi-square test of homogeneity are calculated using the following formula: (row totalKcolumn total) table total For example, the expected count for the cell of no damage and Location A is (53)(23) = 16.04 76 A way to record these expected counts is either in a separate twoway table of just the expected counts or in parenthesis beside each observed count as shown in the table below. All of the expected counts are greater than or equal to 5. No Damage Minimal Damage Severe Damage Row Totals Location A 19 (16.04) 20 (18.13) 14 (18.83) 53 Location B 4 (6.96) 6 (7.87) 13 (8.17) 23 23 26 27 76 Column Totals Degrees of freedom for the chi-square test of homogeneity is computed as follows: where r = number of rows and c = number of columns Note that the total column and row are not counted in computing df. The chi-square test statistic that was used in the goodness-of-fit test is also used here. %z test for homogeneity 2 _(19-16.04) 2 (20-18.13)2 (14-18.83)2 (4-6.96)2 { (6-7.S7)2 : (13-8.17)2 X ~ 16.04 + 18.13 + 18.83 + 6.96 + 7.87 8.17 = 6.5 withdf = (2-l)(3-l) = 2 The Analysis of Categorical Data and Goodness-of-Fit Tests •* 269 Using the graphing calculator technology, you first enter a matrix of the observed counts and then run a chi-square test from the test menu. The calculator will calculate the expected counts and store them in matrix[B]. MflTRIXIR] 2 x3 [ 19 [ H EO fi 1H 13 ] ] *2-Test Observed : [fi] Expected: [B] Calculate Draw X 2 -Test X*=6. 534131993 p=. 0381181018 df=2 Finally, since P < a (0.038 < 0.05), we reject the null hypothesis. There is strong evidence that the proportions falling into each of the three damage categories is not the same for both locations. TEST FOR INDEPENDENCE The chi-square test for independence, also known as the chi-square test for association, is used to investigate if there is an association between two categorical variables. The calculations will proceed in the same manner as the test for homogeneity; however, we are actually looking to see if knowing the value of one variable provides information about the value of the other variable. EXAMPLE A car manufacturer has two production lines building three types of cars. An engineer is wondering if there is an association between the type of car and the production line that made the vehicle for cars that are found to have major defects. Each car in a random sample of 75 cars selected from all cars found to have major defects was classified according to the type of car and the production line that produced the car. The resulting data is given in the table below. Sedan Wagon Truck Line A 13 9 12 Line B 18 12 11 The question of interest is whether there is an association between car type and production line for cars with major defects. To answer this question, we use the chi-square test for independence. HYPOTHESIS H0 : Production line and car type are independent. Ha : Production line and car type are not independent. a = 0.05 ASSUMPTIONS The sample was a random sample of cars with major defects. 270 * Chapter 11 The expected counts are all greater than or equal to 5, so we can proceed (see table below). Expected Counts for Auto Errors Sedan Wagon Truck Line A 14.1 9.5 10.4 LineB 16.9 11.5 12.6 TEST STATISTIC Since this is a %2 test for independence, the calculations would be 2 ^(13-14.1)2 | (9-9.S)2 | (12-10.4)2 14.1 9.5 10.4 [ (18-16.9)2 | (12-11.5)2 16.9 11.5 < (11-12.6)2 12.6 df = 2 P = 0.73 CONCLUSION Since 0.73 is not smaller than 0.05, we fail to reject the nuU hypothesis. Therefore, we do not have sufficient evidence of an association between car type and production line. Notice, the steps in the chi-square hypothesis tests are the same as for all other tests: State hypotheses, identify the test by name or formula and check assumptions required for the test, calculate the value of the test statistic and the associated P-value, and write a conclusion in context that is linked to the P-value. INTERPRETING AND COMMUNICATING THE RESULTS OF STATISTICAL ANALYSES (Introduction to Statistics & Data Analysis 3rd ed. pages 677-680/4th ed. pages 727-730) This particular set of tests involved writing our conclusions in terms that match the setting of the individual test. Each chi-square test is performed in a different setting depending on the data we have to work with. It is important to state which of the three chi-square tests you are using when carrying out one of these tests. The goodness-offit, homogeneity, and independence tests all have different hypotheses. Be clear which test you are using. CATEGORICAL DATA AND GQODNESS-OF-FIT TESTS: STUDENT OBJECTIVES FOR THE AP EXAM • You will be able to determine degrees of freedom for chi-square tests of goodness-of-fit, homogeneity, and independence. ^ You will be able to calculate expected counts for chi-square tests of goodness-of-fit, homogeneity, and independence. • You will be able to carry out chi-square tests of goodness-of-fit, homogeneity, and independence. The Analysis of Categorical Data and Goodness-of-Fit Tests * « You will identify the correct hypothesis for each of the three chisquare scenarios. « You will be able to interpret conclusions in contest for chi-square tests of goodness-of-fit, homogeneity, and independence. MULTIPLE-CHOICE QUESTIONS 1. Each car in a random sample of 200 cars sold in 2010 at a large car dealership was classified by color. Sixty of the cars were white, 80 were blue, 20 were silver and 10 were red. What is the value of the chi-square statistic in a test of H0 : pw = pb = ps = pr = 0.25 ? (A) 18 (B) 32 (C) 50 (D)70 (E) 110 2. Which of the following is not true of the %2 probability distribution? (A) For small degrees of freedom, the distribution is right skewed. (B) As the degrees of freedom increase, the %2 distribution approaches a normal distribution. (C) All of the area under a chi-square curve is associated with positive values. (D) The total area under the^2 curve is equal to one. (E) The mean of a chi-square distribution is 0. 3. There are four surgical methods currently being used to place medical implants in patients. After surgery, patients are monitored and their pain level is recorded as severe, moderate, or mild. What are the degrees of freedom for a test to determine if there is an association between surgical method and pain level? (A) 2 (B) 3 (C)4 (D)6 (E) 12 271 272 * Chapter 11 Questions 4-7 refer to the following set of data: A group of AP Statistics students wanted to see if plain, peanut and almond M&Ms have the same color distribution. To test this, students took a random sample of each type of M&M and classified the candies in the sample by color. They plan to carry out a chi-square test to decide if there is evidence that the color distributions are not the same for the three types of M&Ms. Red Blue Yellow 20 18 15 Peanut 8 6 Almond 7 11 Plain Orange Brown 10 14 12 8 25 5 7 10 12 10 9 Green 4. What is the expected count for green peanut M&Ms? 25 (A) 47 25x47 (B) 47 59x47 (C) 59 59x47 (D) 207 25x47 IE) 207 5. What are the correct degrees of freedom for the appropriate test? (A) 206 (B) 18 (C) 10 CD) 7 (E) 4 6. What is an appropriate set of hypotheses? (A) H0: There is no association between the type of M&M and color. Ha: There is an association between the type of M&M and color. (B) H0: There is an association between the type of M&M and color. Ha: There is no association between the type of M&M and color. (C) H0: The color proportions are the same for all 3 types of M&Ms. Ha: The color proportions are not the same for all 3 types of M&Ms. (D) H0: Tne proportions of red M&Ms are the same for all 3 types of M&Ms. Ha: The proportions of red M&Ms are not the same for all 3 types of M&Ms. (E) Ho: There is no difference in the type of M&M for green M&Ms. Ha: There is a difference in the type of M&M for green M&Ms. The Analysis of Categorical Data and Goodness-of-Fit Tests *• 273 7. The value of the chi-square test statistic is 22.9. What conclusion would be reached in a test of the hypotheses of interest using a significance level of 0.05? (A) Fail to reject H0 and conclude there is strong evidence of an association. (B) Fail to reject H0 and conclude there is strong evidence that the color distributions are the same. (C) Fail to reject H0 and conclude there is not strong evidence that the color distributions are the same. (D) Reject H0 and conclude there is strong evidence of an association. (E) Reject H0 and conclude there is strong evidence that the color distributions are not the same. 8. A cupcake store manager believes that half of the cupcakes sold are chocolate and that the other half is equally divided between vanilla and cherry. If the manager's belief is correct and 300 cupcakes are sold, how many would you expect to be cherry cupcakes? (A) 300 (B) 200 (C) 150 (D) 100 (E) 75 9. Each person in a random sample of patrons of a local mall was surveyed regarding a public smoking area outside one of the mall entrances. Each person was asked if they approved of the idea of a public smoking area in the mall. The resulting data is summarized in the table below. The mall management would like to know if there is a relationship between gender and approval of the smoking area. What would be an appropriate set of hypotheses? Public Smokers Approve Do Not Approve Males 28 57 Females 39 31 (A) H0: Gender and approval are not independent. Ha: Gender and approval are independent. (B) H0: Gender and approval are independent. Ha: Gender and approval are not independent. (C) H0: Knowing a person does not approve of a public smoking area indicates their gender. Ha: Knowing a person does not approve of a public smoking area does not indicate their gender. (D) H0: There is no difference in gender distribution based on approval. Ha: There is a difference in gender distribution based on approval. (E) H0: There is an association between gender and approval. Ha: There is no association between gender and approval. ' 274 * Chapter 11 10. A local bagel shop makes six types of bagels and eight types of cream cheese toppings. Suppose that all customers order one type of bagel and one type of cream cheese. If customers are classified according to the type of bagel and type of cream cheese, how many degrees of freedom would be appropriate for a test to determine if there is an association between type of bagel and type of cream cheese purchased? (A) 7 (B) 12 (C) 14 (D) 35 (E) 48 11. In a recent opinion poll, likely voters were asked if they would continue to support health care reform if the cost per taxpaying citizen was increased by $800 per year. Political preference was also recorded. The P-value for a chi-square test of independence is 0.001. Which of the following is a correct interpretation of this result? Democrat Republican Independent Libertarian Support 125 87 99 48 Don't Support 75 113 101 52 (A) Since P = 0.001, there is strong evidence that political preference and support are independent of each other. In other words, knowing their political preference gives no insight into their support of the increase. (B) Since P = 0.001, there is strong evidence that political preference and support are not independent of each other. In other words, knowing their political preference gives insight into their support of the increase. (C) Since P = 0.001, there is strong evidence that political preference and support are independent of each other. Since they are independent, there is no association between the two variables. (D) Since P = 0.001, there is insufficient evidence that political preference and support are independent of each other. Since they are not independent, there is no association between the two variables. (E) Since P = 0.001, there is insufficient evidence that political preference and support are not independent of each other. In this case, knowing their political preference would not help in knowing their support. The Analysis of Categorical Data and Goodness-of-Fit Tests 12. Two different cereal companies each make a bran cereal, a corn cereal, and a rice cereal. Each person in a random sample of 300 potential customers tasted all six cereals and selected their favorite. The choice was then classified according to type of cereal and the company that made the cereal. The data is shown in the table below. What type of test would be appropriate? Cereal 1 Cereal 2 Cereal 3 Company A 51 48 47 Company B 43 65 46 (A) Chi-sguare goodness-of-fit test (B) Chi-sguare test of independence (C) Chi-sguare test of homogeneity (D) Either a chi-sguare test for independence or homogeneity is appropriate (E) A chi-square test is not appropriate in this situation 13. Which is not true for a chi-sguare test of independence? I. The test is based on bivariate categorical data. II. df is calculated using k - 1, where k = the number of categories. III. The test is an upper-tailed test. (A) I, II, and III (B) I and II (C) II and HI (D) I only (E) II only Questions 14-15 refer to the following set of data: A movie theater recorded the type of snack item purchased and the type of movie the customer was attending for each person in a random sample of theater customers who made a snack bar purchase. The resulting data is given in the table below. Soda Popcorn Candy Children's Movie 70 83 47 Action Movie 61 49 20 14. If the manager wanted to determine if there is an association between the type of movie and type of snack sold, which type test would be appropriate? (A) Chi-sguare goodness-of-fit (B) Chi-sguare test for homogeneity (C) Chi-sguare test for independence (D) Two-sample proportions test (E) Multiple sample proportions test 275 276 * Chapter 11 15. Using the data in the table above, the value of the chi-square statistic is 5.66. Which of the following is a correct statement about the P-value for this test? (A) P-value > 0.10 (B) 0.05 < P-value < 0.10 (C) 0.01 < P-value < 0.05 (D) 0.001 < P-value < 0.01 (E) P-value < 0.001 FREE-RESPONSE PROBLEMS 1. Regina is worried that the color of her new cardigan will attract the attention of killer bees in southern California where she is going to hike. To settle her nerves she looks at the American Killer Bee Association website. It shows that these bees are highly agitated by various colors. They have found that 75% of bees are agitated by green, 9% by blue, 6% by purple or pink, and the remaining 10% by other colors. (a) In a random sample of 200 killer bees, how many would you expect to be agitated by each color? (b) A recent study of 120 randomly selected people stung by killer bees last year found the individuals were wearing the colors shown in the table below. Do these data provide convincing evidence that the color distribution of colors worn by people stung by killer bees is different from the percentages given on the website? Color Worn By Individual Stung By Killer Bees Green Purple/Pink Other Blue 86 21 6 7 2. A restaurant offers both dine-in and take-out service. Customers can pay for their meal in cash, by credit card or by debit card. The restaurant owner wonders if there is an association between the method of payment and the type of service. To investigate, a random sample is selected from the orders placed during the last year and the method of payment and the type of service is recorded for each of these orders. The data is summarized in the table below. Cash Credit Debit Dine-in 34 122 32 Take-out 70 95 47 (a) Should the restaurant owner carry out a test of homogeneity or a test of independence to answer his question? (b) Carry out a test to answer the question of interest to the owner. Use a significance level of 0.05 for your test. The Analysis of Categorical Data and Goodness-of-Fit Tests »> 277 Answers MULTIPLE-CHOICE QUESTIONS 1. D. The calculation would be found as follows: (Introduction to Statistics & Data Analysis 3rd ed. pages 647656/4th ed. pages 700-708). 2. E. The x2 distribution is right skewed with all of its area associated with positive values. The mean of a chi-square distribution can't be 0 (Introduction to Statistics & Data Analysis 3rd ed. pages 647-656/4th ed. pages 700-708). 3. D. There are four surgical methods and three pain levels. d f = ( r - l ) ( c - l ) = ( 4 - l ) ( 3 - l ) = 6 (Introduction to Statistics & Data Analysis 3rd ed. pages 660-671/4th ed. pages 711-722). 4. D. The expected counts in a two-way table are found using (row total x column total) ,r . „. - (Introduction to Statistics & Data table total Analysis 3rd ed. pages 660-671/4th ed. pages 711-722). 5. C. In a two-way table, degrees of freedom are calculated by (r - l)(c - 1) = (3 - 1)(6 - 1) = 10 (Introduction to Statistics & Data Analysis 3rd ed. pages 667-671/4th ed. pages 711-722). 6. C. This is a test of homogeneity. Three populations are being compared on the basis of a categorical variable (color) (Introduction to Statistics & Data Analysis 3rd ed. pages 660671/4th ed. pages 711-722). 7. E. x2 = 22.9 and df = 10, gives a P-value = 0.01 which is less than 0.05. This means there is strong evidence to conclude that the color distributions are not all the same (Introduction to Statistics & Data Analysis 3rd ed. pages 660-671/4th ed. pages 711-722). 8. E. The expected count is calculated is 300(0.25) = 75 (Introduction to Statistics & Data Analysis 3rd ed. pages 647-656/4th ed. pages 700-708). 9. B. A test of independence between the variables would be used to answer the question of interest (Introduction to Statistics & Data Analysis 3rd ed. pages 660-671/4th ed. pages 711-722). 10. D. (6 - 1)(8 - 1) = 5(7) = 35 (Introduction to Statistics & Data Analysis 3rd ed. pages 660-671/4th ed. pages 711-722). 278 * Chapter 11 11. B. The P-value is smaller than the significance level, so the null hypothesis of independence would be rejected. This means there is an association and knowing people's political party provides information about their support of this issue (Introduction to Statistics & Data Analysis 3rd ed. pages 660-671/4th ed. pages 711722). 12. B. Because a single sample is classified on the basis of two categorical variables, the appropriate test is a test for independence (Introduction to Statistics & Data Analysis 3rd ed. pages 660-671/4th ed. pages 711-722). 13. E. The degrees of freedom for a goodness-of-fit test is calculated using k-1. However, for a chi-square test of independence, degrees of freedom is calculated using (r - l)(c - 1) (Introduction to Statistics & Data Analysis 3rd ed. pages 660-671/4th ed. pages 711722). 14. C. This is a test of independence since we are trying to see if knowing the type of movie provides any information about type of snack purchased (Introduction to Statistics & Data Analysis 3rd ed. pages 660-671/4th ed. pages 711-722). 15. B. With df = 2, the P-value is 0.059 (Introduction to Statistics & Data Analysis 3rd ed. pages 660-671/4th ed. pages 711-722). FREE-RESPONSE PROBLEMS 1. (a) The expected counts by color for 200 bees would be: Green: 200(0.75) = 150, Blue: 200(0.09) = 18, Purple/Pink: 200(0.06) = 12, Other: 200(0.10) = 20. (b) Hypotheses H0 : The proportions of bee stings by color is as specified on the web site (pgreen = 0.75, pwue = 0.09, ppuvle/pmk = 0.06, pother = 0.10) Ha : At least one of the color proportions is different from what is specified inthe null hypothesis a = 0.05 Test: Chi-square goodness-of-fit test Assumptions 1. The problem states that the individuals in the sample were randomly selected. 2. The expected counts are shown in the table below (note that the sample size is 150, not 200 as in part (a). All expected counts are greater than 5, so the sample size is large enough. The Analysis of Categorical Data and Goodness-of-Fit Te5t5 * 279 Expected Counts for 15O Stings Green Blue Purple/Pink Other (90.0) (10.8) (7.2) (12.0) Test Statistic X2 = 12.09, df = 3, P = 0.007 Conclusion Since 0.007 < 0.05, there is strong evidence that to reject the null hypothesis. Therefore, there is reason to say the proportions of bee stings by killer bees are not all the same as reported on the website (Introduction to Statistics & Data Analysis 3rd ed. pages 647656/4th ed. pages 700-708). 2. (a) This is a test of independence. There was only one sample and each individual in the sample was classified according to two categorical variables, (b) Hypotheses H0 : method of payment and order type are independent. Ha : method of payment and order type are not independent. a = 0.05 Assumptions (1) The sample was a random sample of orders. (2) The expected counts are given in the table below. All expected counts are greater than 5, so the sample size is large enough. Expected Counts Cash Credit Debit Pine-in $48.88 $101.99 $37.13 Take-out $55.12 $115.01 $41.87 Test Statistic (34-48.88) (122-101.99)2 + ... = 17.291, withdf = 2 101.99 48.88 Conclusion With a test statistic so extreme, our P-value = 0.000. This is smaller than any reasonable significance level, so we reject the null hypothesis. There is strong evidence that there is an association between method of payment and order type. 2 X = (Introduction to Statistics & Data Analysis 3rd ed. pages 660671/4th ed. pages 711-722).