AP Statistics Frappy Packet for End-Of-Year Graded Homework Assignments Page |1 This Frappy Packet Belongs to _______________________________________. 1997-4- Unfortunate Remarks (2Prop) A random sample of 415 potential voters was interviewed 3 weeks before the start of a state-wide campaign for governor; 223 of the 415 said they favored the new candidate over the incumbent. However, the new candidate made several unfortunate remarks one week before the election. Subsequently, a new random sample of 630 potential voters showed that 317 voters favored the new candidate. Do these data support the conclusion that there was a decrease in voter support for the new candidate after the unfortunate remarks were made? Give appropriate statistical evidence to support your answer. Also include a confidence interval and explain how it supports your answer. I. Hypothesis P1: True proportion of voters who supported the new candidate before the unfortunate remarks P2: True proportion of voters who supported the new candidate after the unfortunate remarks Ho: P1=P2 The true proportion of voters who supported the new candidate were the same before and after the remarks. Ha: P1>P2 made. II. III. IV. The true proportion of voters supported the new candidate was smaller after the remarks were Conditions I. P1: SRS of independent voters selected 𝑛𝑝 = 223 ≥ 10 𝑛𝑞 = 415 − 223 = 192 ≥ 10 II. P2: SRS of independent voters selected 𝑛𝑝 = 317 ≥ 10 𝑛𝑞 = 630 − 317 = 313 ≥ 10 III. Assume the voters polled before the unfortunate remarks are independent of the voters polled after the unfortunate remarks 2-Prop Z Test z=1.081722368; p-value= 0.1396879905 Conclusion We do not reject Ho because our p=value (.140) is large at any acceptable significance level. We do not have enough evidence to support the claim that the true proportion of voters supported the new candidate was smaller after the remarks were made. V. Confidence Interval: 2-Prop z-interval (−.0277, .09603) We are 95% confident that the true difference in proportion of voters who support the new candidate is between 2.77% less and 9.603% more before the unfortunate remarks were made. This supports our conclusion to not reject Ho since 0 is in our confidence interval. AP Statistics Frappy Packet for End-Of-Year Graded Homework Assignments Page |2 2003B Vitamin C (2Prop) A study was conducted to determine if taking vitamin C reduces the occurrence of the flu. The study was conducted using 808 student volunteers who did not take a flu shot. The subjects were randomly assigned to one of two groups: a treatment group who received 1,000 milligrams of vitamin C daily or a control group who received a placebo flavored to taste like the vitamin C treatment. All participants were monitored to ensure that they adhered to their assigned treatment on a daily basis throughout the period of the study. At the end of the flu season, each subject’s medical record was reviewed by a physician to determine whether he or she had contracted the flu during the period of the study. The physician did not know which treatment each subject received. The results of the study are shown in the table to the right. Based on this study, a health expert claims that there is evidence to suggest that vitamin C reduces the occurrence of the flu in the population of students who would volunteer for such a study. Do these data support this conclusion? Give appropriate statistical evidence to support your answer. Also include a confidence interval and explain how it supports your answer. I. Hypothesis PC: True proportion of student volunteers who took Vitamin C and got the flu. PP: True proportion of student volunteers who did not take Vitamin C (took a placebo) and got the flu. Ho: PC=PP The true proportion of students who got the flu were the same whether they took Vitamin C or not. Ha: PC<PP The true proportion of student who got took Vitamin C and got the flu was smaller than the true proportion of student who did not take Vitamin C and got the flu. II. Conditions a. b. c. III. IV. PC: SRS of independent volunteer students selected 𝑛𝑝 = 302 ≥ 10 (𝑔𝑜𝑡 𝑓𝑙𝑢) 𝑛𝑞 = 101 ≥ 10(𝑑𝑖𝑑 𝑛𝑜𝑡 𝑔𝑒𝑡 𝑓𝑙𝑢) P2: SRS of independent volunteer students selected 𝑛𝑝 = 331 ≥ 10 (𝑔𝑜𝑡 𝑓𝑙𝑢) 𝑛𝑞 = 74 ≥ 10 (𝑑𝑖𝑑 𝑛𝑜𝑡 𝑔𝑒𝑡 𝑓𝑙𝑢) Students could not be in both groups (taking Vitamin C and not taking Vitamin C) 2-Prop Z Test z=-2.342945031; p-value= 0.0095660811 Conclusion We reject Ho because our p=value (.0096) is small at any acceptable significance level. We have enough evidence to support the claim that the true proportion of student who got took Vitamin C and got the flu was smaller than the true proportion of student who did not take Vitamin C and got the flu. V. Confidence Interval: 2-Prop z-interval (−.1245, −.0113) We are 95% confident that the true difference in proportion of students who took Vitamin C and got the flu was between 12.45% and 1.13% less than students who did not take Vitamin C and got the flu. This supports our conclusion to reject Ho since 0 is not in our confidence interval. AP Statistics Frappy Packet for End-Of-Year Graded Homework Assignments Page |3 2005-4 Cereal Boxes (1Prop) Some boxes of a certain brand of breakfast cereal include a voucher for a free video rental inside the box. The company that makes the cereal claims that a voucher can be found in 20 percent of the boxes. However, based on their experiences eating this cereal at home, a group of students believes that the proportion of boxes with vouchers is less than 0.2. This group of students purchased 65 boxes of the cereal to investigate the company’s claim. The students found a total of 11 vouchers for free video rentals in the 65 boxes. Suppose it is reasonable to assume that the 65 boxes purchased by the students are a random sample of all boxes of this cereal. Based on this sample, is there support for the students’ belief that the proportion of boxes with vouchers is less than 0.2? Provide statistical evidence to support your answer. Also include a confidence interval and explain how it supports your answer. I. Hypothesis P= True proportion of cereal boxes containing a free video rental voucher Ho: P=0.2 The true proportion cereal boxes containing a free video rental voucher is 20% Ha: P<0.2 The true proportion cereal boxes containing a free video rental voucher is less than 20% II. Conditions a. SRS (stated) of independent (assumed) boxes b. 𝑛𝑝 = 11 ≥ 10 (𝑖𝑛𝑐𝑙𝑢𝑑𝑒𝑑 𝑣𝑜𝑢𝑐ℎ𝑒𝑟𝑠) 𝑛𝑞 = 65 − 11 = 54 ≥ 10 (𝑑𝑖𝑑 𝑛𝑜𝑡 𝑖𝑛𝑐𝑙𝑢𝑑𝑒 𝑣𝑜𝑢𝑐ℎ𝑒𝑟𝑠) III. 1-Proportion Z Test z=-0.6201736729 p-value=.2675716664 IV. Conclusion We do not reject Ho because our p=value (.268) is large at any acceptable significance level. We do not have enough evidence to support the claim that the true proportion of cereal containing vouchers is less than 20%. V. Confidence Interval: 1-Prop Z-interval (0.07808, 0.26038) I am 95% confident that the true proportion of cereal boxes that contain a free video rental voucher is between 7.808% and 26.038%. This supports our conclusion not to reject Ho since 20% is within this interval. AP Statistics Frappy Packet for End-Of-Year Graded Homework Assignments Page |4 1998-5 Grad Students (1Prop) A large university provides housing for 10 percent of its graduate students to live on campus. The university’s housing office thinks that the percentage of graduate students looking for housing on campus may be more than 10 percent. The housing office decides to survey a random sample of graduate students, and 68 of the 481 respondents say that they are looking for housing on campus. On the basis of the survey data, is there enough evidence for housing office to consider increasing the amount of housing on campus available to graduate students? Give appropriate evidence to support your recommendation. Also include a confidence interval and explain how it supports your answer. I. Hypothesis P= True proportion of graduate students looking for housing on campus Ho: P=0.1 The true proportion of graduate students looking for housing on campus is 10% Ha: P>0.1 The true proportion of graduate students looking for housing on campus is more than 10% II. Conditions a. SRS (stated) of independent (assumed) graduate students b. 𝑛𝑝 = 68 ≥ 10 (𝑙𝑜𝑜𝑘𝑖𝑛𝑔 𝑓𝑜𝑟 ℎ𝑜𝑢𝑠𝑖𝑛𝑔) 𝑛𝑞 = 481 − 68 = 413 ≥ 10 (𝑎𝑟𝑒 𝑛𝑜𝑡 𝑙𝑜𝑜𝑘𝑖𝑛𝑔 𝑓𝑜𝑟 ℎ𝑜𝑢𝑠𝑖𝑛𝑔) III. 1-Proportion Z Test z=3.0245396459 p-value=.0012451283 IV. Conclusion We reject Ho because our p=value (.0012) is small at any acceptable significance level. We have enough evidence to support the claim that the true proportion of graduate students seeking housing is greater than 10% V. Confidence Interval: 1-Prop Z-interval (.11024, .17251) I am 95% confident that the true proportion of graduate students looking for housing on campus is between 11.024% and 17.251%. This supports our conclusion to reject Ho since 10% is not contained in (and is less than) this interval. AP Statistics Frappy Packet for End-Of-Year Graded Homework Assignments Page |5 2001-5 Generics (Paired Difference-Data) A growing number of employers are trying to hold down the costs that they pay for medical insurance for their employees. As part of this effort, many medical insurance companies are now requiring clients to use generic brand medicines when filling prescriptions. An independent consumer advocacy group wanted to determine if there was a difference, in milligrams, in the amount of active ingredient between a certain “name” brand drug and its generic counterpart. Pharmacies may store drugs under different conditions. Therefore, the consumer group randomly selected ten different pharmacies in a large city and filled two prescriptions at each of these pharmacies, one for the “name” brand and the other for the generic brand of the drug. The consumer group’s laboratory then tested a randomly selected pill from each prescription to determine the amount of active ingredient in the pill. The results are given in the following table. Based on these results, what should the consumer group’s laboratory report about the difference in the active ingredient in the two brands of pills? Give appropriate statistical evidence to support your response. Also include a confidence interval and explain how it supports your answer. I. Hypothesis 𝜇𝑑 : True difference in the average amount of active ingredient in the name brand verses generic brand Ho: 𝜇𝑑 = 0 There is no difference in the average amount of active ingredient in the name brand verses generic brand. Ha: 𝜇𝑑 ≠ 0 There is a difference in the average amount of active ingredient in the name brand verses generic brand. II. Conditions SRS of independent Pharmacies and SRS of independent pills for each prescription Nearly Normal: (Name Brand-Generic)* (Generic – Name Brand) III. Paired Difference Test (T-test) t=3.956835797 p-value=.003320146 IV. Conclusion We reject Ho because our p=value (.0033) is small at any acceptable significance level. We have enough evidence to support the claim that there is a difference in the amount of active ingredient in the name brand verses generic brand. V. Confidence Interval- T-Interval (2.8267, 10.373) I am 95% confident that the true difference in means of the active ingredient in generic verses name brand medicine is between 2.8267 and 10.373 milligrams. This supports our conclusion to reject Ho since 0 milligrams is not in our confidence interval. AP Statistics Frappy Packet for End-Of-Year Graded Homework Assignments Page |6 2005B #4 Tomato Plans (Paired Difference-Statistics) A researcher believes that treating seeds with certain additives before planting can enhance the growth of plants. An experiment to investigate this is conducted in a greenhouse. From a large number of Roma tomato seeds, 24 seeds are randomly chosen and 2 are assigned to each of 12 containers. One of the 2 seeds is randomly selected and treated with the additive. The other seeds serves as a control. Both seeds are then planted in the same container. The growth, in centimeters, of each of the 24 plants is measured after 30 days. These data were used to generate the partial computer output shown below. Graphical displays indicate that the assumption of normality is not unreasonable. Is there sufficient evidence to conclude that there is a significant mean difference in growth of the plants from untreated seeds and the plants from treated seeds? Give appropriate statistical evidence to support your response. Also include a confidence interval and explain how it supports your answer. I. Hypothesis 𝜇𝑑 : True difference in the average growth of tomatoes treated with additive and tomatoes not treated Ho: 𝜇𝑑 = 0 There is no difference in the average growth of tomatoes treated with additive and tomatoes not treated. Ha: 𝜇𝑑 ≠ 0 There is a difference in the average growth of tomatoes treated with additive and tomatoes not treated. II. Conditions SRS of independent seeds, seed given additive was randomly chosen. Container for each pair of seeds was randomly chosen. Nearly normal condition stated. III. Paired Difference Test (T-test) t=-6.001861354 p-value=.000089023026 IV. Conclusion We reject Ho because our p=value (.000089) is very small at any acceptable significance level. We have enough evidence to support the claim that there is a difference in the average growth of tomatoes treated with additive and tomatoes not treated. V. Confidence Interval- T-Interval (−2.754, −1.276) I am 95% confident that the true difference in average growth in tomato plants treated with an additive is between 1.276 cm and 2.754 cm more than those not treated with an additive. This supports our conclusion to reject Ho since 0 cm is not in our confidence interval. AP Statistics Frappy Packet for End-Of-Year Graded Homework Assignments Page |7 2006-4 Wait Time (2 Means-Statistics) Patients with heart-attack symptoms arrive at and emergency room either by ambulance or self-transportation provided by themselves, family, or friends. When a patient arrives at the emergency room, the time of arrival is recorded. The time when the patient’s diagnostic treatment begins is also recorded. An administrator of a large hospital wanted to determine whether the mean wait time (time between arrival and diagnostic treatment) for patients with heart-attack symptoms differs according to the mode of transportation. A random sample of 150 patients with heart-attack symptoms who had reported to the emergency room was selected. For each patient, the mode of transportation and wait time were recorded. Summary statistics for each mode of transportation are show in the table below. Based on this set of information, is there evidence to support that the average wait time for people who transport themselves to the hospital is different than the average wait time for patients who arrive via ambulance? Give appropriate statistical evidence to support your response. Also include a confidence interval and explain how it supports your answer. Also include a confidence interval and explain how it supports your answer. I. Hypothesis 𝜇𝐴 = Average wait time for patients transported to the hospital by Ambulance 𝜇𝑆 = Average wait time for patients transported to the hospital by Self 𝐻𝑜 : 𝜇𝐴 = 𝜇𝑆 Average wait time for patients transported to the hospital is the same for either mode of transportation 𝐻𝑜 : 𝜇𝐴 ≠ 𝜇𝑆 Average wait time for patients transported to the hospital is different for either mode of transportation II. Conditions SRS of patients, independent of one another No patient arrived using both methods of transportation 𝑛 = 77 ≥ 30 for patients arriving by Ambulance 𝑛 = 73 ≥ 30 for patients arriving by Self (Do not need to assume the distribution of wait time for both modes of transportation to be approximately normal since sample is large enough) III. 2-Sample T-Test t=-2.905892397 p-value= .0042566391 IV. Conclusion: We reject Ho because our p=value (.0042566391) is small at any acceptable significance level. We have enough evidence to support the claim that the average wait time for patients transported to the hospital is different depending on the mode of transportation. V. Confidence interval: 2-Sample T-Interval (−3.798, −.77224) I am 95% confident that the true difference in average wait time for patients arriving by ambulance between .77224 and 3.798 minutes less than patients arriving by themselves. This support our conclusion since 0 is not in the interval. AP Statistics Frappy Packet for End-Of-Year Graded Homework Assignments Page |8 2007B-5 Cholesterol #1 (2 Means-Statistics) A serum cholesterol level above 250 milligrams per deciliter (mg/dl) of blood is a risk factor for cardiovascular disease in humans. At a medical center in St. Louis, a study to test the effectiveness of a new cholesterol lowering drug was conducted. One hundred people with cholesterol levels between 250 mg/dl and 300 mg/dl were available for this study. Fifty people were assigned at random to each of two treatment groups. One group received the standard cholesterollowering medication and the other group received the new drug. After taking the drug for three weeks, the 50 subjects who received the standard treatment had a mean decrease in cholesterol level of 10 mg/dl with a standard deviation of 8 mg/dl, and the 50 subjects who received the new drug had a mean decrease of 18 mg/dl with a standard deviation of 12 mg/dl. Does the new drug appear to be more effective than the standard treatment in lowering mean cholesterol level? Give appropriate statistical evidence to support your conclusion. Also include a confidence interval and explain how it supports your answer. I. Hypothesis 𝜇𝑆 = Average decrease in cholesterol level using the standard drug 𝜇𝑁 = Average decrease in cholesterol level using the new drug 𝐻𝑜 : 𝜇𝑠 = 𝜇𝑁 The average decrease in cholesterol level is the same using either drug 𝐻𝑎 : 𝜇𝑆 < 𝜇𝑁 The average decrease in cholesterol level is greater for the new drug than the standard drug II. Conditions Patients were distributed to each treatment group randomly. Patients were independent of each other within each group 𝑛 = 50 ≥ 30 for both groups No patient was in both groups (Do not need to assume the distribution of Cholesterol level is approximately normal since sample is large enough) III. 2-Sample T-Test t=-3.922322703 p-value= .000088356653 IV. Conclusion: We reject Ho because our p=value (.000088) is small at any acceptable significance level. We have enough evidence to support the claim that the new drug lowers cholesterol better on average than the standard drug V. Confidence Interval: 2-Sample T-Interval (−12.06, −3.945) I am 95% confident that the true difference in average decrease in cholesterol using the new drug is between 3.945 and 12.06 mg/dl more than the standard drug. This supports our conclusion since 0 is not in our confidence interval. AP Statistics Frappy Packet for End-Of-Year Graded Homework Assignments Page |9 2004-6 Cholesterol #2 (1 Mean-Statistics) A pharmaceutical company has developed a new drug to reduce cholesterol. A regulatory agency will recommend the new drug for use if there is convincing evidence that the mean reduction in cholesterol level after one month of use is more than 20 milligrams/deciliter (mg/dl), because a mean reduction of this magnitude would be greater than the mean reduction for the current most widely used drug. The pharmaceutical company collected data by giving the new drug to a random sample of 50 people from the population of people with high cholesterol. The reduction in cholesterol level after one month of use was recorded for each individual in the sample, resulting in a sample mean reduction and standard deviation of 24 mg/dl and 15 mg/dl, respectively. Does the new drug have enough convincing evidence that the mean reduction in cholesterol level after one month of use is more than 20 mg/dl? Give appropriate statistical evidence to support your conclusion. Also include a confidence interval and explain how it supports your answer. I. Hypothesis 𝐻𝑜 : 𝜇 = 20 The average decrease in cholesterol level is 20 mg/dl. 𝐻𝑎 : 𝜇 > 20 The average decrease in cholesterol level is greater than 20 mg/dl. II. Conditions Patients were chosen randomly for each group Patients can be reasonably assumed as independent of one another 𝑛 = 50 ≥ 30 (Do not need to assume the distribution of cholesterol levels are nearly normal because sample is large enough) III. 1-Sample T-Test t=1.885618083 p-value= .0326419275 IV. Conclusion: We reject Ho because our p=value (.0326) is small at the 5% significance level. We have enough evidence to support the claim that the new drug lowers cholesterol more than 20mg/dl on average than the standard drug. OR: We do not reject Ho because our p=value (.0326) is large at the 1% significance level. We do not have enough evidence to support the claim that the new drug lowers cholesterol more than 20mg/dl on average than the standard drug. V. Confidence Interval: 2-Sample T-Interval (20.443,27.557) OR: (18.898, 29.102) I am 90% confident that the true average decrease in cholesterol using the new drug is between 20.443 mg/dl and 27.557mg/dl. This supports our conclusion since 20 mg/dl is not in our confidence interval. OR: I am 98% confident that the true average decrease in cholesterol using the new drug is between 18.898 mg/dl and 29.103 mg/dl. This supports our conclusion since 20 is in our confidence interval. AP Statistics Frappy Packet for End-Of-Year Graded Homework Assignments P a g e | 10 1999-2 Lost Hikers (Chi-Squared Independence) The Colorado Rocky Mountain Rescue Service wishes to study the behavior of lost hikers. If more were known about the direction in which lost hikers tend to walk, then more effective search strategies could be devised. Two hundred hikers selected at random from those applying for hiking permits are asked whether they would head uphill, downhill, or remain in the same place if they became lost while hiking. Each hikers in the sample was also classified according to whether he or she was an experienced or novice hiker. The resulting data are summarized in the following table. Do these data provide convincing evidence of an association between the level of hiking expertise and the direction the hiker would head if lost? Give appropriate statistical evidence to support your conclusion. I. Hypothesis Ho: Response to getting lost in the woods is independent of experience level (no association) Ha: Response to getting lost in the woods is dependent of experience level (association) II. Conditions Categorical Data SRS of independent Hikers Expected frequency count for each cell is greater than 5 III. Chi-Squared Test of Independence Chi-Squared: 4.452 p-value=.107959242 IV. Conclusion: We do not reject Ho because our p=value (.1079) is large at the 5% significance level. We do not have enough evidence to support the claim that response to getting lost in the woods is dependent of experience level. AP Statistics Frappy Packet for End-Of-Year Graded Homework Assignments P a g e | 11 2003-5 Presidential Affairs (Chi-Squared Independence) A random sample of 200 students was selected from a large college in the United States. Each selected student was asked to give his or her opinion about the following statement. “The most important quality of a person who aspires to be the President of the United States is a knowledge of foreign affairs.” Each response was recorded in one of five categories. The gender of each selected student was noted. The data are summarized in the table below. Is there sufficient evidence to indicate that the response is dependent on gender? Provide statistical evidence to support your conclusion. I. Hypothesis Ho: Response to survey is independent of gender (no association) Ha: Response to survey is dependent of gender (association) II. Conditions Categorical Data SRS of students Expected frequency count for each cell is greater than 5 III. Chi-Squared Test of Independence Chi-Squared: 8.922558923 p-value=.0630645568 IV. Conclusion: We do not reject Ho because our p=value (.063) is large at the 5% significance level. We do not have enough evidence to support the claim that response to the survey is dependent on gender. AP Statistics Frappy Packet for End-Of-Year Graded Homework Assignments P a g e | 12 2008-5 Moose (Chi-Squared Goodness of Fit) A study was conducted to determine where moose are found in a region containing a large burned area. A map of the study area was partitioned into the following four habitat types. (1) Inside the burned area, not near the edge of the burned area, (2) Inside the burned area, near the edge, (3) Outside the burned area, near the edge, and (4) Outside the burned area, not near the edge. The figure to the right shows these four habitat types. The proportion of total acreage in each of the habitat types was determined for the study area. Using an aerial survey, moose locations were observed and classified into one of the four habitat types. The results are given in the table below. The researchers who are conducting the study expect the number of moose observed in a habitat type to be proportional to the amount of acreage of that type of habitat. Are the data consistent with this expectation? Conduct an appropriate statistical test to support your conclusion. Assume the conditions for inference are met. I. Hypothesis Ho: The number of moose observed in each habitat is consistent with the expected number based on acreage. Ha: The number of moose observed in each habitat is not consistent with the expected number based on acreage. II. Conditions Assume they are met (stated) III. Chi-Squared Goodness of Fit Test (To get expected, multiply Proportion of Total Acreage by 117) Chi-Squared: 43.68926643 p-value=.000000017568857 IV. Conclusion: We reject Ho because our p=value (.00000002) is very small at any acceptable significance level. We have enough evidence to support the claim that the number of moose observed in each habitat is not consistent with the expected number based on acreage.