Chapter 3 What Do Samples Tell Us? Chapter 3 1 Thought Question 1 During a medical exam, the doctor measures your cholesterol two times. Do you think both measurements would be exactly the same? Why or why not? Chapter 3 2 Thought Question 2 To estimate the percentage of all adults who have an Internet connection in their homes, a properly chosen sample of 1100 adults across the U.S. was sampled, and 60% said “yes”. How close do you think that is to the percentage of the entire country who have an Internet connection? Within 30%? 10%? 5%? 1%? Exactly the same? Chapter 3 3 Sampling Terminology • Parameter – fixed, unknown number that describes the population – A parameter is a fixed number, but in practice we don’t know its value • Statistic – known value calculated from a sample – It can change from sample to sample – a statistic is used to estimate a parameter • Bias – in repeated samples, the sample statistic consistently misses the population parameter in the same direction • Variability – different samples from the same population may yield different values of the Chapter sample statistic 3 4 Figure 3.3 Bias and variability in shooting arrows at a target. Bias means the archer systematically misses in the same direction. Variability means that the arrows are scattered. Sampling Strategy • A good sampling method has both small bias and small variability. • To reduce bias, use random sampling. • To reduce variability, use larger samples – estimates from random samples will be closer to the true values in the population if the samples are larger – how close will they be? • margin of error Chapter 3 6 Proportions • The proportion of a population that has some outcome (“success”) is p. • The proportion of successes in a sample is measured by the sample proportion: number of successes in the sample p̂ = total number of observations in the sample “p-hat” Chapter 30 7 Example: Same-sex marriages • In 2004 President Bush made the declaration: “Today I call upon the Congress to promptly pass, and send to the states for ratification, an amendment to our Constitution defining and protecting marriage as a union of a man and woman as husband and wife.” • A Gallup Poll conducted from July 2003 to February 2004 asked the following question: • “Would you favor or oppose a constitutional amendment that would define marriage as being between a man and a woman, thus barring marriages between gay and lesbian couples?” Chapter 3 8 Example: Same-sex marriages • Gallup Poll found out that such an amendment is supported by a very slim majority of Americans, 51%, with 45% opposed. • Gallup talked with 2527 randomly selected adults to reach these conclusions. • How can 2527 people tell us about the opinions of the 307 million people? • A sample won’t tell us the truth about the population, so Gallup gives a margin of error. Here is Gallup’s claim: “For results based on a sample of this size, one can say with 95% confidence that the error attributable to sampling and other random effects could be plus or minus 2 percentage points.” Chapter 3 9 Margin of Error • The amount by which the proportion obtained from the sample ( p̂ ) will differ from the true population proportion (p) rarely exceeds the margin of error. Chapter 3 10 What Margin of Error means “Margin of Error plus or minus two percentage points is shorthand for the statement: If we took many samples using the same method we used to get this one sample, 95% of the samples would give a result within plus or minus 2 percentage points of the truth about the population. Chapter 3 11 Chapter 3 12 Margin of Error Chapter 3 13 Margin of Error and Sample Size Chapter 3 14 Figure 3.1 The results of many SRSs have a regular pattern. Here, we draw 1000 SRSs of size 100 from the same population. The population proportion is p = 0.5. The sample proportions vary from sample to sample, but their values center at the truth about the population. Figure 3.2 Draw 1000 SRSs of size 2527 from the same population as in Figure 3.1. The 1000 values of the sample proportion are much less spread out than was the case for smaller samples. Confidence statements A confidence statement has two parts: a margin of error and a level of confidence. The margin of error says how close the sample statistic lies to the population parameter. The level of confidence says what percentage of all possible samples satisfy the margin of error. Chapter 3 17 Population size doesn’t matter The variability of a statistic from a random sample does not depend on the size of the population as long as the population is at least 100 times larger than the sample. Chapter 3 18 Case Study Guaranteed Health Insurance in the U.S.? New York Times/CBS News Poll, January 2006 62% say it should be guaranteed by the government. same as in 2000, up 6 points from 1996 31% say it is not the responsibility of the government. Chapter 3 19 Case Study How the Poll was Conducted This New York Times/CBS News poll was based on telephone interviews conducted January 20 through January 25, 2006 with 1,229 adults throughout the United States. The survey has a random sampling error of approximately ±3 percent. Chapter 3 20 Case Study Conclusion (Confidence statement) For the proportion of the population who favor guaranteed health insurance, the sample proportion was = .62 (62%) and the margin of p̂ error was ±.03 (3%). We can then say that: “we are 95% confident that the proportion of the population who favor guaranteed health insurance was between .59 and .65 (59% and 65%).” Chapter 3 21 Key Concepts • • • • Parameter versus Statistic Bias and Variability Margin of Error Confidence Statements Chapter 3 22 Exercise 3.13 Which data have high/low bias, high/low variability? Chapter 3 23 Exercises 3.3-3.6 • Determine whether the highlighted number is a parameter or a statistic. • The Bureau of Labor Statistics announces that last month it interviewed all members of the labor force in a sample of 55,000 households; 4.7% of the people were unemployed. • A carload lot of ball bearings has an average diameter of 2.503 centimeters (cm). This is within the specifications for acceptance of the lot by the purchaser. The inspector happens to inspect 100 bearings from the lot with an average diameter of 2.515 cm. This is outside the specified limits, so the lot is mistakenly rejected. • Voter registration records show that 15.5% of all voters in Philadelphia are registered as Republicans. However, a radio talk show host in Philadelphia found that of 20 local residents who called the show recently, 60% were registered Republicans. • A national polling organization uses a random digit dialing device to dial residential phone numbers in the U.S. Of the first 100 numbers dialed, 43 are unlisted numbers. This is not surprising, because 40% of all residential phones in the U.S. are unlisted. Chapter 3 24 Exercise 3.9 • Canada’s national health care. The Ministry of Health in the Canadian province of Ontario wants to know whether the national health care system is achieving its goals in the province. Much information about health care comes from patient records, but that resource doesn’t allow us to compare people who use health services with those who don’t. So the ministry of Health conducted the Ontario Health Survey, which interviewed a random sample of 61,239 people who live in the province of Ontario. a) What is the population for this sample survey? What is the sample? b) The survey found that 76% of males and 86% of females in the sample had visited a general practitioner at least once in the past year. Do you think these estimates are close to the truth about the entire population? Why? Chapter 3 25 Exercise 3.11 Sampling variability. In thinking about Gallup’s sample of size 2527, we asked “Could it happen that one random sample finds that 51% of adults favor an amendment and a second random sample finds that only 37% favor one? ”. Look at the following figure which shows the results of 1000 samples of this size when the population truth is p=0.5 or 50%. Would you be surprised if a sample from this population gave 51%? Would you be surprised if a sample gave 37%? Chapter 3 26 Exercise 3.14 Is a larger sample size always better? In February 2004, USA Today conducted an online poll. Visitors to their website were asked the following question: “Should the U.S. pass a constitutional amendment banning gay marriage?” Visitors could vote by clicking a button. The results as of 3:30 PM on February 25 were that 68.61% voted “No” and 31.39% voted “Yes” A total of 63,046 votes had been recorded. Using our quick method, we find that the margin of error for 95% confidence for a sample of this size is roughly equal to four-tenths of one percentage points. Is it correct to say that, based on this USA Today online poll, we are 95% confident that 68.61 plus or minus 0.4% of American adults are opposed to having the U.S. pass a constitutional amendment banning gay marriage? Chapter 3 27 Exercise 3.22 The death penalty. The Gallup Poll asked a sample of 1010 adults, “Are you in favor of the death penalty for a person convicted of murder?” The proportion who said in favor was 69%. a) How many of the 1010 people interviewed said they were in favor of the death penalty for person convicted of murder? b) Gallup says that the margin of error for this poll is plus or minus 3 percentage points. Explain to someone who knows no statistics what “margin of error plus or minus 3 percentage points” means. Chapter 3 28 Exercise 3.24, and 3.28 Find the margin of error. Exercise 3.22 concerns a Gallup Poll sample of 1010 people. a)Use the quick method to estimate the margin of error for statements about the population of all adults. b)Is your result close to the 3% margin of error announced by Harris? c)Suppose you want a margin of error half as large as the one you found in (a). How many people must you interview? Chapter 3 29 Exercise 3.29 Satisfying the Congress. Exercise 3.12 describes a sample survey of 1004 adults, with margin of error plus or minus 4 percentage points for 95% confidence. a) A member of the Congress thinks that 95% confidence is not enough. He wants to be 99% confident. How would the margin of error for 99% confidence based on the sample compare with the margin of error for 95% confidence. b)Another member of the Congress is satisfied with 95% confidence, but she wants a smaller margin of error than plus or minus 4 percentage points. How can we get a smaller margin of error, still with 95% confidence? Chapter 3 30