Sec. 5.1 ~ Designing Samples Our goal in choosing a sample is to obtain a picture of the population that is disturbed as little as possible by the act of gathering information. Sample surveys are one kind of observational study. In other settings, we gather data from an experiment. Observation vs. Experiment An observational study observes individuals and measures variables of interest but does not attempt to influence the responses. An experiment, on the other hand, deliberately imposes some treatment on individuals in order to observe their responses. Both have important roles depending on the situation and the questions to be answered. See example 5.1 on p.270 Observational studies of the effect of one variable on another often fail because the explanatory variable is confounded with lurking variables. In some situations, it may not be possible to observe individuals directly or to perform an experiment. In other cases, it may be logistically difficult or simply inconvenient. Population and Sample The entire group of individuals that we want information about is called the population. A sample is a part of the population that we actually examine in order to gather information. Sampling involves studying __________________in order to ____________________________________ A census attempts to contact every individual in the entire population. The design of a sample refers to the method used to choose the sample from the population. Poor sample design can produce misleading conclusions. Voluntary Response Sample A voluntary response sample consists of people who choose themselves by responding to a general appeal. Voluntary response samples are ________________because people with____________________, especially________________________, are most likely to respond. See example 5.2 on p.272 Another type of bad sampling is convenience sampling, which chooses the individuals easiest to reach. See example 5.3 on p.272 Bias The design of a study is biased if it systematically favors certain outcomes. Choosing a sample by ______________________ attacks bias by ______________________________________ to be chosen. Simple Random Sampling A simple random sample (SRS) of size n consists of __________________________________ chosen in such a way that every set of n individuals ___________________________________________ An SRS not only gives each ____________________ an equal chance to be chosen, (thus avoiding bias in the choice) but also gives ______________________________an equal chance to be chosen. Not all random samples are simple random samples: If I have 10 names in a hat, the probability of any one name being drawn is 1 out of 10. After the first name is drawn, there are nine names left in the hat changing the probability of anyone being selected as the second name to be 1 out of 9. Since different names have different probabilities depending on the sequence in which the drawing is done, the resulting sample will not be a simple random sample. Random Digits A table of random digits is a long string of the digits 0,1,2,3,4,5,6,7,8,9 with the following two properties: 1. Each entry in the table is equally likely to be any of the 10 digits 0 through 9. 2. The entries ___________________________________. That is, knowledge of one part of the table gives no information about any other part. *Table B (in the back of your textbook) is a Random Digit Table Choose an SRS in two steps: 1. Table: Use Table B to select labels at random. 2. Label: Assign a numerical label to every individual in the population. See example 5.4 on p.276 A probability sample is a sample _________________________ We must know ___________________________and what ___________________________each possible sample has. Systematic sampling is a type of probability sampling in which the sampling starts by selecting an element from the list at random and then every kth element in the frame is selected to be part of the sample. The use of chance to select the sample is the essential principal of statistical sampling. Stratified Random Sample To select a stratified random sample, first ______________________________________________________ Then choose a separate ____________________________and combine these ____________________________ This method is usually used for sampling from large populations spread out over a wide area. Cluster Sampling With cluster sampling, the researcher _______________________________________________________ Then, an __________________________ is selected from the population and ____________________________ o For example, if a researcher is studying the attitudes of Catholic Church members surrounding the recent exposure of sex scandals in the Catholic Church, he or she might first sample a list of Catholic churches across the country. Let’s say that the researcher selected 50 Catholic Churches across the United States. He or she would then survey all church members from those 50 churches. Multistage Sampling A typical example of multistage sampling is Current Population Survey Sampling Design, which is conducted as follows: Stage 1: Divide the United States into 2007 geographical areas called Primary Sampling Units. Stage 2: Divide each PSU selected into smaller areas called “neighborhoods” using ethnic and other information and take a stratified sample of neighborhood. Stage 3: Sort the housing units in each neighborhood into clusters of four nearby units. Interview the households in a random sample of these clusters. This method saves time and money. Cautions about Sample Surverys We need a complete and accurate list of the population. Undercoverage occurs when some groups in the population are left out of the process of choosing the sample. See example 5.6 on p.281 Nonresponse occurs when an individual chosen for the sample can’t be contacted or does not cooperate. Response bias is_______________________________ , especially if asked about illegal or unpopular behaviors. An interviewer whose attitude suggests that some answers are more desirable than others will get these answers more often. The wording of questions is the most important influence on the answers given to a sample survey. See example 5.7 on p.282 Inference about the Population Using chance to choose a sample eliminates ______________ in the actual selection of the sample. Because we deliberately use chance, the results obey the laws of probability that govern chance behavior. Larger random samples give more accurate results than smaller samples.