Document 14323624

advertisement
Chapter 5 Review Watch the following video for a quick review of the sampling methods that we discussed this Chapter http://prezi.com/kibqe-­‐toeunr/13-­‐sampling-­‐techniques/ Also, review the random rectangles activity from this unit. Collecting Data
In order to better understand the characteristics of a population, statisticians and researchers often use a sample from that population and make inferences based on the summery results from the sample. Polling is an example of sampling from the population in order to get a better idea of the characteristics of a population. Because we make inferences about a population from the sample, it is very important that the sample is collected appropriately and that it is representative of the population being studied. The following is a list of possible sample designs and some of the advantages and disadvantages of each: 1) Convenience sampling – Uses subjects that are readily available. This type of sampling will be BIASED (judgmental sample from Random Rectangle activity) Advantage: Easy and less costly to collect Disadvantage: Not representative of the population Example: In order to get an idea of how students think of the new school policy, the principal stands outside the library and asks a few students their opinions. 2) Simple Random Sample (SRS) – consists of n individuals from the population chosen in such a way that every set of n individuals has an equal chance of being the sample actually selected. This is often the best and most appropriate way to collect data for a sample. Advantages – Easy to accomplish using a table of random digits; likely to produce samples that are good representatives of the population. Disadvantage – None (could be cost prohibited) Example: In order to determine how happy students are with their education at DHS, the principal assigns each student a number from 1 to 850 (the number of students at the school) and then uses a random number generator to choose 50 numbers between 1 and 850. He then surveys all the students with the chosen numbers. 3) Stratified random sampling – Divide the population into groups of similar individuals (strata) then select an SRS within each strata. Combine the SRSs from each strata to form your full sample. Advantage: Can produce more exact information (especially in large populations) by taking advantage of the fact that individuals in the same strata are similar to one another. Disadvantage: Not appropriate unless strata are easily defined. Example: In order to get a better idea of what DHS athletes thought about homecoming last year, the director divides all DHS athletes into the teams they play for, and then selects a random sample from each sports team. His full sample consists of aggregating the random samples form each team. Chapter 5 Review 4) Cluster Sampling– Divide the population into sections (clusters) then randomly choose a few of those clusters, and select every member of the clusters chosen. Advantage – Don’t need a list of entire population Disadvantage – More variability between samples depending on how clusters are determined. Example – A psychologist at the University of Pennsylvania collects a sample by first dividing up the students into their respective schools (Wharton, engineering, nursing, arts and sciences) then by the departments that their major is in, and then she selects a few departments at random and surveys every student within those chosen departments. 5) Systematic sampling – randomly select an arbitrary starting point, and then select every kth member of the population Advantage: Every member has an equal probability of being selected Disadvantage: Not every sample of size n has an equal chance of being selected Example: HP Selects every 200th computer off the assembly line and inspects it for quality control. 6) Multi-­‐Stage Sampling refers to a procedure involving two or more steps , each of which could involve any of the various sampling techniques. Example: The Gallup organization often follows a procedure in which nationwide locations are randomly selected, then neighborhoods are randomly selected in each of these locations and finally households are randomly selected in each of these neighborhoods. Identify which type of sampling method is used for questions #1-­‐7 1) 49, 34 and 48 students are selected from the sophomore, junior and senior classes with 496, 348 and 481 students respectively A) convenience B) simple random C) systematic D) stratified E) cluster 2) A sample consists of every 49th students from an ordered list of 496 students A) convenience B) simple random C) systematic D) stratified E) cluster 3) A market researcher randomly selects 500 drivers under 30 years of age and 500 drivers over 30 years of age A) convenience B) simple random C) systematic D) stratified E) cluster 4) A market researcher randomly selects 500 people from each of 10 cities A) convenience B) simple random C) systematic D) stratified E) cluster 5) A tax auditor selects every 1000th income tax return that is received A) convenience B) simple random C) systematic D) stratified E) cluster Chapter 5 Review 6) A pollster uses a computer to generate 500 random numbers, then interviews the voters that correspond to those numbers A) convenience B) simple random C) systematic D) stratified E) cluster 7) A education researcher randomly selects 48 middle schools and interviews all the teachers at each school A) convenience B) simple random C) systematic D) stratified E) cluster 8) An auto analyst is conducting a satisfaction survey, sampling from a list of 10,000 new car buyers. The
list includes 2500 Ford buyers, 2500 GM buyers, 2500 Honda buyers and 2500 Toyota buyers. The
analyst selects a sample of 400 car buyers by randomly selecting 100 buyers of each brand. Is this an
example of a simple random sample?
(A) Yes, because each buyer in the sample was randomly selected
(B) Yes, because each buyer in the sample had an equal chance of being sampled
(C) Yes, because car buyers of every brand were equally represented in the sample
(D) No, because every possible 400 buyer sample did not have an equal chance of being chosen
(E) No, because the population consisted of purchasers of four different brands of cars
9) You want to do a survey of members of the senior class at your school and want to select a simple
random sample. You intend to include 40 students in your sample. Identify the sampling method
described for each of the scenarios below:
(A) Write the names of each students in the senior class on a slip of paper and put the papers in a
container. Then randomly select 40 slips of paper from the container.
(B) Assuming that students are randomly assigned to classes, select two classes at random and include
those students in your sample.
(C) From a list of all seniors, select one of the first 10 names at random. Then select every nth name
on the list until you have 40 people selected.
(D) Select the first 40 seniors to pass through the cafeteria door at lunch
(E) Randomly select 10 students from each of the four senior calculus classes
SOLUTIONS:
1.) D
2.) C
3.) D
4.) D
9.) (A) simple random sample
(B) cluster sample
(C) systematic sample
5.) C
6.) B
7.) E
(D) convenience sample (biased!)
(E) stratified sample
8.) D
Chapter 5 Review Sources of Bias
Samples are biased if they are systematically not representative of the desired population. Under-coverage (Selection Bias): occurs when the way the sample is selected systematically excludes part
of the population .
Literary Digest Example (FDR vs. Alf Landon presidential race polls)
Non-response Bias: Occurs when an individual chosen for a sample can’t be contact or refuses to respond.
Non-response is a big problem in mail surveys.
Example: The DHS administration sends out 100 survey questions to a sample of DHS parents in order to
gage their attitudes toward the school. Only 23 surveys are returned. We have a non-response rate of 77%.
Voluntary Response Bias – samples based on individuals who offer to participate typically give too much emphasis to people with strong opinions. Advantage – Easy to collect Disadvantage – Over represents people with strong opinions. Example: radio call in programs about controversial topics such as gun control and abortion. Online surveys posted to websites are a modern example of voluntary response bias
Response Bias: Caused by the behavior of the respondent or the interviewer
Untruthful answers: people give untruthful answers for several reasons:
1) Sensitive questions
Example: Have you ever cheated on your spouse?
2) Socially acceptable answers
Example: Do you use corporal punishment with your children?
3) Telling the interviewer what he or she wants to hear.
Example: One year after the Detroit race riots of 1967, interviewers asked a sample of
black residents in Detroit if they felt they could trust most white people, some white
people, or none at all. When the interviewer was white, 35% answered "most"; when the
interviewer was black, 7% answered "most”
The fix: secret ballots, anonymous surveys, "sensitive question" techniques.
Chapter 5 Review Lack of memory: giving a wrong answer simply because respondent doesn’t remember the correct answer.
Example: Students were asked to report their grade point averages. Researchers then determined
the actual GPA's. Over 17% of the students reported a GPA that was .4 or more above their actual
average, and about 2% reported a GPA more than .4 below their actual GPA. (more inflated their
GPA's!)
Timing: When a survey is taken can have an impact on the answers.
Example: in January, the National Football League reported a poll that revealed football as the
nation's favorite sport (this is at the time of the Super Bowl)
Phrasing of questions: Subtle differences in phrasing make large differences in the results.
Example:
a) Should the president have the line-item veto to eliminate waste? 97% said “yes”
b) Should the president have the line item veto? 57% said “yes”
Sampling ERROR vs. Sampling BIAS
Sampling Error : The fact that we will get different results from sample to sample, and that no sample
perfectly mirrors the population. Sample to sample variability is expected and unavoidable
Example: Place 50 red and 50 green balls in a bag. Mix the balls thoroughly and randomly sample 30 balls.
In your sample you find that 12 balls are red and 18 are green. Your sample result (12:18 = 2:3) is different
than the true population ratio of 50:50 which is 1 to 1. This difference is due to sampling error. Virtually
any experiment involving a sample will have sampling error. We can minimize sampling error through
various statistical techniques; the most obvious is to increase the sample size.
Sampling Bias: Bias is about center and is a much more serious problem. Bias means that some flaw in
the way you are collecting your data will consistently make your samples off-target.
Example: Place 20 red and 80 green balls in a box. Place the red balls on the bottom and the green balls on
the top. Randomly sample 10 balls from the top without thoroughly mixing the balls. In your sample you
find that 0 balls are red and 10 are green. Your sample result of red to green (0:10) is different than the true
population ratio of 1:4. This difference is due to sampling bias. If we continue to replace the green balls and
sample from the top, we may never get a sample that includes red balls leading us to believe that there are
no green balls in the population.
Chapter 5 Review http://introductorystats.wordpress.com/2011/03/09/design-­‐of-­‐experiments/ Design of experiments
Posted on March 9, 2011 When the goal in a statistical study is to understand cause and effect, experiments are the only way to
obtain convincing evidence for causation. This is an introductory discussion on experimental design,
introducing its vocabulary, its characteristics and its principles. We use a hypothetical example of an
experiment to illustrate the concepts.
An observational study is a study in which the researchers observe individuals and measure variables of
interest but do not attempt to influence the response variable. In an experiment, the researchers deliberately
impose some treatment on individuals and then observe the response variables. When the goal is to
demonstrate cause and effect, experiment is the only source of convincing data.
Terminology
The individuals on which the experiment is performed are called the experimental units. If the experimental
units are human beings, they are called subjects. A treatment is an experimental condition applied to the
experimental units. The goal of an experiment is to determine whether changes in one or more explanatory
variables have any effect on some response variables. For this reason, the distinction between explanatory
variables and response variables is important. The explanatory variables are often called factors. Each
factor may have several values (called levels). Many experiments study the joint effects of several factors.
A treatment is then formed by combining a level of each of the factors.
Introduction of Examples
To illustrate the concepts, we use a hypothetical experiment. Suppose a new medication designed to reduce
fever (and relieve aches and pain) is being tested for efficacy and side effects. For convenience, we call this
new medication Drug X. There are three different dosages: 325 mg, 500 mg and 650 mg. The experiment
enrolls 1200 patients with high fever to test Drug X. Assume that the subjects in this experiment include
600 men and 600 women with age ranging from 18 to 70. The primary outcome measure is the drop in
body temperature three hours after taking the treatment, which is the yardstick by which to measure the
success of Drug X.
The three basic principles of statistical design of experiments are Control, Randomization and
Repetition. When we say the design of an experiment (or experimental design), we refer to the manner in
which these three principles are carried out. There are three main experimental designs: completely
randomized design, randomized block design and matched pairs design. We present several examples
based on the hypothetical experiment to illustrate these ideas.
More Terminology
In all the examples below, the new medication Drug X is compared to a group receiving placebo. A
placebo is a dummy treatment. In this example, it is a medication that has identical look, smell and taste as
Drug X. The experiments described in these examples are double-blind, meaning that both the subjects and
the experimenters do not know which treatment any subject has received.
Example 1a – Completely Randomized Design
The researchers randomly assigns the 1200 subjects into two treatment groups, Group 1 (600 subjects
taking Drug X 325 mg) and Group 2 (600 subjects taking placebo). Three hours after taking the treatments,
the researchers compare the change in body temperature between the treatment groups. In this examples,
there are two treatments, Drug X and placebo.
Chapter 5 Review The treatment of interest (Drug X) is called an intervention and the Drug X group is called the intervention
group. The placebo group is sometimes called the non-intervention group.
This is a one-factor experiment, i.e. only one explanatory variable, namely fever reducing medication. The
one factor has two levels (Drug X 325 mg and placebo). Figure 1 below is an outline of this design.
Example 1b – Completely Randomized Design
The example is similar to Example 1a except that there are four levels in the one factor. The researchers
randomly assign the 1200 subjects into four treatment groups, Group 1 (300 subjects taking Drug X 325
mg), Group 2 (300 subjects taking Drug X 500 mg), Group 3 (300 subjects taking Drug X 650 mg) and
Group 4 (300 subjects taking placebo). As in Example 1a, three hours after taking the treatments, the
researchers compare the change in body temperature between the several treatment groups.
The various Drug X groups are called the intervention groups and the placebo group is called the nonintervention group. Figure 2 below illustrates this design.
The Principles of Experimental Design
Let’s discuss the basic principles outlined in Figures 1 and 2. First, the principle of control. The placebo
group is called the control group, the group of subjects who receive a dummy treatment. Why is the control
group necessary? Why compare different Drug X groups with the placebo group? Why not just apply the
new fever reducing medication to all patients? Without the control group, we do not know whether the
favorable responses from the patients are due to the new medication or to the placebo effect. Some patients
respond well to any treatment, even a placebo. However, with a control group alongside Drug X groups,
both the placebo effect and other influences operate on both the control group and Drug X groups. The only
Chapter 5 Review difference between the groups is the varying levels of Drug X. Thus the purpose of having a control group
is to prevent confounding.
Two variables are confounded when their effect on a response variable (reduction in fever in our
examples) cannot be distinguished from one another. Without the control group as comparison, the effect of
Drug X and the placebo effect on the response variable (reduction in fever) cannot be distinguished from
one another. There could be other variables that may influence the response variable (these variables are
called lurking variables or confounding variables). Without the control group, the effect of Drug X and
these lurking variables may also be confounded.
The first principle of experimental design is control. We just illustrate the simplest form of control, that is,
the comparison of two or more treatments (other forms of control will be discussed below). The purpose of
comparing treatments is to prevent the effect of the explanatory variables (the effect of the new fever
reducing medication in our examples) being confounded with the placebo effect and other lurking
variables.
The second principle of experimental design is randomization. Notice that the patients are assigned to
either the Drug X groups or the placebo group through the use of random chance (conceptually, think
drawing names from a hat). The goal of randomization is to produce treatment groups that are similar
(except for chance variation) before the treatments begin.
The third principle of experimental design is replication, which refers to the practice of applying the
treatments to many experimental units. The goal of repetition is to reduce the role of chance variation on
the results of the experiment. For example, if each treatment group has only one patient, the results would
depend too much on which group gets lucky and is assigned a patient that is less sick (e.g. with milder
fever conditions). If we assign many patients to each group, it will be unlikely that all patients in the Drug
X groups will be less sick.
Prevention of Bias
Control (in particular, comparison of treatments) and randomization together prevent bias (i.e. systematic
favoritism). For example, because of the placebo effect, uncontrolled experiments in medicine can give
new medications or new therapies a higher rate of success. If patients are not assigned to treatment groups
by chance, the subjects in the new medication group and the placebo group may not have similar
characteristics and thus the results may become biased. For example, randomization prevents the possibility
that the researchers try to assign the sicker patients to the new medication groups in an effort to help them.
With randomization, there is no inherent bias resulting from some patients opting to take the new
medication. In a randomized controlled experiment, both the experimenters and the participants do not have
the right to choose the treatments.
In clinical trials involving medication, another way to prevent bias is through the technique of blinding,
which refers to the non-disclosure of the treatment a subject is receiving. There are two types of blinding.
An experiment is single-blind is one in which the subject does not know what treatment he or she is
receiving. A double-blind experiment is one in which both the subject and the medical personnel in contact
with the subject do not know which treatment the subject is receiving.
The double-blind technique avoids unconscious bias. In such an experiment, both the medical personnel
and the subject do not adjust their behavior that may bias the results (e.g. the researcher may think that a
placebo cannot help the patient).
Summary – Completely Randomized Design
The designs described in both Example 1a and Example 1b are called completely randomized designs and
are the simplest statistical designs for experiments. These designs incorporated all three principles of
control, randomization and repetition. A completely randomized design incorporates the simplest form of
control, namely comparison. The goal of comparing different treatments is to prevent the confounding of
Chapter 5 Review the explanatory variables with lurking variables. The element of randomization is to produce treatment
groups that are similar (except for chance variation) before the treatments begin. Comparison and
randomization together prevent bias. The goal of repetition is to reduce the role of chance variation on the
results of the experiment.
However, completely randomized designs are inferior to more elaborate designs. The reason is that it is
possible that not all potential cofounding variables are removed. For example, men and women respond
differently to medication. In the completely randomized designs in Examples 1a and 1b, the random
assignment to treatment groups are done without regard to gender. These two examples ignore the
differences between men and women. Though the patients are assigned by random chance to the treatment
groups, it is possible that one treatment group is assigned more men than women. A better design will look
separately at the responses of men and women. In other words, the researchers will separate out the men
from the women and then randomly assign each gender group to the different treatment groups. This is
called the randomized block design.
Example 2 – Randomized Block Design
The 1200 subjects are assigned to blocks, based on gender. Then subjects within each block are randomly
assigned to the two treatment groups (Drug X 325 mg, and Placebo). The variable of gender is called a
blocking variable. Three hours after taking the treatments, the researchers compare the change in body
temperature between the treatment groups within each block. Figure 3 below outlines this randomized
block design.
The randomized block design in this example is an improvement over the completely randomized design in
Example 1a. In both Example 1a and Example 2, comparison of treatment groups is used to implicitly
prevent confounding. However, the randomized block design in Example 2 explicitly controls the variable
of gender.
We can also create the blocking equivalence of Example 1b by randomly assigning subjects in each block
to four treatments (Drug X 325 mg, Drug X 500 mg, Drug X 650 mg, and Placebo). The outline of this
design is omitted.
Chapter 5 Review Summary – Randomized Block Design
A block is a group of experimental units that are known, prior to the experiment, to be similar according to
some variables and that these variables are expected to affect the response to the treatments. In the
randomized block design, the randomization to treatments is carried out separately winthin each block.
Blocks are another form of control. The block design is to control the variables that are used to form the
blocks (these variables are called the blocking variables). In Example 2, the blocking variable is the gender.
The third main type of design is the matched pairs design, which is a special case of the randomized block
design. This design is only applicable when the experiment has only two treatments and that the
experimental units can be separated into pairs according to some blocking variables. Consider the following
example.
Example 3 – Matched Pairs Design
The 1200 subjects are grouped into 600 matched pairs. The subjects in each pairs have the same gender and
have similar age. Moreover, the subjects in each matched pair are assigned by random chance to the two
treatments (Drug X 325 mg and placebo). The advantage of this design is that it explicitly controls both age
and gender. Each matched pair is like a block (based on age and gender). Randomization is done separately
within each pair. Three hours after taking the treatments, the researchers compare the change in body
temperature within each matched pair.
Summary – The Matched Pairs Design
The matched pairs design is, in some ways, superior to completely randomized design and randomized
block design. The requirements are that this design can only compare two treatments and that the group of
experimental units can be matched in pairs (thus requiring more work on the part of the experimenters).
Because matched subjects are more similar than unmatched subjects, the matched pairs design can
explicitly control the variables that are used to form the pairs.
Randomization remains important in the matched pairs design. For example, which one of the subjects in a
matched pair uses Drug X is decided by a coin toss. In contrast, in a completely randomized design,
random chance is used to assign all the subjects all at once to the treatment groups. In a randomized block
design, the random assignment is done separately within each block.
One common variation of the matched pairs design applies both treatments on the same subject. In such a
design, each subject serves as his or her own control.
Conclusion
One important advantage of experiments over observational studies is that well designed experiments can
provide good evidence for causation. In an experiment, an intervention (Drug X in our examples) is applied
to enough experimental units to ensure that the results of the experiments will not be dependent on chance
variation (the principle of repetition). The experimental units are randomly assigned to an intervention
group and a non-intervention group (placebo group). This refers to the principles of randomization and
control, which help reduce the potential of bias and prevent confounding by increasing the chance that
confounding variables will operate equally on the intervention group and the placebo group. Then the only
difference between the intervention group and the placebo group is the intervention. When the intervention
group experiences favorable results, we can be confident that the intervention makes the difference.
Chapter 5 Review AP Statistics Practice Multiple Choice Questions
1.
Can pleasant aromas help a student learn better? Two researchers believed that the
presence of a floral scent could improve a person’s learning ability in certain
situations. They had 22 people work through a pencil-and- paper maze six times,
three times while wearing a floral-scented mask and three times wearing an
unscented mask. The three trials for each mask closely followed one another. Testers
measured the length of time it took subjects to complete each of the six trials. They
reported that, on average, subjects wearing the floral-scented mask completed the
maze more quickly than those wearing the unscented mask, although the difference
was not statistically significant. This study is
A) a convenience sample
B) an observational study, not an experiment
C) an experiment, but not a double-blind experiment
D) a double-blind experiment
2.
A marketing research firm wishes to determine if the adult men in Laramie, Wyoming,
would be interested in a new upscale men’s clothing store. From a list of all residential
addresses in Laramie, the firm selects a simple random sample of 100 and mails a
brief questionnaire to each. The population of interest is
A) all adult men in Laramie, Wyoming
B) all residential addresses in Laramie, Wyoming
C) the members of the marketing firm that actually conducted the survey
D) the 100 addresses to which the survey was mailed
Twelve people, who suffer from chronic fatigue syndrome, volunteer to take part in an
experiment to see if, shark fin extract will increase one’s energy level. 8 of the volunteers
are men and 4 are women. Half of the volunteers are to be given shark extract twice a
day and the other half a placebo twice a day. We wish to make sure that 4 men and 2
women are assigned each of the treatments, so we decide to use a block design with the
men forming one block and the women the other.
3.
Referring to the information above, a block design is appropriate in this experiment if
A) we believe men and women will respond differently to treatments
B) gender equity is an important legal consideration in this study
C) we want the conclusions to apply equally to men and women
D) all of the above
Chapter 5 Review 4.
Referring to the information above, suppose one of the researchers is responsible for
determining if a subject displays an increase in energy level. In this case, we should
probably
A) use two placebos
B) use stratified sampling to assign subjects to treatments
C) use fewer subjects but observe them more frequently
D) conduct the study as a double-blind experiment
A study of human development showed two types of movies to groups of children.
Crackers were available in a bowl, and the investigators compared the number of
crackers eaten by children watching the different kinds of movies. One kind of movie
was shown at 8 AM (right after the children had breakfast) and another at 11 AM (right
before the children had lunch). It was found that during the movie shown at 11 AM, more
crackers were eaten than during the movie shown at 8 AM. The investigators concluded
that the different types of movies had an effect on appetite. .
5.
The results cannot be trusted because
A) the study was not double-blind. Neither the investigators, nor the children should
have been aware of which movie was being shown
B) the investigators were biased. They knew beforehand what they hoped the study
would show
C) the investigators should have used several bowls, with crackers randomly placed in
each
D) the time the movie was shown is a confounding variable.
6. The response variable in this experiment is
A) the number of crackers eaten
B) the different kinds of movies
C) the time the movie was shown
D) the bowls
Chapter 5 Review 7. In order to select a sample of undergraduate students in the United States, I select a
simple random sample of four states. From each of these states, I select a simple
random sample of two colleges or universities. Finally, from each of these eight
colleges or universities, I select a simple random sample of 20 undergraduates. My
final sample consists of 160 undergraduates. This is an example of
A) simple random sampling
B) stratified random sampling
C) multistage sampling
D) convenience sampling
8. A study of the effects of running on personality involved 231 male runners who each ran
about 20 miles a week. A news report (New York Times, Feb. 15, 1988) stated, “The
researchers found statistically significant personality differences between the runners and
the 30-year-oldmale population as a whole.” A headline on the article said, “Research has
shown that running can alter one’s moods.” Which of the following statements about the
study is true?
A) It was not a designed experiment
B) It was an experiment, but not a double-blind experiment
C) It was a double-blind experiment, but not a randomized
D) It was a randomized, double-blind experiment
One hundred volunteers who suffer from severe depression are available for a study.
Fifty are selected at random and are given a new drug that is thought to be particularly
effective in treating severe depression. The other 50 are given an existing drug for
treating severe depression. A psychiatrist evaluates the symptoms of all volunteers after
four weeks in order to determine if there has been substantial improvement in the
severity of the depression.
9.
The study described above would be double-blind if
A) neither drug had any identifying marks on it
B) the volunteers were not allowed to interact during the four weeks
C) neither the volunteers nor the psychiatrist knew which treatment any person had
received
D) all of the above
Chapter 5 Review 10.
Referring to the study described above, suppose volunteers were first divided into men
and women, and then half of the men were randomly assigned to the new drug and half
of the women were assigned to the new drug. The remaining volunteers received the
other drug. This would be an example of
A) Replication
B) confounding. The effects of gender will be mixed up with the effects of the drugs
C) a block design
D) a matched-pairs design
11. Will a fluoride mouthwash used after brushing reduce cavities? Twenty sets of twins
were used to investigate this question. One member of each set of twins used the
mouthwash after each brushing; the other did not. After six months, the difference in the
number of cavities of those using the mouthwash was compared with the number of
cavities of those who did not use the mouthwash. This experiment uses
A) random placebos
B) double-blinding
C) double replication
D) a matched-pairs design
12. A stratified random sample is similar to which of the following experimental designs?
A) a block design
B) a double-blind experiment
C) an experiment with a placebo
D) a confounded, nonrandomized study
Choose a simple random sample of size three from the following employees of a small
company.
1. Bechhofer 2. Brown 3. Ito 4. Kesten 5.Kiefer 6. Spitzer 7. Taylor 8.Wald 9. Weiss
Use the numerical labels attached to the names above and the list of random digits
below. Read the list of random digits from left to right, starting at the beginning of the list.
11793 20495 05907 11384 44982 20751 27498 12009 45287 71753 98236 66419
84533
Chapter 5 Review 13.)Referring to the information above, the simple random sample is
A) 117
B) Bechhofer, then Bechhofer again, then Taylor
C) Bechhofer, Taylor, Weiss
D) Kesten, Kiefer, Taylor
14.)
Referring to the information above, which of the following statements is true?
A) If we used another list of random digits to select the sample, we would get the same
result that we obtained with the list used here.
B) If we used another list of random digits to select the sample, we would get a
completely different sample than that obtained with the list used here.
C) If we used another list of random digits to select the sample, we would get at most
one name in common with the sample obtained here.
D) If we used another list of random digits to select the sample, it would be just as likely
that the sample that we obtained here would be selected as any other set of three
names.
15.) A simple random sample of 1200 adult Americans is selected, and each person is
asked the following question: In light of the huge national deficit, should the
government at this time spend additional money to establish a national system of
health insurance? Only 39% of those responding answered yes. This survey
A) is reasonably accurate since it used a large, simple random sample
B) probably overstates the percentage of people that favor a system of national health
insurance
C) probably understates the percentage of people that favor a system of national health
insurance
D) is very inaccurate, but neither understates nor overstates the percentage of people
that favor a system of national health insurance. Since simple random sampling was
used, it is unbiased
Chapter 5 Review 16.) A news release for a diet products company reports: “There’s good news for the 65
million Americans currently on a diet.” Its study showed that people who lose weight
could keep it off. The sample was 20 graduates of the company’s program who
endorse it in commercials. The results of the sample are probably
A) biased, overstating the effectiveness of the diet
B) biased, understating the effectiveness of the diet
C) unbiased since these are nationally recognized individuals
D) unbiased, but they could be more accurate. A larger sample size should be used
17.) A public opinion poll in Ohio wants to determine whether registered voters in the state
approve of a measure to ban smoking in all public areas. They select a simple
random sample of 50 registered voters from each county in the state and ask
whether they approve or disapprove of the measure. This is an example of a
A)
systematic sample
B) stratified sample C) multistage sample D) simple random sample
18.) A marketing research firm wishes to determine if the adult men in Laramie, Wyoming,
would be interested in a new upscale men’s clothing store. From a list of all residential
addresses in Laramie, the firm selects a simple random sample of 100 and mails a brief
questionnaire to each. The chance that all 100 homes in a particular neighborhood in
Laramie end up being the sample of residential addresses selected is
A) the same as for any other set of 100 residential addresses
B) exactly 0. Simple random samples will spread out the addresses selected
C) reasonably large due to the “cluster” effect
D) 100 divided by the size of the population of Laramie
19.) You are testing a new medication for relief of depression. You are going to give the
new medication to subjects suffering from depression and see if their symptoms
have lessened after a month. You have eight subjects available. Half of the subjects
are to be given the new medication and the other half a placebo. The names of the
eight subjects are given below.
1. Blumenthal 2. Costello 3. Duvall 4. Fan 5. House 6. Long 7. Pavlicova 8. Tang
Using the list of random digits 81507 27102 56027 55892 33063 41842 81868
71035 09001 43367 49497 starting at the beginning of this list and using single-digit
labels, you assign the first four subjects selected to receive the new medication,
while the remainder receive the placebo. The subjects assigned to the placebo are
A) Blumenthal, Costello, Duvall, and Fan
Chapter 5 Review B) Blumenthal, House, Pavlicova, and Tang
C) House, Long, Pavlicova, and Tang
D) Costello, Duvall, Fan, and Long
A television station is interested in predicting whether voters in its viewing area are in
favor of federal funding for abortions. It asks its viewers to phone in and indicate whether
they support/are in favor of or are opposed to this. Of the 2241 viewers who phoned in,
1574 (70.24%) were opposed to federal funding for abortions.
20. Referring to the information above, the viewers who phoned in are A) a voluntary
response sample B) a convenience sample C) a probability sample D) a population
21. Referring to the information above, the sample obtained is A) a simple random
sample B) a single-stage sample C) a census D) probably biased
22. In order to assess the opinion of students at the University of Minnesota on campus
snow removal, a reporter for the student newspaper interviews the first 12 students he
meets who are willing to express their opinion. The method of sampling used is
A) simple random sampling B) convenience sampling C) voluntary response D) a census
23.)In order to take a sample of 90 members of a local gym, I first divide the members
into men and women, and then take a simple random sample of 45 men and a
separate simple random sample of 45 women. This is an example of
A) a block design
B) a stratified random sample
C) a double-blind simple random sample
D) a randomized comparative experiment.
24. A1992 Roperpoll found that 22% of Americans say that the Holocaust may not have
happened.The actual question asked in the poll was: Does it seem possible or
impossible to you that the Nazi extermination of the Jews never happened? Twentytwo percent responded “possible.” The results of this poll cannot be trusted because
A) undercoverage is present. Obviously those people who did not survive the Holocaust
could not be in the poll
B) the question is worded in a confusing manner
C) we do not know who conducted the poll or who paid for the results
D) nonresponse is present. Many people will refuse to participate and those that do will
be biased in their opinions
Chapter 5 Review 25. A researcher is interested in the cholesterol levels of adults in the city in which she
lives. A free cholesterol screening program is set up in the downtown area during the
lunch hour. Individuals can walk in and have their cholesterol levels determined for
free. One hundred and seventy three people use the service, and their average
cholesterol is 217.8. The sample obtained is an example of
A) a SRS, since the experimenter did not know beforehand which individuals would
come to the screening
B) a stratified sample of high and low cholesterol individuals
C) a sample probably containing bias and undercoverage
D) a multistage sample of varying cholesterol levels
26.
In order to determine if smoking causes cancer, researchers surveyed a large sample
of adults. For each adult they recorded whether the person had smoked regularly at
any period in his or her life and whether the person had cancer. They then compared
the proportion of cancer cases in those who had smoked regularly at some time with
the proportion of cases in those who had never smoked regularly at any point. The
researchers found there was a higher proportion of cancer cases among those who
had smoked regularly than among those who had never smoked regularly. This is
A) an observational study
B) an experiment, but not a double-blind experiment
C) a double-blind experiment
D) a block design
27.)In order to investigate whether women are more likely than men to prefer Democratic
candidates, a political scientist selects a large sample of registered voters, both men
and women. She asks every voter whether they voted for the Republican or the
Democratic candidate in the last election. This is
A) an observational study
B) a multistage sample
C) A double-blind experiment
D) a block design
Chapter 5 Review 28. A market research company wishes to find out whether the population of students at a
university prefers brand A or brand B of instant coffee. A random sample of students is
selected, and each student is asked first to try brand A and then to try brand B, or vice
versa (with the order determined at random). They then indicate which brand they
prefer. This is an example of A) an experiment B) an observational study, not an
experiment C) stratified sampling design D) block design
29.
Sickle-cell disease is a painful disorder of the red blood cells that affects mostly blacks
in the United States. To investigate whether the drug hydroxyurea can reduce the
pain associated with sickle-cell disease, a study by the National Institute of Health
gave the drug to 150 sickle-cell sufferers and a placebo to another 150. The
researchers then counted the number of episodes of pain reported by each subject.
The response is A) the drug hydroxyurea B) the number of episodes of pain C) the
presence of sickle-cell disease D) the number of red blood cells
A group of college students believes that herbal tea has remarkable restorative powers.
To test their theory they make weekly visits to a local nursing home, visiting with
residents, talking with them, and serving them herbal tea. After several months, many of
the residents are more cheerful and healthy.
30. The explanatory variable in this experiment is the
A) emotional state of the residents
B) herbal tea
C) fact that this is a local nursing home
D) college students
31. The confounding variable in this experiment is the
A) emotional state of the residents
B) herbal tea
C) fact that this is a local nursing home
D) visits of college students
Chapter 5 Review 32.
A study to determine whether or not a football filled with helium traveled farther when
kicked than one filled with air found that, while the football filled with helium went, on
average, farther than the one filled with air, the difference was not statistically
significant. The response
A) is the gas, air or helium, with which the football is filled
B) does not exist without statistical significance
C) is the number of kickers
D) is the distance the football traveled
New varieties of corn with altered amino acid patterns may have higher nutritive value
than standard corn, which is low in the amino acid lysine. An experiment compares two
new varieties, called opaque-2 and floury-2, with normal corn. Corn- soybean meal diets
using each type of corn: are prepared at three different protein levels, 12%, 16%, and
20%, giving nine diets in all. Researchers assign 10 one-day-old male chicks to each
diet and record their weight gains after 21 days. The weight gain of the chicks is a
measure of the nutritive value of their diet.
33. Referring to the information above, the experimental units in this experiment are
A) variety and protein level
B) the weight gains
C) the 90 one-day-old male chicks
D) opaque-2 and floury-2
34.
Referring to the information above, the treatments are
A) 9 different combinations of variety and protein level
B) the three levels of protein
C) the 90 one-day-old male chicks
D) opaque-2 andfloury-2 varieties of corn
35.
Which of the following is not a major principle of experimental design?
A) control B) replication C) randomization D) segmenation
Chapter 5 Review 36. Two variables in a study are said to be confounded if
A) one cannot separate their effects on a response variable
B) they are highly correlated
C) they do not have a normal distribution
D) one of them is a placebo
Researchers wish to determine if a new experimental medication will reduce the
symptoms of allergy sufferers without the side effect of drowsiness. To investigate this
question, the researchers give the new medication to 50 adult volunteers who suffer
from allergies. 44 of these volunteers report a significant reduction in their allergy
symptoms without any drowsiness.
37. Referring to the information above, this study could be improved by
A) including people who do not suffer from allergies in the study in order to represent a
more diverse population
B) repeating the study with only the 44 volunteers who reported a significant reduction
in their allergy symptoms without any drowsiness, and giving them a higher dosage this
time
C) using a control group
D) all of the above
38.
Referring to the information above, the experimental units are
A) the researchers
B) the 50 adult volunteers
C) the 44 volunteers who reported a significant reduction in their allergy symptoms
without any drowsiness.
D) the six volunteers who did not report a significant reduction in their allergy symptoms
without any drowsiness.
SOLUTIONS: 1.) C 7.) C 2.) A 8.) A 3.) A 9.) C 4.) D 10.) C 5.) D 11.) D 6.) A 12.) A 13.) C 14.) D 15.) C 16.) A 17.) B 18.) A 19.) D 20.) A 21.) D 22.) B 23.) B 24.) B 25.) C 26.) A 27.) A 28.) A 29.) B 30.) B 31.)D 32.)D 33.)C 34.)A 35.)D 36.)A 37.) C 38.) B 
Download