PPT 18

advertisement
Chapter 18
Experimental and Quasi-Experimental Research
Research Methods in Physical Activity
Experimental research
Experimental research attempts to establish cause-and-effect
relationships. Three criteria must be present to establish cause and
effect:
1. The cause must precede the effect in time. For example, the starting
gun in a race precedes the runners’ beginning the race; the runners’ beginning does
not cause the starting gun to go off.
2. The cause and effect must be correlated with each other. Just
because two variables are correlated does not mean one causes the
other; cause and effect, however, cannot exist unless two variables
are correlated.
3. The correlation between cause and effect cannot be explained by
another variable. Recall that the relationship between the academic
performance of elementary children and shoe size was explained by a third
variable, age.
Research Methods in Physical Activity
Experimental research
Cause and effect in must be defined terms of necessary and
sufficient conditions. For example, if the condition is necessary and sufficient
to produce the effect, then it is the cause.
Alternative situations exist as well:
♦ Necessary but not sufficient: Some related condition likely
produces the effect.
♦ Sufficient but not necessary: Some alternative condition is likely
the cause.
♦ Neither necessary nor sufficient: Some contributing condition is
likely the cause.
Research Methods in Physical Activity
Experimental Research
Also, remember that statistics do not establish cause and effect.
Statistical techniques can only reject the null hypothesis (establish that
groups are significantly different) and identify the percentage of
variance in the dependent variable accounted for by the independent
variable or the effect size.
Neither of these procedures establishes cause and effect (they are
necessary but not sufficient).
Cause and effect can be established only by the application of logical
thinking to well-designed experiments.
This logical process establishes that no other reasonable
explanation exists (reject Null for alternate hypothesis) for the
changes in the dependent variable except the manipulation of the
independent variable.
Research Methods in Physical Activity
Experimental Research
The logical process of establishing changes in the dependent variable
require:
♦ The selection of a good theoretical framework
♦ The use of appropriate participants
♦ The application of an appropriate experimental design
♦ The proper selection and control of the independent variable
(treatment)
♦ The appropriate selection and measurement of the dependent
variable
♦ The use of the correct statistical model and analysis
♦ The correct interpretation of the results
Thus, the above are the methods required to establish valid findings
allowing the researcher to establish cause and effect.
Research Methods in Physical Activity
Sources of Invalidity
Both internal validity and external validity are important in experiments, but
they are frequently at odds in research planning and design.
Gaining internal validity involves controlling all variables so that the
researcher can eliminate all rival hypotheses as explanations for the
outcomes observed.
When controlling and constraining the research setting to gain internal
validity, the researcher places the generalization (external validity) of
the findings in jeopardy.
Thus, the researcher must decide whether it is more important to be certain
that the manipulation of the independent variable caused the observed
changes in the dependent variable or more important to be able to
generalize the results to other populations, settings, etc.
Research Methods in Physical Activity
Threats to Internal Validity in Research Design (see
examples of each pp. 332-334)
♦ History: events occurring during the experiment that are not
part of the treatment
♦ Maturation: processes within the participants that operate as
a result of time passing (e.g., aging, fatigue, hunger)
♦ Testing: the effects of one test on subsequent administrations
of the same test
♦ Instrumentation: changes in instrument calibration, including
lack of agreement within and between observers
♦ Statistical regression: the fact that groups selected based on
extreme scores are not as extreme on subsequent testing
♦ Selection bias: choosing comparison groups in a nonrandom
manner
♦ Experimental mortality: loss of participants from
comparison groups for nonrandom reasons.
Research Methods in Physical Activity
Threats to Internal Validity in Research Design (see
examples of each pp. 332-334)
♦ Selection–maturation interaction: the passage of time
affecting one group but not the other in nonequivalent group
designs
♦ Expectancy: experimenters’ or testers’ anticipating that
certain participants will perform better
Research Methods in Physical Activity
Controlling Threats to Internal Validity in Research
Design
Many threats to internal validity are controlled by making the
participants in the experimental and control groups as alike as
possible. This objective is most often accomplished by randomly
assigning participants to groups.
Random Assignment
1) Randomization allows the assumption that the groups do not
differ at the beginning of the experiment.
1) The randomization process controls for history up to the
point of the experiment; that is, the researcher can assume
that past events are equally distributed among groups
Research Methods in Physical Activity
Controlling Threats to Internal Validity in Research
Design
Random Assignment
3) Randomization also controls for maturation because the
passage of time is equivalent in all groups.
4) Statistical regression, selection biases, and selection–maturation
interaction are controlled because they occur only when
groups are not randomly formed.
Research Methods in Physical Activity
Controlling Threats to Internal Validity in Research
Design
Random Assignment
Ways other than random assignment of participants to groups are
sometimes used to control threats to internal validity. These
Include the matched-pair technique, and within-subjects designs.
Matched-Pair Technique - The matched-pair technique matches
pairs of participants who are equal on some characteristic and
then randomly assigns each to a different group.
Within-subjects designs - The participants are used as their own
controls. This means that each participant receives both the
experimental and the control treatment. In this type of design, the
order of treatments should be counterbalanced; that is, half the
participants should receive the experimental treatment first and
then the control, and the other half should receive the control
treatment first and then the experimental treatment.
Research Methods in Physical Activity
Controlling Threats to Internal Validity in Research
Design
Placebos, Blind Setups, and Double-Blind Setups
Other ways of controlling threats to internal validity include
placebos and blind and double-blind setups.
Placebo - Method of controlling a threat to internal validity in
which a control group receives a false treatment while the
experimental group receives the real treatment.
Blind setup - Method of controlling a threat to internal validity in
which the participant does not know whether he or she is
receiving the experimental or control treatment.
Double-blind setup - Method of controlling a threat to internal
validity in which neither the participant nor the experimenter
knows which treatment the participant is receiving.
Research Methods in Physical Activity
Uncontrolled Threats to Internal Validity in Research
Design
Three Threats to internal validity remained uncontrolled by the
randomization process : 1) Reactive or Interactive Effects; 2)
Instrumentation; and, 3) Experimental Mortality.
1) Reactive or interactive effects of testing can be controlled
only by eliminating the pretest.
2) Instrumentation problems cannot be controlled or evaluated
by any design. Only the experimenter can control this threat
to internal validity.
3) Experimental mortality is not controlled by any type of
experimental design. The experimenter can control this
threat only by ensuring that participants are not lost (at all, if
possible) from groups.
Research Methods in Physical Activity
Threats to External Validity in Research Design (see
examples of each pp. 334-336)
Threats to external validity, or the ability to generalize results to
other participants, settings, measures, etc:
♦ Reactive or interactive effects of testing : The pretest may
make the participant more aware of or sensitive to the upcoming
treatment. As a result, the treatment is not as effective without the
pretest.
♦ Interaction of selection bias and the experimental
treatment: When a group is selected on some characteristic, the
treatment may work only on groups possessing that characteristic.
♦ Reactive effects of experimental arrangements:
Treatments that are effective in constrained situations (e.g.,
laboratories) may not be effective in less constrained settings
(more like the real world).
♦ Multiple-treatment interference: When participants receive
more than one treatment, the effects of previous treatments may
influence subsequent ones
Research Methods in Physical Activity
Controlling Threats to External Validity in Research
Design
External validity is generally controlled by selecting the
participants, treatments, experimental situation, and tests to
represent some larger population.
The results of a particular treatment can be generalized to a
larger group if the sample is representative, this generalization may
apply only to the specific situation in the experiment. If the
experiment is conducted under controlled laboratory conditions,
then the findings may apply only under controlled laboratory
conditions.
Does the study have enough characteristics of real-world
settings that participants respond as if they were in the real
world ? (ie. Is ecological validity present ? )
Research Methods in Physical Activity
Types of Research Designs
Types of research designs are divided into three categories:
Pre - experimental designs,
True experimental designs, and,
Quasi-experimental designs. (not reviewed in this lecture)
With the following notation:
♦ Each line indicates a group of participants.
♦ R signifies random assignment of participants to groups.
♦ O signifies an observation or a test.
♦ T signifies that a treatment is applied; a blank space in a line where
a T appears on another line means that the group is a
control.
♦ A dotted line between groups means that the groups are used
intact rather than being randomly formed..
Research Methods in Physical Activity
Types of Research Designs
Notations (cont) :
♦ Subscripts (ex. O1, T2 ) indicate either the order of observations
and treatments (when they appear on the same line) or observations
of different groups or different treatments (when they appear on
different lines). For example, when the terms T1 and T2 appear on
different lines, they refer to different treatments; when they appear
on the same line, they mean that the treatment is administered more
than once to the same group.
Research Methods in Physical Activity
Types of Research Designs
Pre - experimental Designs
Pre-experimental design - One of three types of research design that
control few of the sources of invalidity and that do not have random
assignments of participants to groups: one-shot study, one-group
pretest - posttest design, and static group comparison.
One-shot study
In a one-shot study design, a group of participants receives a
treatment followed by a test to evaluate the treatment:
T
O
This design fails all the tests of good research. All that can be said is
that at a certain point in time this group of participants performed at
a certain level. In no way can the level of performance (O) be
attributed to the treatment (T).
Research Methods in Physical Activity
Types of Research Designs
Pre - experimental Designs
Pretest - posttest design
The one-group pretest–posttest design, although very weak, is better
than the one-shot design. At least we can observe whether any
change in performance has occurred:
O1 T O2
If O2 is better than O1, we can say that the participants improved.
Complications with this design :
1) The improvement could be due to the treatment, but it could also
be due to history. Some event other than the treatment (T) may
have occurred between the pretest (O1) and the posttest (O2);
Research Methods in Physical Activity
Types of Research Designs
Pretest - posttest design
Complications with this design (cont) :
2) Some event other than the treatment (T) may have occurred
between the pretest (O1) and the posttest (O2); (ex. the participants
may have exercised at home on the other days. )
3) Maturation is a rival hypothesis. The participants may have gotten
better (or worse) just as a result of the passage of time.
4) Testing is a rival hypothesis; the increase at O2 may be the result
only of experience with the test at O1.
5) If the group being tested is selected for some specific reason, then
any of the threats involving selection bias can occur.
Research Methods in Physical Activity
Types of Research Designs
Pre - experimental Designs
Static Group Comparison
The static group-comparison design compares two groups, one of
which receives the treatment and one of which does not:
T
O2
O1
Complications with this design :
1) We do not know whether the groups were not equivalent when
the study began, as indicated by the dotted line between the groups.
This means that the groups were selected intact rather than being
randomly formed. Therefore, we are unable to determine whether
any differences between O1 and O2 are because of T or only because
the groups differed initially.
Research Methods in Physical Activity
Types of Research Designs
Static Group Comparison
Complications with this design (cont) :
2) This design is subject to invalidity because of selection biases and
the selection–maturation interaction.
Research Methods in Physical Activity
Types of Research Designs
True Experimental Designs
True experimental design - Any design used in experimental research
in which groups are randomly formed and that controls most
sources of invalidity.
This controls for past (but not present) history, maturation (which
should occur equally in the groups), testing, and all sources of
invalidity that are based on nonequivalence of groups (statistical
regression, selection biases, and selection–maturation interaction).
Only the experimenter can make sure that nothing happens to one
group (besides the treatment) and not the other (present history),
that scores on the dependent measure do not vary as a result of
instrumentation problems, and that the loss of participants is not
different between the groups (experimental mortality).
Research Methods in Physical Activity
Types of Research Designs
True Experimental Designs
True experimental designs include:
1) Randomized–Groups Design
2) Pretest–Posttest Randomized–Groups Design, and
3) Solomon Four-Group Design
Research Methods in Physical Activity
Types of Research Designs
True Experimental Designs
Randomized–Groups Design
The randomized-groups design resembles static group comparison
except that groups are randomly formed:
R
R
T
O1
O2
If the researcher controls the threats to internal validity that are not
controlled by randomization, and has a sound theoretical basis
for the study, and meets the necessary - and-sufficient rule, then
this design allows the conclusion that significant differences
between O1 and O2 are due to T.
An independent t test is used to analyze the difference between O1 and
O 2.
Research Methods in Physical Activity
Types of Research Designs
True Experimental Designs
Randomized–Groups Design (cont)
This design can be extended to any number of levels of an
independent variable:
R
R
R
T1
T2
O1
O2
O3
Here, three levels of the independent variable exist, where one is the
control and T1 and T2 represent two levels of treatment.
This design can be analyzed by simple ANOVA, which contrasts the
dependent variable as measured in the three groups (O1, O2, O3).
Research Methods in Physical Activity
Types of Research Designs
True Experimental Designs
Pretest–Posttest Randomized–Groups Design
In the pretest–posttest randomized-groups design, the groups are
randomly formed, but both groups are given a pretest as well as
a posttest:
R
R
O1
O3
T
O2
O4
The major purpose of this type of design is to determine the amount
of change produced by the treatment; that is, does the
experimental group change more than the control group?
Research Methods in Physical Activity
Types of Research Designs
True Experimental Designs
Pretest–Posttest Randomized–Groups Design (cont)
This design threatens the internal validity through testing, but the
threat is controlled because the comparison of O3 to O4 in the
control group and the comparison of O1 to O2 in the
experimental group includes the testing effect. Thus, although
the testing effect cannot be evaluated in this design, it is
controlled.
This design can (best) be analyzed by repeated measures ANOVA.
Research Methods in Physical Activity
Types of Research Designs
True Experimental Designs
Solomon Four-Group Design
The Solomon four-group design is the only true experimental design to
specifically evaluate one of the threats to external validity: reactive or
interactive effects of testing. The design is depicted as follows:
R
O1
T
O2
R
O3
O4
R
T
O5
R
O6
This approach combines the randomized-groups and the pretest–posttest
randomized-groups designs. The purpose is explicitly to determine whether
the pretest results in increased sensitivity of the participants to the
treatment.
Research Methods in Physical Activity
Types of Research Designs
Solomon Four-Group Design (cont)
R
R
R
R
O1
O3
T
T
O2
O4
O5
O6
This design allows:
1) a replication of the treatment effect (is O2 > O4? and is O5 >
O6?),
2) an assessment of the amount of change due to the treatment (is
O2 – O1 > O4 – O3?),
3) an evaluation of the testing effect (is O4 > O6?), and,
4) an assessment of whether the pretest interacts with the treatment
(is O2 > O5?).
Research Methods in Physical Activity
Types of Research Designs
Solomon Four-Group Design (cont)
In addition, no good way exists to analyze this design statistically. The best
alternative (one that does not use all the data) is a 2 X 2 ANOVA set up as
follows:
Pretested
Unpretested
No T
T
O4
O6
O2
O5
Thus, independent variable 1 has two levels (pretested and not pretested), and
independent variable 2 has two levels (treatment and no treatment).
In the ANOVA, the F ratio for independent variable 1 establishes the effects of
pretesting, the F for independent variable 2 establishes the effects of the
treatment, and the F for interaction evaluates the external validity threat of
interaction of the pretest with the treatment.
End of Lecture
Research Methods in Physical Activity
Download