between groups variance

advertisement
Chapter 10 – Lecture 10
Internal Validity – Control through
Experimental Design
1) Test the effects of IV on DV
2) Protects against threats to internal validity
Causation
Experimental Design
 Highest Constraint
 Comparisons btw grps
 Random sampling
 Random assignment
Infer Causality
Experimental Design
(5 characteristics)
1) One or more hypothesis
2) Includes at least 2 “levels” of IV
3) Random assignment
4) Procedures for testing hypothesis
5) Controls for major threats to internal validity
Experimental Design









•
Develop the problem statement
Define IV & DV
Develop research hypothesis
Identify a population of interest
Random sampling & Random assignment
Specify procedures (methods)
Anticipate threats to validity
Create controls
Specify Statistical tests
Ethical considerations
Clear Experimental Design…
Experimental Design
2 sources of variance
1. between groups variance (systematic)
no drug
drug
2. Within groups variance (nonsystematic)
(error variance)
Remember…
Sampling error
Significant differences…variability btw means is larger than expected
on the basis of sampling error alone (due to chance alone)
Variance
Need it!
Without it…
No go
VARIANCE
“Partitioning of the variance”
Between Group
Experimental Variance
Within Group
(Due to your treatment)
+
Extraneous Variance
(confounds etc.)
CON
TX
Error Variance
(not due to treatment – chance)
Subs
Variance: Important for the statistical analysis
F=
F=
F=
between groups variance
Within groups variance
Systematic effects + error variance
error variance
1.00
No differences btw groups
Variance
Your experiment should be designed to
• Maximize experimental variance
•Control extraneous variance
•Minimize error variance
Maximize “Experimental” Variance
• At least 2 levels of IV (IVs really vary?)
•Manipulation check: make sure the
levels (exp. conditions) differ each other
Ex: anxiety levels (low anxiety/hi anxiety)
 performance on math task
anxiety scale
Control “Extraneous” Variance
1. Ex. & Con grps are similar to begin with
2. Within subjects design (carryover effects??)
3. If need be, limit population of interest (o vs o )
4. Make the extraneous variable an IV
(age, sex, socioeconomic) = factorial design
M
F
Lo Anxiety
M-low
F-low
Hi Anxiety
M-hi
F-hi
Factorial design
(2 IV’s)
YOUR Proposals
Control through Design – Don’ts
1.
2.
3.
4.
Ex Post Facto
Single-group, posttest only
Single-group pretest-posttest
Pretest-Posttest natural control group
1. Ex Post Facto – “after the fact”
Group A
Naturally Occurring Event
No manipulation
Measurement
Control through Design – Don’ts
Single group posttest only
Group A
TX
Posttest
Single group Pretest-posttest
Pretest Group A
TX
Compare
Posttest
Control through Design – Don’ts
Pretest-Posttest Naturalistic Control Group
Group A
Pretest
TX
Posttest
Compare
Group B
Natural
Occurring
Pretest
no TX
Posttest
Control through Design – Do’s – Experimental Design
•
•
•
Manipulate IV
Control Group
Randomization
4 Basic Designs
Testing One IV
1. Randomized Posttest only, Control Group
2. Randomized Pretest-Posttest, Control Group
3. Multilevel Completely Randomized Between Groups
4. Solomon’s Four- Group
Randomized Posttest Only – Control Group
(most basic experimental design)
R Group A
(Ex)
TX
R Group B
(Con)
no TX
Posttest
Compare
Posttest
Randomized, Pretest-Posttest, Control Group Design
R Group A
(Ex)
Pretest
R Group B
(Con)
Pretest
TX
Posttest
Compare
no TX
Posttest
Multilevel, Completely Randomized Between
Subjects Design (more than 2 levels of IV)
R Group A
Pretest
TX1
Posttest
R Group B
Pretest
TX 2
Posttest
R Group C
Pretest
TX3
Posttest
R Group D
Pretest
TX4
Posttest
Compare
Solomon’s Four Group Design
(extension Multilevel Btw Subs)
R Group A
Pretest
TX
R Group B
Pretest
----
Posttest
R Group C
--------
TX
Posttest
R Group D
--------
----
Posttest
Powerful Design!
Posttest
Compare
What stats do you use to analyze experimental designs?
Depends the level of measurement
Test difference between groups
Nominal data  chi square (frequency/categorical)
Ordered data  Mann-Whitney U test
Interval or ratio  t-test / ANOVA (F test)
t-Test
Independent
Samples (between Subs)
Evaluate differences bwt
2 independent groups
Compare 2 groups
One sample (Within)
Evaluate differences bwt two
conditions in a single groups
Assumptions to use t-Test
1. The test variable (DV) is normally distributed
in each of the 2 groups
2. The variances of the normally distributed test
variable are equal – Homogeniety of Variance
3. Random assignment to groups
t-distribution
Represents the distribution of t that would be obtained if a
value of t were calculated for each sample mean for all possible
random samples of a given size from some population
Degrees of freedom (df)
When we use samples we approximate means & SD
to represent the true population
Sample variability (SS = squared deviations) tends
to underestimate population variability
Restriction is placed = making up for this mathematically
by using n-1 in denominator
S2 = variance
ss (sum of squares)
df (degrees of freedom)
(x - )2
n-1
x
Degrees of freedom (df): n-1
The number of values (scores) that are free to
vary given mathematical restrictions on a
sample of observed values used to estimate
some unknown population = price we pay for
sampling
Degrees of freedom (df): n-1
Number of scores free to vary
Data Set  you know the mean
(use mean to compute
variance)
n=2 with a mean of 6
X
8
?
6
In order to get a mean of 6
with an n of 2…need a sum
of 12…second score must be
4… second score is restricted
by sample mean (this score is not
free to vary)
=x
Group Statistics
ENDURANC
DRUG
doped
no dope
N
10
10
Mean
7.9000
2.6000
Std. Error
Mean
.3786
.4000
Std. Deviation
1.1972
1.2649
Independent Samples Test
Levene's Test for
Equality of Variances
F
ENDURANC Equal variances
assumed
Equal variances
not assumed
.065
Sig.
.801
t-test for Equality of Means
t
Mean
Sig. (2-tailed) Difference
df
Std. Error
Difference
95% Confidence
Interval of the
Difference
Lower
Upper
9.623
18
.000
5.3000
.5508
4.1429
6.4571
9.623
17.946
.000
5.3000
.5508
4.1427
6.4573
ANOVA
ENDURANC
Between Groups
Within Groups
Total
Sum of
Squares
140.450
27.300
167.750
df
1
18
19
Mean Square
140.450
1.517
F
92.604
Sig.
.000
Analysis of Variance (ANOVA)
Two or more groups ….can use on two groups…
t2 = F
Variance is calculated more than once
because of varying levels (combo of differences)
Several Sources of Variance
SS – between
Partitioning
SS – Within
the variance
SS – Total
Sum of Squares: sum of squared deviations from the mean
Assumptions to use ANOVA
1. The test variable (DV) is normally distributed
2. The variances of the normally distributed test
variable is equal – Homogeniety of Variance
3. Random assignment to groups
F=
Systematic effects + error variance
error variance
F=
F=
between groups variance
Within groups variance
1.00
F = 21.50
No differences btw groups
22 times as much variance between
the groups than we would expect by chance
After Omnibus F…
Planned comparisons & Post Hoc tests
A Priori (spss: contrast)
A Posteriori
part of your hypothesis…before
Not quite sure where
data are collected…prediction is made differences will occur
Why not just do t-tests!
2 types of errors that you must consider when
doing Post Hoc Analysis
Alpha
1. Per-comparison error (PC)
2. Family wise error (FW)
Inflate Alpha!!!!
FW = c()
c = # of comparisons made
 = your PC
Ex: IV ( 5 conditions)
1 vs 2
1 vs 3
1 vs 4
1 vs 5
2 vs 3
2 vs 4
2 vs 5
3 vs 4
3 vs 5
4 vs 5
FW = c()
10 (0.05) = .50
HSD
Download