Choosing Research Designs II

advertisement
Choosing Research Designs II
Nonexperimental Methods
The Purpose of Control Variables
• We use control variables to account for
possible alternative explanations we can
think of.
• For example, when I examined whether
democracies are generally more peaceful
than autocracies I included several
control variables.
Explaining Pacifistic Democracy
• Peace (Y) = Democracy (X1) + State
Power (X2) + Development (X3) + # of
Bordering States (X4)
• In the model above, I have more
confidence that Democracy is related to
peace considering I control for the other
variables that may skew my test.
• We need to take care that our theory is not
missing other factors that may undermine
the validity of our theory and tests.
• Our inferences will be flawed if we are
actually capturing other processes through
our variables.
• This means that the validity of our
measures would be undermined.
Several possible problems arise that are
related to model misspecification and
spurious relationships.
•
Thus, we need to control for confounding
factors and alternative explanations!!!
Model Misspecification and
Spuriousness
• Antecedent variable: A variable that
indirectly affects the relationship
between two other variables.
• For example, Ivy league education
increases income.
• However, parental wealth and legacy
admissions affect Ivy league education.
Thus, income of graduates from Ivy
League schools may not be random.
Here Ivy League Parents is an
antecedent variable
Ivy League Parents
Kids
kids
Ivy League
high income
Hence, admission to Ivy schools clearly
not random or pure merit-based, and
thus the income earned by these
people.
Model Misspecification and Spuriousness
• Intervening Variable: These may be
spuriously related to another relationship.
• How can states fight each other if they are not
contiguous with each other? Only the
strongest, with large navies, bases, etc., could
do so.
• Hence, geographic contiguity or distance
is an intervening variable. States may or
may not be more peaceful, but it is hard
to avoid conflict when it is on your
borders.
Model Misspecification and Spuriousness
• Alternative Variables: We also want
to control for variables that would
bias our results if omitted.
• In this case, the X variables in a
model would produce biased
estimates, undermining their validity
and producing error that leads to
inaccurate inferences.
Here is a spurious relationship from my
research
IGOs +
conflicts
+
+
Powerful states
Powerful states both in more IGOs and
conflicts, but these two variables not
directly related but a function of
state power.
Classic Spurious Case
???
Ice Cream Consumption
Crime
+
+
+
Summer Temperatures
Hence we see that despite the fact that ice
cream consumption is correlated with crime,
the real cause is that summer temperatures
increase both ice cream consumption and
crime.
Veronica Says, Beat Marshall!!!
Go Miners!!!
UTEP Fight!
UTEP Win!
I’m going to
Homecoming,
Are you?
Non-Experimental Designs
These studies use data collected or
aggregated from surveys, history, or
government indicators:
• Cross sectional studies
• Panel (cross sectional over a few time
points)
• Longitudinal (time series and pooled
cross-sectional time series)
• Case studies and focus groups
CROSS SECTIONAL Designs
• Statistical or case studies that compare
individuals or subjects across several
variables:
• Surveys comparing peoples’ political views
• Comparison of countries, groups,
organizations along different dimensions,
such as countries with different levels of
development (low, medium, high) relative to
other factors.
Non-Experimental Designs
These studies use data collected or
aggregated from surveys, history, or
government indicators:
• Cross sectional studies
• Panel (somewhat rare)
• Longitudinal (time series and pooled
cross-sectional time series)
• Case studies and focus groups
CROSS SECTIONAL Designs
• Statistical or case studies that compare
individuals or subjects across several
variables:
• Surveys comparing peoples’ political views
• Comparison of countries, groups,
organizations along different dimensions,
such as countries with different levels of
development (low, medium, high) relative to
other factors.
Cross-Sectional Data
ID
State
Abortions/1,000
women
%Bush Conservative score for
04
House delegation
1
Alabama
15
62.5
73
3
Arizona
19.1
54.8
67
4
Arkansas
11.1
54.3
48
5
California
33.4
44.4
41
6
Colorado
18
51.7
67.8
7
Connecticut
23
44
37.6
8
Delaware
34.4
45.8
40
10
Georgia
21.2
58
63.7
12
Idaho
5.8
68.4
90
13
Illinois
25.6
44.5
48.9
14
Indiana
10.6
59.9
69
15
Iowa
9.8
49.9
64.6
16
Kansas
18.3
62
75
Example of a Panel Study
State
Democracy
Illiteracy
HDI
Islamic
Argentina91
7
4.3
0.81
0
Argentina95
7
3.7
0.832
0
Argentina00
8
3.3
0.854
0
Armenia91
7
2.57
0.751
0
Armenia95
3
2
0.708
0
Armenia00
5
1.69
0.754
0
Australia91
10
0
0.892
0
Australia95
10
0
0.932
0
Australia00
10
0
0.942
0
Azerbaijan91
-3
3
.
1
Azerbaijan95
-6
3
.
1
Azerbaijan00
-7
3
0.746
1
Bangladesh91
6
65
0.417
1
Bangladesh95
6
61.9
0.445
1
Bangladesh00
6
59.2
0.497
1
Time Series
• Observations are made over time, which
can provide descriptive information or
used to test hypotheses.
• If testing hypotheses, we track data for a
dependent variable and at least one
independent variable over time (based on
some measure e.g. days, weeks, months,
or years)
Example of a Time Series: Presidential
Approval
Download