Sample - Research Methodology and Methods of Social Inquiry

advertisement
GSSR
Research Methodology and Methods of Social Inquiry
www.socialinquiry.wordpress.com
November 15, 2011
Representation of Units of Analysis. Sampling
Population & Sample
Population: everybody whom we want to generalize to.
- no strict rules to follow; the researcher must rely on logic
and judgment.
The population is defined in keeping with the objectives of
the study.
Census study: data are gathered on every member of the
population.
Parameters:
If we measure the entire population & calculate a value (e.g.
mean, st. deviation), we do not refer to this as a statistic; 
parameter of the population
Problems: time-consuming; expensive.
Sampling
• .
Sampling
The sample should reflect the characteristics of the
population from which it is drawn.
Sampling Frame:
the listing of the (reachable) population from which we draw
the sample.
Sample:
the group of people who we select to be in our study.
Statistic: when we look across responses for entire sample,
we use a statistic
- nonrespondents; dropouts.
Sampling methods:
method by which units of observation are selected
random (probability) or non-random sampling
Random Sampling
Basic requirements of a random sample:
Every case has:
1) the same chance of being selected
2) the chance of selection does not change (i.e. constant
probability).
The case that is selected (theoretically) is put back in the pot =
“sampling with replacement”.
- allows calculating the probability that we have
represented the population well;
- automatically eliminates selection bias in large N studies;
- allows to estimate sampling error
Basic Terms
N = number of cases in the sampling frame
n = number of cases in the sample
NCn = number of combinations (subsets) of n from N
f = n/N = the sampling fraction
Simple random sampling
Stratified random sampling (proportional/quota random
sampling)
Systematic random sampling
Cluster (Area) random sampling
Random route sampling
Multi-stage random sampling
See www.socialresearchmethods.net/kb/sampprob.php
Nonrandom Samples
- selected cases do not have an equal chance of
selection (some people have a greater, but
unknown, chance than others to be selected)
1. Convenience sampling
2. Purposive sampling
- likely to overweight subgroups in the population that are
more readily accessible.
Modal Instance Sampling
- sampling the most frequent case (i.e. "typical" case)
Expert Sampling
- sample of persons with known/demonstrable experience
& expertise in some area: "panel of experts;"
- often, used in combination with modal sampling, as
means of validation of the former.
Quota Sampling: non-random selection of people according to
some fixed quota.
a. Proportional quota sampling: want to represent the major
characteristics of the population by sampling a proportional amount
of each.
Ex: Population has 40% women, 60% men. We want N = 100.
Problem: decide the specific characteristics on which to base the
quota.
b. Non-proportional quota sampling: specify the minimum no. of
sampled units we want in each category.
Goal: have enough cases to assure that you will be able to talk
about even small groups in the population.
- non-probabilistic analogue of stratified random sampling (i.e.
used to assure that smaller groups are adequately
represented in the sample).
Heterogeneity (diversity) Sampling
- want to include all opinions or views, without concern
about representing these views proportionately.
Ex: brainstorming (including concept mapping)
Snowball Sampling
- begin by identifying someone who meets the criteria for
inclusion in your study; then ask them to recommend
others who they may know who also meet the criteria;
Respondent Driven Sampling (RDS)
- combines "snowball sampling" with a mathematical
model that weights the sample to compensate for the
fact that the sample was collected in a non-random way.
Heckathorn, Douglas D. 1997." Respondent-Driven Sampling: A New
Approach to the Study of Hidden Populations." Social Problems.
http://www.respondentdrivensampling.org/
Selection Bias
-
distortion of analysis, resulting from data collection method;
-
can occur at various(multiple) points in the research design
Selecting on the DV
- allow for the possibility of at least some variation on DV!
If we do not take into account other instances when DV takes other
values, we can learn nothing about the causes of the DV.
King et al. discussion of Porter’s (1990) cross-national work on competitive
advantage for contemporary industries and firms (see p. 133-134 in King et
al. 1994)
Historical records as data source:
History differentially selects what it ‘keeps’ according to a set of rules
that are not always clear from the record.
• Target population --|> Frame population: Coverage error
• Frame population --|> Selected sample: Sampling error
• Selected sample --|> Collected sample: Non-response error
Coverage error & non-response error as the most serious
errors in both qualitative and quantitative research
Indeterminate Research Designs
Research design: plan that shows, through the discussion
of the causal model (theoretical) & the data, how we
expect to make inferences.
A research design should not be indeterminate!
1. More Inferences than Observations (units of
observations/cases)
Rule: One fact (observation) cannot give independent
information about more than another fact.
Ex: a study with 1 observation (units), and 2 causal (IV)
variables, cannot determine which, if any of the
hypotheses is correct.
Observations
(cases)
1
Variables
X1
X2
Y (DV)
3
5
35
Y = expected value of the DV
Regression equation: Y = X1*B1 + X2*B2 + e
Prediction equation E(Y) = X1*B1 + X2*B2
Where E(Y) = expected value of Y; here E(Y) = 35.
In practice, we never know this, because of the
randomness inherent in Y (i.e. because of e)
Observations
(cases)
1
Variables
X1
X2
Y (DV)
3
5
35
E(Y) = X1*B1 + X2*B2
This equation has no unique solution.
B1 = 10, B2 =1;
B1 = -10, B2 = 13
35 = 3* B1 + 5*B2
DV: successful joint collaboration on capital-defense
projects (high-tech weapon system) King et al. Pp. 119-122
Observations
(cases)
1: Countries A
&B
2: Countries
C&D
3: Countries
D&E
Variables
Geographical
proximity
Frequency
Of
Negotiations
Ec.
Resources
Type of
pol.
regime
Existence
of
other coop
...
…
Final product
(cooperation or
not)
Download