Lecture 21 - BYU Department of Economics

advertisement
Econ 388 R. Butler 2014 revisions lecture 21
I. Endogeneous Variables
The assumption E (  | X )  0 is very important in the regression setup, and assumes,
among other things, that none of our right hand sign regressors are correlated with the
error in the regression model. If this is true, the variables are said to be exogenous in the
equation. Then regressors that are correlated with the error are said to be endogenous.
There are three common reasons to suspect endogeneity in the model, and instrumental
variable estimation can address them all. Hence, chapter 15 is very important in the
course.
Three reasons when E (  | X )  0 :
1.Omitted Variables. Section 15.1 treats this very well and provides a nice introduction
to instrumental variable estimation: What to do when one of the regressors are correlated
with the error? The textbook case given there has the true log(wage) regression depend
on education and ability, where ability is unobserved. If we ignore ability in the
specification, it becomes part of the error term, and if it is correlated with education, then
this induces an omitted variable bias and our estimate of the impact of education on
wages will be inconsistent. We can still consistently estimate the effect of education on
wages, however, when we can find an instrumental variable that is correlated with
education and uncorrelated with ability (see the discussion in 15.1 for possible candidates
for an instrumental variable in this case).
2. measurement error. Suppose we want to measure the effect of back injury severity on
the cost of back pain treatments, but our self reported measures of back pain (RolandMorris scale, the back-pain index, and the leg-pain index) are known to be measured with
error. Then if use the Roland Morris index, our estimate of its impact will be bias
towards zero (this is called attenuation bias). Let RM* be the true level of pain, and RM
be the measured level of pain (given by the Roland-Morris scale):
RM=RM* + 
 is independent of RM* (that is, the measurement error,  , is not systematic), but RM
obviously depends on  . However, RM* should be in the equation, but RM is there
instead. Hence the true equation is
Costs  0  1 RM *  X 2   , but we estimate
𝐶𝑜𝑠𝑡𝑠 = 𝛽0 + 𝛽1 𝑅𝑀 + 𝑋𝛽2 + (𝜇 − 𝛽𝜀)
Hence, the error in this last estimable equation is, by construction, correlated with RM
(through  ), even though, by assumption,  is uncorrelated with X , RM*, and  . What
can be done in this case is to use the other (imperfectly measured) indices of back pain,
the back-pain index (“On a score of 0 to 100, how do you rank your back pain?”) and the
leg-pain index, as instruments for RM. This indeed, will fix the attenuation bias.
1
3. Simultaneity. When one of the right hand side variables are jointly determined along
with the outcome variable, y. This is more difficult, more tricky to get right, because we
must be able to imagine a time when we could vary this right hand side variable
exogeneously, even if in the data we have the y and r.h.s. endogenous variable were
determined together. More on this below.
II. INSTRUMENTAL VARIABLE ESTIMATION
Review from lecture 10:
If you take the standard model and pre-multiply both sides by X’ you go from:
Y  X  
to X ' Y  X ' X  X ' 
In the standard model, X '  is unknown (  is unknown), but it was zero expectation and
its plim is zero as well. If this is the case then we X ' Y  X ' X which we can solve for
 . Note that this procedure will also work—more generally--with an appropriately
dimensioned Z matrix, which we can also pre-multiply the standard model by to get
Z ' Y  Z ' X  Z ' 
Z will work in this role (given it has the right size so that Z ' X can be inverted) as long as
two conditions are met
in “small” samples: 1) Z is correlated with X, and 2) Z is uncorrelated with 
1
1
in “large” samples: 1) plim [ Z ' X ] 1   ZX1 , and 2) plim [ Z '  ]  0 .
n
n
What is the best Z? The one that is most correlated with the Xs but uncorrelated with (or
un-plim-ed with) the error term. If the model assumptions of the classical regression
model hold, then this best Z is X. Although g(.)-transforms of X would work too (as
elements of the Z matrix, see Wooldridge chapter 5), they wouldn’t be quite as efficient
as X itself (among unbiased or consistent estimators, these alternative estimators
wouldn’t have as small variance as the OLS estimators have).
But sometimes (like when we have simultaneous equation problems), it may not be the
1
case that X '  has zero expectation, nor will plim [ X '  ] =0 . But there may be
n
instrumental variable matrix Z consisting of n-rows that does have zero expectation, or at
1
least is such that plim [ Z '  ]  0 . Then these instrumental variables could be used to
n
get consistent estimates of  . From the results above, the instrumental variables
estimator would be
ˆ IV  (Z ' X ) 1 Z 'Y
And the “instrumental” variable estimator of our model would be consistent since
2
1
1
plim ˆ IV  plim [ Z ' X ] 1 plim[ Z ' ( X '    )]
n
n
1
1
=   plim [ Z ' X ] 1 plim [ Z '  ]
n
n
= 
If the Zs are as well behaved as the Xs, then we can apply central limit theorem results to
1
1
n ( ˆ IV   )  [ Z ' X ]1[
Z' ]
n
n
to get the asymptotic distribution of the estimator: the instrumental variable estimator will
be asymptotically normally distributed with mean  and an asymptotic covariance that
can be estimated as
s2 ( Z ' X ) 1 ( Z ' Z )( X ' Z ) 1
(Note that is the generalization of the result in chapter 15 of Wooldridge; see also below
under the 2SLS).
III. Simultaneity: Structural and Reduced form presentations of a model
Suppose we want to estimate, from the CPS March supplement, a labor supply education
in which the hours of work (Hi) depends on the wage rate (Wi)and non-wage income (Oi).
However, we realize that the observed hours of work for each individual is not only the
result of what they want to do (demand) but also of the opportunities that are offered to
them (supply). Is it important
Alternative Representations
A. Structural Representation (Based on Theory)
Demand: Wi = 
Supply:

1
< 0, 
1
Hi + 
0
+  2Ei +  i
Hi =  1Wi +  0 +  2Oi +  i
2
> 0,  1> 0,  2< 0
where W, H, E and O denote wage
rate. hours worked, educational
attainment, and other (non-wage)
income.  i ,  i are the errors in the
respective demand and supply equations.
We would like to estimate the supply equation, but realize that one of the assumptions is
violated. Namely, Wi and  i are correlated: as  i increases in the supply equation, then
Hi increases in the supply equation as well. But from the demand equation, we see that
higher Hi yields a lower Wi. So changes in Wi are correlated with changes in  i . This
makes OLS estimates inefficient, biased, and inconsistent.
3
B.
Reduced Form Representation
The reduced form representation is logically equivalent to the structural
representation and involves expressing each dependent variable as a function of the
independent variables. In the previous example the reduced form representation for the
labor supply equation is obtained by solving for W by substituting the labor demand
expression in the labor supply expression and simplifying:
H i  1 (1 H i   0   2 Ei   i )   0   2 Oi   i , or
H i (1  11 )  (1 0   0 )  1 2 Ei   2Oi  ( i  1 i ) , or
H i   0   1 Ei   2 Oi   * i
(1  0   0 )
(   1 i )
1  2
2
, 1 
, 2 
, and  i*  i
.
(1  1 1 )
(1   1 1 )
(1  1 1 )
(1  1 1 )
A similar derivation for the wage rate yields Wi   0   1 Ei   2 Oi   * i
(    0 )
(  1  i )
 2 1
2
Where  0  0 1
, 2 
, 1 
, and  i*  i
,
(1   1 1 )
(1  1 1 )
(1  1 1 )
(1   1 1 )
where the endogeneous variables (H, W are jointly determined within this supply and
demand framework) are written as a function of all of the exogeneous variables (E, O are
variables determined outside of this supply and demand framework).
where  0 
The reduced form representation is generally used for purposes of obtaining
predictions and the  i s can be interpreted as "multipliers." This can be illustrated by our
example.
Hi =  1Wi +  0 +  2Oi
W1
W0
Wi = 
1
Hi + 
0
+  2Ei
H   2 E
H0
H1
 H   2 E
where “ W0 , H 0 ” is the equilibrium when eduation equals E, and “ W1 , H 1 ” is the
equilibrium when education equals E + E , that is, it is the new equilibrium after
education has increased by E . The reduced form representation expresses the
equilibrium values as a function of E and O. Note that the structural coefficients of the
independent variables indicate the magnitude of shifts of the graphs of the structural
relations; whereas, the reduced form coefficients are useful in characterizing values
associated with the actual observed changes in the equilibrium values (those with the
little bar over them).
4
Based upon the restrictions on the signs of  s and  s, can we unambiguously
deduce the signs of the  s?
Identification. : if you can retrieve the structural estimates ( s ,  s ) from the reduced
form parameters (  s ), then model is identified. Reduced form parameters can always be
estimated; structural parameters can not be unambiguously estimated unless there are
identified. (The identification problem is whether there are more than one (  s,  s )
associated with a particular set of  -values for the reduced form equations (for both H
and W).) If there is a unique set of  s,  s associated with the reduced form
coefficients, then the equations are identified. This is a topic discussed in chapter 16 of
Wooldridge, which we will not discuss further in this course.
STATA commands to estimate the above model is as follows:
ivreg h o (w=e); *for the hours model where (rhs endo. vars=omitted exogen. vars);
iverg w e (h=o); *for the wage model;
where the rhs endogeneous (w) is instrumented with e (plus o), as indicated in the
parenthesis (instrumental variables are e,o).
5
Download