INSTRUMENTAL VARIABLES (Take 1): CONSTANT EFFECTS Josh Angrist MIT 14.387 (Fall 2014) 1 Organizing IV I tell the IV story in two iterations, first with constant effects, then in a framework with heterogeneous potential outcomes. • The constant effects framework focuses attention on the IV solution for selection bias and on essential IV mechanics • But first: Why do IV? • I can’t say "because the regressors are correlated with the errors." • As we’ve seen, regressors are uncorrelated with errors by definition • The (short) regression of schooling on wages produces residuals uncorrelated with schooling (that’s how the good lord made ’em) • The problem, therefore, must be that the regression you’ve got is not the regression you want (and that’s your fault!) 2 IV Goes Long • Suppose the causal link between schooling and wages can be written fi (s ) = α + ρs + η i • Imagine a vector of control variables, Ai , called “ability”; write η i = Ai γ + vi where γ is a vector of pop. reg. coeffi cients, so vi and Ai are uncorrelated by construction • We’d happily include ability in the regression of wages on schooling, producing this long regression: Yi = α + ρsi + Ai γ + vi (1) The error term here is the random part of potential outcomes, vi , left over after controlling for Ai • If E [si vi ] = 0, a version of the CIA, the population regression of Yi on si and Ai identifies ρ. That’s like saying: "Ai is the only reason schooling is correlated with potential outcomes." 3 IV and OVB • IV allows us to estimate the long-regression coeffi cient, ρ, when Ai is unobserved. The instrument, Zi , is assumed to be: (1) correlated with the causal variable of interest, si ; and (2) uncorrelated with potential outcomes • Here, "uncorrelated with potentials" means Cov (η i ,Zi ) = 0, or, equivalently, Zi is uncorrelated with both Ai and vi • This is a version of the exclusion restriction: Zi can be said to be excluded from the causal model of interest • Given the exclusion restriction, it follows from equation (1) that ρ = = Cov (Yi , Zi )/V (Zi ) Cov (Yi , Zi ) = Cov (si , Zi ) Cov (si , Zi )/V (Zi ) ”RF ” ”1st” • The IV estimator is the sample analog of (2) (2) 4 Two-stage least squares (2SLS) • In practice, we do IV by doing 2SLS. This allows us to add covariates (controls) and combine multiple instruments. Returning to the schooling example, a causal model with covariates is Yi = α0 Xi + ρsi + η i , (3) where η i is the compound error term, Ai γ + vi . The first stage and reduced form are si Yi = Xi0 π 10 + π 11 Zi + ξ 1i = Xi0 π 20 + π 21 Zi + ξ 2i (4) (5) • The reduced form is obtained by substituting (4) into (3): Yi = α0 Xi + ρ[Xi0 π 10 + π 11 Zi ] + ρξ 1i + η i = Xi0 [α + ρπ 10 ] + ρπ 11 Zi + [ρξ 1i + η i ] = Xi0 π 20 + π 21 Zi + ξ 2i (6) 5 2SLS Notes • Again, it’s all about the ratio of RF to 1st: π 21 =ρ π 11 In simultaneous squations models, the sample analog of this ratio is called an Indirect Least Squares (ILS) estimator of ρ • Where does two-stage least squares come from? Write the first stage as the sum of fitted values plus first-stage residuals: si = Xi0 π 10 + π 11 Zi + ξ 1i = ŝi + ξ 1i 2SLS estimates of (3) can be constructed by substituting first-stage fitted values for si in (3): Yi = α 0 Xi + ρŝi + [η i + ρξ 1i ], (7) and using OLS to estimate this "second stage" (a version of eq. 6) • In practice, we let Stata do it: "manual 2SLS" doesn’t get the standard errors right 6 2SLS example: Angrist and Krueger (1991) • AK-91 argue that because children born in late-quarters start school younger, they are kept in school longer by birthday-based compulsory schooling laws • There’s a powerful first stage supporting this: Schooling tends to be higher for late-quarter births; this is driven by high school and not college, consistent with the CSL story • The QOB first stage and reduced form are plotted in Figure 4.1.1 • The corresponding 2SLS estimates appear in Table 4.1.1 • 2SLS matches the QOB pattern earnings (the RF) to the QOB pattern in schooling (the first stage). • The exogenous covariates include year-of-birth and state-of-birth dummies, as well as linear and quadratic functions of age in quarters • QOB Questioned: Bound, Jaeger, and Baker (1995) and Buckles and Hungerman (2008) argued QOB is correlated with maternal characteristics. Allowing for this fails to overturn AK conclusions 7 2SLS is a many-splendored thing • 2SLS is the same as IV where the instrument is ŝi∗ , the residual from a regression of ŝi on Xi • One-instrument 2SLS equals IV, where the instrument is z̃i , the residual from a regression of Zi on the covs, Xi • One-instrument 2SLS equals indirect least squares (ILS), that is, the ratio of reduced form to first stage coeffi cients on the instrument. In other words, Cov (Yi , ŝi∗ ) V (ŝi∗ ) = = Cov (Yi , ŝi∗ ) Cov (si , ŝi∗ ) Cov (Yi , z̃i ) π 21 = Cov (si , z̃i ) π 11 • With more than one instrument, 2SLS is a weighted average of the one-at-time (just-identified) estimates (In a linear homoskedastic constant-effects model, this is effi cient) 8 Multi-Instrument 2SLS (details; mistakes) • Let ρj = Cov (Yi , Zji ) ; j = 1, 2 Cov (Di , Zji ) denote two IV estimands using Z1i and Z2i to instrument Di . • The 2SLS estimand is ρ2SLS = ψρ1 + (1 − ψ)ρ2 , where ψ is a number between zero and one that depends on the relative strength of the instruments in the first stage. • Angrist and Evans (1998) use twins and sex-mix instruments • Using a twins-2 instrument alone, the IV estimate of the effect of a third child on female labor force participation is -.084 (s.e.=.017). The corresponding samesex estimate is -.138 (s.e.=.029). • Using both instruments produces a 2SLS estimate of -.098 (.015). • The 2SLS weight in this case is .74 for twins, .26 for samesex, due to the stronger twins first stage. 9 2SLS Mistakes 2SLS . . . so simple a fool can do it . . . and many do! What can go wrong? • As explained in MHE 4.6.1, three mistakes have yet to be relegated to the dustbin of IV history: • Manual 2SLS • Covariate ambivalence • Forbidden regressions (from the left and the right) • These can be interpreted as the result of failed attempts to get round hard-wired 2SLS protocols • Avoid temptation: let Stata do it! 10 2SLS Lingo • These terms come to us from simultaneous equations modeling, the intellectual birthplace of IV: • Endogenous variables are the dependent variable and the independent variable(s) to be instrumented; in a simultaneous equations model, endogenous variables are determined by solving the system • To treat an independent variable as endogenous is to instrument it, i.e., to replace it with fitted values in the 2SLS second stage • Exogenous variables include covariates (not instrumented) and the excluded instruments themselves. In a simultaneous equations model, exogenous variables are determined outside the system • In any IV study, variables are either: dependent or (other) endogenous variables, instruments, or covariates • If you’re unsure what’s what, or find yourself asking variables to play more than one role . . . seek counseling 11 The Wald estimator • How were Vietnam-era vets affected by their service? • Let Di indicate veterans. A causal constant-effects model is: Yi = α + ρDi + η i , (8) where η i and Di may be correlated. B/c Zi is a dummy, Cov (Yi , Zi ) = E [Yi |Zi = 1] − E [Yi |Zi = 0], V (Zi ) with an analogous formula for ρ= Cov (Di ,Zi ) . V ( Zi ) It follows that, Cov (Yi , Zi ) E [Yi |Zi = 1] − E [Yi |Zi = 0] = Cov (Di , Zi ) E [Di |Zi = 1] − E [Di |Zi = 0] (9) • A direct route uses (8) and E[ηi |Zi ] = 0: E[Yi |Zi ] = α + ρE[Di |Zi ] (10) Solving this for ρ produces (9) 12 Earnings Consequences of Vietnam-Era Military Service (Angrist, 1990) • Key variables Zi Di = randomly assigned draft-eligibility in the 1970-72 draft lotteries = a dummy indicating Vietnam-era veterans • The causal effect of Vietnam-era military service is the difference in average earnings by draft-eligibility status (RF) divided by the difference in the probability of service (first stage): Cov (Di , Zi ) V (Zi ) = E [Di |Zi = 1] − E [Di |Zi = 0] = P [Di = 1|Zi = 1] − P [Di = 1|Zi = 0] • See RF, first stage, and IV in Angrist (1990), Figures 1-2 and MHE Table 4.1.3 (based on Angrist 1990,Table 3). • Draft lottery updates: Angrist, Chen, and Song (2011) 13 Multiple groups and 2SLS • There’s more to the draft lottery than draft-eligibility: Angrist and Chen (2008), Figure 1 • Let Ri denote draft lottery numbers. The draft-eligibility Wald estimator uses 1[Ri < 195] as an instrument in a just-identified setup • The first stage that uses everything we know can be written: E [Yi |Ri ] = α + ρP [Di = 1|Ri ], (11) since P [Di = 1|Ri ] = E [Di |Ri ]. Suppose Ri ∈ j = 1, ...,J. We can estimate ρ using J grouped obs by fitting ȳj = α + ρp̂j + η̄ j (12) • Effi cient GLS for grouped data in a constant-effects linear model is weighted least squares, in this case weighted by the variance of η̄ j (Prais and Aitchison, 1954). If η i has variance σ2η , the grouped variance is σ2η nj , where nj is the group size. 14 Visual IV, Grouping, and GLS • Equation (12) in action: Angrist (1990), Figure 3. This illustrates visual instrumental variables (VIV) • GLS (weighted least squares) applied to equation (12) is 2SLS • The instruments in this case are dummies for each lottery-number cell. Define Zi ≡ {rji = 1[Ri = j ]; j = 1, ...J − 1}. The first stage for Di on Zi plus a constant is saturated, so fitted values are cond. means, p̂j , repeated nj times for each j. The second stage slope estimate is therefore weighted least squares on the grouped equation, (12), weighted by the cell size, nj • Because GLS is effi cient, 2SLS is also the effi cient linear combination of the underlying just-identified IV (Wald) estimates (earlier, we saw that 2SLS is a weighted average of just-identified estimates in a two-instrument example) • That’s why we call Figure 3 "VIV" 15 Specification Testing [TSIV] Suppose the residuals, ηi , are conditionally homoskedastic. GLS on the grouped equation, (12), chooses parameter estimates a and b to minimize ĴN (a, b ) = (1/σ2η ) × ∑ nj (ȳj − a − bp̂j )2 (13) j (If the residuals are heteroskedastic, replace constant σ2η with σ2j , the variance of η i in group j) • The minimized GLS minimand is the (Sargan) over-id test statistic for 2SLS estimates constructed using group dummies as instruments (MHE 4.2.2). This test statistic is distributed χ2 (J − 2) if J groups are used to estimate a slope and intercept • Over-id for dummy instruments measures the fit of the line connecting ȳj and p̂j in a VIV plot like Fig. 3 in Angrist (1990) • The over-identification test statistic is also a (Wald) test statistic for equality of a full set of linearly independent Wald estimates (implied by Newey and West (1987); see Angrist (1991)) 16 Two-Sample IV • Let {Yj , Wj , Zj ; j = 1, 2} be data from two samples, where Wj and Zj Z 0W include exog covs. Angrist (1990) constructs (first-stage) 2N 2 2 from military records, while Social Security records were used to construct Z 0Y (reduced form) N1 1 1 • AK-95 and Inoue and Solon (2010) simplify: First-stage fits in ds2 are (Z20 Z2 )−1 Z20 W2 . Carry over to ds1 by constructing the cross-sample fitted value, Ŵ12 ≡ Z1 (Z20 Z2 )−1 Z20 W2 . The second stage for this version of TSIV (which AK-95 call SSIV) regresses Y1 on Ŵ12 . The cross-sample fitted value is ŵ12,i = Z1i π̂ 2 , where π̂ 2 is the first-stage effect estimated using ds2 and the Z1i (i = 1, ..., N1 ) are the instruments in ds1 • Manual 2SLS, yikes! Inoue and Solon (2010) get the standard errors right, among other TSIV improvements 17 The Bias of 2SLS • Cross-section OLS estimates are typically unbiased for the pop BLP, as well as consistent, but this might not be the regression you want • 2SLS estimates are consistent for causal effects but biased towards OLS estimates • Endogenous var. is vector x; dep. var. is vector y ; no covs: y = βx + η (14) The N ×Q matrix of instruments is Z , with first-stage x = Z π + ξ (15) Outcome error η i is correlated with ξ i . Instruments are uncorrelated with ξ i by construction and with η i by assumption • The 2SLS estimator is � β2SLS = x 0 PZ x −1 x 0 PZ y = β + x 0 PZ x −1 x 0 PZ η where PZ = Z (Z 0 Z )−1 Z 0 is the projection matrix that produces fitted values 18 The Bias of 2SLS (cont.) • Substituting for x in x 0 PZ η, we get b � β2SLS − β = = x 0 PZ x −1 π 0 Z 0 + ξ 0 PZ η x 0 PZ x −1 π 0 Z 0 η + x 0 PZ x (16) −1 ξ 0 PZ η (17) • Expectation of the ratios on the right hand side of (16) are closely approximated by the ratio of expectations: � E [b β2SLS − β] ≈ E [x 0 PZ x ] −1 E [ π 0 Z 0 η ] + E [ x 0 PZ x ] −1 E [ ξ 0 PZ η ] . This Bekker (1994) approximation ("group asymptotics" in AK-95) gives a good account of finite-sample behavior • Using the fact that E [π 0 Z 0 ξ ] = 0 and E [π 0 Z 0 η ] = 0, we have −1 b E [� β2SLS − β] ≈ E π 0 Z 0 Z π + E (ξ 0 PZ ξ ) E ξ 0 PZ η (18) • 2SLS is biased b/c E ξ 0 PZ η 6= 0 unless η i and ξ i are uncorrelated 19 The Bias of 2SLS: First-stage F • Manipulation of (18) generates: σηξ E [� βb2SLS − β] ≈ 2 σξ E (π 0Z 0Z π ) /Q +1 σ2ξ −1 (1/σ2ξ )E (π 0 Z 0 Z π ) /Q is the "population F" for joint significance of instruments in first-stage, so we can write σηξ 1 b E [� (19) β 2SLS − β ] ≈ σ2ξ F + 1 • As F gets small, the bias of 2SLS approaches OLS estimator is σηξ , σx2 which also equals σηξ σ2ξ σηξ . σ2ξ The bias of the if π = 0. 2SLS estimates are therefore said to be "biased towards OLS estimates" when there isn’t much of a first stage. On the other hand, the bias of 2SLS vanishes when F gets large, as it should happen in large samples 6 0. when π = 20 The Bias of 2SLS: First-stage F (cont.) • First-stage F varies inversely with the number of instruments if they’re weak. • Adding instruments with no effect on the first-stage R-squared, the model sum of squares, E (π 0 Z 0 Z π ), and the residual variance, σ2ξ , are fixed while Q increases • The F-statistic shrinks as a result. From this we learn that the addition of weak instruments increases bias • Holding the first-stage sum of squares fixed, bias is least in the just-ID case when the number of instruments is as low as it can get • 2SLS bias is a consequence of first-stage estimation error. We’d like to use b xpop = Z π as IVs since these fits are uncorrelated with the � second stage error • In practice, we use b � x = PZ x = Z π + PZ ξ, which differs from � b xpop by the term PZ ξ • 2SLS bias arises from the corr between PZ ξ and η 21 IV without bias or tears • Just-identified 2SLS (say, the Wald estimator) is approximately unbiased (this isn’t clear from the Bekker sequence). The just-ID sampling distribution has no moments, yet it’s approximately centered where it should be unless the instruments are really weak • The Reduced Form is unbiased: if you can’t see the relationship you’re after in the reduced form, it ain’t there! In just-identified models, the p-value for the reduced-form effect of the instrument is approximately the p-value from the second stage. (Chernozhukov and Hansen, 2008, use this to do bias-free inference) • LIML is approximately median-unbiased for over-identified constant-effects models, and therefore provides an attractive alternative to just-identified estimation using one instrument at a time (see, e.g., Davidson and MacKinnon, 1993, and Mariano, 2001). (LIML=2SLS in just-identified models) 22 Alternative Estimators • Riff on the SSIV idea: JIVE (Angrist, Imbens, and Krueger, 1999) removes bias using leave-out first stage fits for each observation (the fitted value for observation i is Zi π̂ (i ) where π̂ (i ) is an estimate that throws out observation i). JIVE sounds appealing, but AIK and others have found that it rarely beats LIML) • The right linear combination of OLS and 2SLS should be approximately unbiased. It turns out that LIML is just such a "combination estimator" (see the working paper version of AIK-99). You might also try Fuller’s (1977) modified LIML, discussed by Hahn, Hausman, and Kuersteiner (2004). Fuller may be more precise than LIML in finite samples since it has moments • Hausman, et al. (2007) modify LIML and Fuller to deal with heteroscedasticity • Kolesar et al. (2011) modify LIML to allow for random effects 23 Monte Carlo yi = βxi + η i xi = Q ∑ πj zij + ξ i j =1 with β = 1, π 1 = 0.1, π j = 0 ∀j > 1, joint normal errors with corr (η i , ξ i ) = .8, where the instruments, zij , are independent, standard normals. The sample size is 1000. • Figure 4.6.1: OLS, just identified IV (Q=1, labeled IV; F=11.1), 2SLS (Q=2, labeled 2SLS; F=6.0), LIML (Q=2) • Figure 4.6.2: OLS, 2SLS, and LIML with Q=20 (1 good instrument, 19 worthless; F=1.51) • Figure 4.6.3: OLS, 2SLS, and LIML with Q=20 but π j = 0; j = 1, ..., 20 (all 20 worthless; F=1.0) • Quarter of birth estimates of the returns to schooling (reprise): Table 4.6.2 24 Tables and Figures 25 A. Average Education by Quarter of Birth (first stage) 13.2 4 13.1 2 3 Years of Education 13 4 4 4 4 3 12.8 4 3 12.7 4 2 4 3 4 4 2 1 3 3 12.9 3 1 2 1 3 2 1 2 1 12.6 1 2 A. Average Education by Quarter of Birth (first stage) 3 12.5 2 1 2 3 1 1 2 12.4 12.3 1 12.2 30 31 32 33 34 35 36 37 38 39 Year of Birth B. Average Weekly Wage by Quarter of Birth (reduced form) 5.94 5.93 B. Average Weekly Wage by Quarter of Birth (reduced form) Log Weekly Earnings 5.92 4 3 4 3 1 2 1 2 3 3 4 2 2 5.9 3 3 3 5.91 3 2 3 4 4 2 4 2 4 3 2 2 1 5.89 4 4 4 1 1 1 1 2 1 1 5.88 1 5.87 5.86 30 31 32 33 34 35 36 37 38 39 Year of Birth © Princeton University Press. All rights reserved. This content is excluded from our Creative Commons license. For more information, see http://ocw.mit.edu/help/faq-fair-use/. 26 Table 4.1.1 2SLS estimates of the economic returns to schooling OLS Years of education Exogenous Covariates Age (in quarters) Age (in quarters) squared 9 year-of-birth dummies 50 state-of-birth dummies Instruments dummy for QOB = 1 dummy for QOB = 2 dummy for QOB = 3 QOB dummies interacted with year-of-birth dummies (30 instruments total) 2SLS (1) (2) (3) (4) (5) (6) (7) (8) .071 (.0004) .067 (.0004) .102 (.024) .13 (.020) .104 (.026) .108 (.020) .087 (.016) .057 (.029) Notes: The table reports OLS and 2SLS estimates of the returns to schooling using the Angrist and Krueger (1991) 1980 census sample. This sample includes native-born men, born 1930–39, with positive earnings and nonallocated values for key variables. The sample size is 329,509. Robust standard errors are reported in parentheses. QOB denotes quarter of birth. © Princeton University Press. All rights reserved. This content is excluded from our Creative Commons license. For more information, see http://ocw.mit.edu/help/faq-fair-use/. 27 TABLE 6. IV REGRESSIONS ON RETURNS TO EDUCATION: RESULTS FROM THE CENSUS Years of Education Family Controls? Wages, Logged: QOB Instruments 0.103 0.147 [0.083] [0.081] No Yes Wages, Logged: Year*QOB Instruments 0.075 0.09 [0.040] [0.040] No Yes Wages, in Levels: QOB Instruments 33.16 49.16 [24.21] [23.5] No Yes Wages, in Levels: Year*QOB Instruments 24.13 30.94 [11.55] [10.59] No Yes Instruments QOB QOB YOB*QOB YOB*QOB QOB QOB YOB*QOB YOB*QOB Age Controls? Yes Yes Yes Yes Yes Yes Yes Yes State Dummies? Yes Yes Yes Yes Yes Yes Yes Yes Year Dummies? Yes Yes Yes Yes Yes Yes Yes Yes Weights? Yes Yes Yes Yes Yes Yes Yes Yes Notes: Robust standard errors in brackets. Observations are county-of-birth/quarter-of-birth/year-of-birth cells and all regressions weight by total individuals reporting positive earnings in a cell. The dependent variable in the first two pairs of regressions is the log of average wages in a cell, in the last two pairs of regressions it is the average of cell wages in levels. Regressions are from cohorts of males born between 1944 and 1960; see Table 5 for a description of family characteristic and wage and age variables. Courtesy of Kasey Buckles and Daniel M. Hungerman. Used with permission. 28 Courtesy of Joshua Angrist and the American Economic Association. Used with permission. 29 Table 4.1.3 IV Estimates of the Effects of Military Service on the Earnings of White Men born in 1950 Earnings Mean Eligibility Effect Mean Eligibility Effect Wald Estimate of Veteran Effect (1) (2) (3) (4) (5) 1981 16,461 -435.8 (210.5) 1971 3,338 -325.9 (46.6) 1969 2,299 -2.0 (34.5) Earnings year Veteran Status .267 .159 (.040) -2,741 (1,324) -2050 (293) Note: Adapted from Table 5 in Angrist and Krueger (1999) and author tabulations. Standard errors are shown in parentheses. Earnings data are from Social Security administrative records. Figures are in nominal dollars. Vete status data are from the Survey of Program Participation. There are about 13,500 individuals in the sample. © Princeton University Press. All rights reserved. This content is excluded from our Creative Commons license. For more information, see http://ocw.mit.edu/help/faq-fair-use/. 30 0.15 0.1 0.05 0 ‐0.05 ‐0.1 ‐0.15 ‐0.2 ‐0.25 estimate estimate + 1.96*se ‐0.3 estimate ‐ 1.96*se ‐0.35 1970 1973 1976 1979 1982 1985 1988 1991 1994 1997 2000 2003 2006 Figure 1. Draft‐lottery Estimates of Vietnam‐era Service Effects on ln(Earnings) for White Men Born 1950‐52 31 Courtesy of Joshua D. Angrist, Stacey H. Chen, and the American Economic Association. Used with permission. 32 Courtesy of Joshua Angrist and the American Economic Association. Used with permission. 33 0 .25 .5 .75 1 4.7. APPENDIX 0 .5 1 x OLS 2SLS 1.5 2 2.5 IV LIML Figure 4.6.1: Distribution of the OLS, IV, 2SLS, and LIML estimators. IV uses one instrument, while 2S and LIML use two instruments. © Princeton University Press. All rights reserved. This content is excluded from our Creative Commons license. For more information, see http://ocw.mit.edu/help/faq-fair-use/. 34 1 .75 .5 .25 0 0 .5 1 x OLS LIML 1.5 2 2.5 2SLS Figure 4.6.2: Distribution of the OLS, 2SLS, and LIML estimators with 20 instruments © Princeton University Press. All rights reserved. This content is excluded from our Creative Commons license. For more information, see http://ocw.mit.edu/help/faq-fair-use/. 35 1 .75 .5 .25 0 0 .5 1 x OLS LIML 1.5 2 2.5 2SLS Figure 4.6.3: Distribution of the OLS, 2SLS, and LIML estimators with 20 worthless instruments © Princeton University Press. All rights reserved. This content is excluded from our Creative Commons license. For more information, see http://ocw.mit.edu/help/faq-fair-use/. 36 Table 4.6.2 Alternative IV estimates of the economic returns to schooling (1) 2SLS LIML F-statistic (excluded instruments) Controls Year of birth State of birth Age, age squared Excluded instruments Quarter-of-birth dummies Quarter of birth*year of birth Quarter of birth*state of birth Number of excluded instruments (2) .105 .435 (.020) (.450) .106 .539 (.020) (.627) 32.27 .42 (3) (4) (5) (6) .089 (.016) .093 (.018) 4.91 .076 (.029) .081 (.041) 1.61 .093 (.009) .106 (.012) 2.58 .091 (.011) .110 (.015) 1.97 180 178 3 2 30 28 Notes: The table compares 2SLS and LIML estimates using alternative sets of instruments and controls. The age and age squared variables measure age in quarters. The OLS estimate corresponding to the models reported in columns 1–4 is .071; the OLS estimate corresponding to the models reported in columns 5 and 6 is .067. Data are from the Angrist and Krueger (1991) 1980 census sample. The sample size is 329,509. Standard errors are reported in parentheses. © Princeton University Press. All rights reserved. This content is excluded from our Creative Commons license. For more information, see http://ocw.mit.edu/help/faq-fair-use/. 37 MIT OpenCourseWare http://ocw.mit.edu 14.387 Applied Econometrics: Mostly Harmless Big Data Fall 2014 For information about citing these materials or our Terms of Use, visit: http://ocw.mit.edu/terms.