Causality in Social Work Research via Instrumental Variable Estimation Roderick A. Rose Senior Research Associate Carolina Institute for Public Policy The University of North Carolina at Chapel Hill Summary • Rationale: endogeneity & causality • Instrumental variable estimation • The exclusion restriction: justification and identification; logic; example • Recommendations Rationale: Endogeneity & Causality Endogeneity is pervasive throughout nonexperimental research. What is Endogeneity? This is a Directed Acyclic Graph, or DAG. Causality flows in the direction of the arrow. This DAG represents what we might want to show (3 examples), representing a causal relationship between X and Y. X Y Rental housing assistance Employment Career-relevant instruction Math achievement Health insurance Health at time t What is Endogeneity? Endogeneity occurs as an undesired relationship between variables in this causal relationship and other variables not accounted for in the model. Education Level C X Rental housing assistance Y Employment Eliminating endogeneity requires removing one of the arrows. What is Endogeneity? The arrows corresponding to the confounder may go in any direction; endogeneity has no regard for the direction of the arrow. Teaching quality C X Career-relevant instruction Y Math achievement Note that any relationships not modeled occur through the error term. This is the definition of endogeneity: a correlation between an effect an the error. What is Endogeneity? When simultaneity or reverse causality are needed, they are represented in a DAG by lagged versions of variables. Health at time t-1 C b<0 X Public Health insurance Y b>0 Health at time t A Common Risk in Social Work Research? When simultaneity occurs in the context of cross-sectional data collection (i.e., one time point), we cannot be sure which effect we are observing. X Public Health insurance Y b>0 or b<0 Health Note that this is not technically a DAG (because of the two headed arrow). Description vs. Causality It is entirely appropriate to use regression methods to describe relationships among variables without inferring causality. Race/ethnicity C X Y Public Health insurance Health at time t This curved line represents an unspecified relationship between two variables; implicit in this is that the relationship occurs through or because of other variables. Instrumental Variable Estimation Notation X = EV = endogenous variable Y = DV = dependent variable Z = IV = instrumental variable or instrument The Instrumental Variable or IV In large samples, the instrument is a variable Z that is highly correlated with X and uncorrelated with the error. Health at time t-1 Not measured (x-sect) Z C X Y Public Health insurance Health at time t The IV must effectively randomize participants to conditions of X. Like random assignment, it cannot be confounded by any C. IV Estimator using Covariance (Diagram) Circles represent variance; Source: Kennedy, 2003. error in Y Y = DV Used to estimate βOLS Note the loss of efficiency. Used to estimate βIV X = Endog Var Note overlap with error Z = Instr Var No overlap with error No overlap with Y outside of X IV Estimator (Diagram) Circles represent variance; Source: Kennedy, 2003. Y = DV Predict X’s covariance with Y Predicted X X = Endog Var Z = Instr Var IV Estimator (Diagram) Circles represent variance; Source: Kennedy, 2003. Y = DV r X X = Endog Var Z = Instr Var Common Instruments • “Natural” phenomena including births, weather events, geographical boundaries. Katrina is a good example. • Randomized treatments. These are a particular class of IVE that are used when doing mediation or dosage type models (local average treatment effects). This is also (under certain assumptions) the Bloom Estimator (Bloom, 1984): Divide the ITT of the outcome by the ITT of the dosage. • Functions and aggregates of observed characteristics. A person’s ethnic background will not satisfy exclusion, but membership in a “cluster” defined by ethnicity, age, income, job and other characteristics might. See Cutler & Gruber, 1996. • Taking the cluster idea one step further, multilevel data yield “natural” clusters that can be used, under certain assumptions, to produce useful instruments. This is often referred to as a “Wald instrument” because it accords with Wald’s hypothesis. See Cobb-Clark & Crossley, 2003. The Wald Estimator and IVE • “If Z is a dummy variable indicating membership in one of two groups, then zy′/n captures the difference in the average outcomes of the two groups and zd′/n captures the difference in the treatment intensity across the two groups. Thus the IV estimator simply scales the difference in average outcomes by the difference in treatment intensities, as our intuition suggests.” -Cobb-Clark and Crossley, 2003. page 504. The Exclusion Restriction: Justification and Identification Controversy Surrounding IVs • The most problematic aspect of IVE: search for a variable satisfying the requirements of no correlation with error and high correlation with endogenous variable. • Even some economists are skeptical of the value of certain approaches taken to estimating IVs: Heckman (1990; 1997). • Heckman even questions random assignment as an instrument! • Inadequate attention to ensuring exclusion is a frequent problem. • This is changing particularly as researchers outside of economics begin to see the value of the method. Can Exclusion be Proven? The (flawed) logic of a test for exclusion is that if Z is associated with Y after conditioning on X, then it is not an IV, because that means it must operate through C (or any other unmeasured variable). Health at time t-1 Not measured (x-sect) C Z X Y Food stamp receipt Public Health insurance Health at time t The problem is that this test always fails, because conditioning on X creates an artifactual association between Z and C. Exclusion must be “proven” by inductive methods only. Can Exclusion be Proven? Propensity to receive food stamps 1 0 Health at time t-1 Green: does not have public health insurance pink: has public health insurance Refer to: Morgan & Winship, 2007; p. 67. Exclusion Logic: Example • A school district recently implemented a policy change, promoting students to middle school in 7th grade rather than 6th. • You are an evaluator given the task of determining whether this policy is beneficial to students, based on 7th grade exams. • Promotion is endogenous because it is determined partly by performance on end-of-grade exams. • Having been hired after implementation, randomization is not an option. • If a valid IV can be identified, you might have a causal estimate. • Birth date is random, right? (Right?) Exclusion Logic: Example of Potential IV Jan 1 Aug 31 Sept 1 Dec 31 Yr A Yr B Yr C • Blue cohort: advances in 6th grade. • Green cohort: advances in 7th grade. • Students in different cohorts nearest the star are most alike, and thus less subjected to random differences. • In calendar Year B, birth date determines cohort membership. • Correlation between birth date and cohort determined by dispersion over year-long period before/after star. • Quality of IV (correlation with cohort; influence of error) determined by the distance of each student’s birth from the star. Discussion: Heintze, Berger, Nadich & Meyers (2006) • Question: Is housing assistance directly associated with employment among low-income single mother households? • Endogeneity problem? • “The instruments used here are lagged measures of state-level housing assistance generosity. They are constructed from HUD data (HUD 1998). The lagged measure of housing assistance generosity for the 1997 sample is the ratio of the number of federally subsidized units to the number of poor households in the state in 1993.” (p 649) • How is exclusion assumed here? • How might exclusion fail? • Did the authors sufficient examine exclusion? • What alternatives might exist? Recommendations Recommendations: Endogeneity • Never include a known endogenous variable as an intermediary between your focal causal variable and your outcome (e.g., do not include dosage when modeling treatment). • The seeming exception to this is mediation, though in this case the mediator is the focal causal variable. • There are many methods for dealing with endogeneity, including fixed effects, interrupted time series, PSM, regression discontinuity; some of these remain relatively untapped by social work researchers. Recommendations: IVE • Knowing what an IV is and how it works will be important for social work evaluators and researchers as the disciplines converge on scarce grant funds for research projects. • When doing mediation or dosage (efficacy; i.e., not intent-to-treat) analysis, IVE is a potentially valuable tool, particularly when the treatment (as an antecedent to dosage) is randomly assigned. Look for next paper on local average treatment effects. This method also appropriate when ethics prevents randomization directly to treatment. • Think about what instruments might be useful in your work; look for examples of instruments by other researchers in the literature. Be critical; a lousy instrument does not promote good science. Recommended Reading • Morgan, S. L. & Winship, C. (2007). Counterfactuals and Causal Inference: Methods and Principals for Social Research. Cambridge University Press. • There are about a dozen articles in the social work literature that use or refer to IVE. These are listed in the appendix of the Stone article on endogeneity. • The papers I’ve referred to: Cutler, D.M., and J. Gruber. 1996. The Effect of Expanding the Medicaid Program on Public Insurance, Private Insurance, and Redistribution. American Economic Review: Papers and Pro-ceedings 86(2): 368–373. Cobb-Clark, D. A. & Crossley, T. (2003). Econometrics for evaluations: An introduction to recent developments. The Economic Record, 79(247), 491-511. • A three part series in Social Science and Medicine by myself & Susan Parish & colleagues, and Michael Foster: Rose, R. A., Parish, S. L., Yoo, J. (2010). Observation, correlation and policy analysis: A response to Foster. Social Science & Medicine, 70, 1274-1276. Foster, E. M. (2010). A commentary on Rose et al. Social Science & Medicine, 70, 1271-1273. Rose, R. A., Parish, S. L., Yoo, J., Grady, M.D., Andrews, M. E., Powell, S. E., & HicksSangster, T. K. (2010). Medicaid suppresses racial disparities in health care among children with special health care needs: Evidence from a multi-level mediation modeling analysis. Social Science & Medicine, 70, 1263-1270. SSWR in Washington DC, Jan 2012 • I will be presenting a paper using LATE via IVE methods in a symposium for CareerStart that has several papers by myself, Mike Woolley and Jay Unick. • While my time for explaining the intricacies of LATE will be limited, I will have (by this time) briefs and probably a short report. • The paper I am presenting will constitute one of my dissertation papers. Closing Comments & Questions