1 Synthetic controls

1 Synthetic controls Suppose one has panel data but very limited evidence on the treated and/or untreated subpopulations (for example, perhaps only one individual is treated). Then, standard matching strategies (general or propensity score) are likely of limited utility. On the other hand, perhaps one can construct a match for the treated based on a composite of the controls (untreated). This is an extension of dierence-in-dierences (D-I-D) in which unobservable factors (perhaps correlated with covariates, Xit ) are allowed to vary cross-sectionally whereas such unobservables are a cross-sectional constant for the D-I-D strategy. This is commonly referred to as a factor model since the unobservables can be thought of as factor scores, t , (which vary through time) weighted by factor loadings, µi , (which vary cross-sectionally) both of which are unobservable in a factor analysis. Yit = dit +  T Xit + t µi + it where Yit is outcome for individual i during period t,  is an unknown vector of coecients common across individuals, it is unobservable (random) noise, dit is an indicator of treatment (1) or nontreatment (0), and  is the treatment eect of interest. Synthetic controls are constructed based on a convex combination (with T0   weights wi  0, wi = 1) of the pre-intervention outcomes, Yik = kt Yit , t=1 and covariates, Xi , from the control subpopulation such that T arg min (H1  [H2 · · · Hn ] W ) V (H1  [H2 · · · Hn ] W ) W  T where Hi = Yik1 , Yik2 , . . . Yikm , XiT , i = 1 denotes treated while i = 2, . . . , n denotes control, and V is a diagonal matrix controlling potentially dierent weights applied to the components of Hi . Abadie et al [2010, 2011] propose choosing V to minimize the expected mean square error T arg min (Y1  Y0 W  (V )) (Y1  Y0 W  (V )) V during the pre-intervention period where Y1 refers to outcomes for the treated and Y0 outcomes for the control group. Next, we explore several variations via some highly stylized examples. First, we visit examples with constant factors such that both D-I-D and synthetic controls identify the treatment eect. Then, we visit settings with heterogeneous factors in which D-I-D fails but synthetic controls eectively identify the treatment eect. Finally, we consider settings where neither D-I-D nor synthetic variables eectively identify the treatment eect. The first failure is due to violation of conditional mean independence of potential outcome without treatment while the second failure is due to the treated individuals’ H lying outside the convex hull of the controls. In the latter case, synthetic controls can produce greater selection bias than D-I-D. 1 1.1 Examples Suppose there are three individuals who are untreated during the first two periods but the third individual adopts treatment in the third period. The data generating process (DGP ) is Y = I1 + I2 + I3 + T + 2I3  T + µ where Y is observed outcome, Ij denotes and indicator variable equal to 1 for individual j and 0 otherwise, and T is an indicator equal to one for intervention period three and zero otherwise. In the examples below, Y0 represents potential outcome without treatment and only µ (and, of course, outcome) varies. Example 1 (D-I-D setting) Suppose the DGP is Y0 2 2 3 2 2 3 2 2 3 Y 2 2 3 2 2 3 2 2 5 I1 1 1 1 0 0 0 0 0 0 I2 0 0 0 1 1 1 0 0 0 I3 0 0 0 0 0 0 1 1 1 T 0 0 1 0 0 1 0 0 1 d = I3  T 0 0 0 0 0 0 0 0 1 µ 1 1 1 1 1 1 1 1 1  = 5  3 = 2. D-I-D identifies E [Y | X] = 2I1 + 2I2 + 2I3 + T + 2I3  T That is, even though the nonzero error term biases the coecients on the individual mean eects, the treatment eect is eectively identified. Any weights V and W produce synthetic controls that eectively identify the treatment eect. Example 2 (D-I-D setting with time variation) Suppose the DGP is Y0 2 2 4 2 2 4 2 2 4 Y 2 2 4 2 2 4 2 2 6 I1 1 1 1 0 0 0 0 0 0 I2 0 0 0 1 1 1 0 0 0 I3 0 0 0 0 0 0 1 1 1 T 0 0 1 0 0 1 0 0 1 d = I3  T 0 0 0 0 0 0 0 0 1 µ 3 3 2 3 3 2 3 3 2  = 6  4 = 2. D-I-D identifies E [Y | X] = 2I1  2I2  2I3 + 6T + 2I3  T 2 That is, even though the nonzero error term biases the coecients on the individual and time mean eects, the treatment eect is eectively identified. Again, any weights V and W produce synthetic controls that eectively identify the treatment eect.   Example 3 (synthetic control setting W T = 0.1 0.9 ) Suppose the DGP is Y0 Y I1 I2 I3 T d = I3  T µ 2 2 1 0 0 0 0 3 2 2 1 0 0 0 0 3 4 4 1 0 0 1 0 2 4 4 0 1 0 0 0 3 4 4 0 1 0 0 0 3 0 0 0 1 0 1 0 2 3.4 3.4 0 0 1 0 0 2.4 3.4 3.4 0 0 1 0 0 2.4 0.4 2.4 0 0 1 1 1 1.6  = 2.4  0.4 = 2. D-I-D identifies 1 1 2 E [Y | X] =  I1 + 2 I2 + 3 I3 + T  2I3  T 3 3 5 Cross-sectional variation in the unobservable factors, µ, produces substantial bias in the D-I-D estimate of the treatment eect. That is, it is so underestimated the sign is reversed. On the other hand, since the test subject lies in a convex hull of the control subjects’ predicted outcomes, the synthetic control strategy eectively identifies the treatment eect.   I1  I2   V  Yit=1  Y t=2 i WT  = Y33  W T where Yit=1 =  2  t=1 1 Identification Y13 Y23 V fixed  0 0 0 0  0 0 0 0     0 0 0.5 0  0 0 0 0.5   0.1 0.9    0.335  0   0 0 2 kt Yit = Yi1 and Yit=2 = is also eective with Yit=1 = 2  V estimated 0 0 0 0.037 0 0 0 0.332 0 0 0 0.296   0.1 0.9 2 kt Yit = Yi2 .1 t=1 2  kt Yit = 1 2 (Yi1 + Yi2 ) (or, for that matter, t=1 any convex combination of Yi1 and Yi2 ) in place of both Yit=1 and Yit=2 . 3       Example 4 (synthetic control setting W T = 0.9 0.1 ) Suppose the DGP is Y0 Y I1 I2 I3 T d = I3  T µ 2 2 1 0 0 0 0 3 2 2 1 0 0 0 0 3 4 4 1 0 0 1 0 2 4 4 0 1 0 0 0 3 4 4 0 1 0 0 0 3 0 0 0 1 0 1 0 2 1.4 1.4 0 0 1 0 0 2.4 1.4 1.4 0 0 1 0 0 2.4 3.6 5.6 0 0 1 1 1 1.6  = 5.6  3.6 = 2. D-I-D identifies 1 1 2 E [Y | X] = 1 I1 + 1 I2  2 I3 + T + 6I3  T 3 3 5 Cross-sectional variation in the unobservable factors, µ, produces substantial (overestimation) bias in the D-I-D estimate of the treatment eect. On the other hand, since the test subject lies in a convex hull of the control subjects’ predicted outcomes, the synthetic control strategy eectively identifies the treatment eect.   I1  I2   V  Yit=1  Y t=2  0  0   0 0 i WT  = Y33  W where Yit=1 = T  2  t=1 Y13 Y23  V fixed  0 0 0 0 0 0   0 0.5 0  0 0 0.5   0.9 0.1  0.037  0   0 0 2 kt Yit = Yi1 and Yit=2 = 2  V estimated 0 0 0 0.335 0 0 0 0.332 0 0 0 0.296   0.9 0.1 2 kt Yit = Yi2 . t=1 Example 5 (both d-i-d and synthetic controls fail) Suppose the DGP is almost identical to example 1 except that potential outcome without treatment is aected by the intervention (not conditionally mean independent). Y0 2 2 3 2 2 3 2 2 5 Y 2 2 3 2 2 3 2 2 5 I1 1 1 1 0 0 0 0 0 0 I2 0 0 0 1 1 1 0 0 0 I3 0 0 0 0 0 0 1 1 1 T 0 0 1 0 0 1 0 0 1 4 d = I3  T 0 0 0 0 0 0 0 0 1 µ 1 1 1 1 1 1 1 1 1      = 5  5 = 0. As before, D-I-D identifies E [Y | X] = 2I1 + 2I2 + 2I3 + T + 2I3  T but the treatment eect is not 2 but rather zero. Further, since mean conditional independence fails, synthetic controls cannot eectively identify the treatment eect either (also identifies 2 rather than zero). Example 6 (treated lie outside convex hull of controls) Suppose the DGP is as example 3 for individuals 1 and 2 but the weights are W T =  the same  1 1 (not a convex combination). Y0 2 2 4 4 4 0 2 2 4 Y 2 2 4 4 4 0 2 2 6 I1 1 1 1 0 0 0 0 0 0 I2 0 0 0 1 1 1 0 0 0 I3 0 0 0 0 0 0 1 1 1 T 0 0 1 0 0 1 0 0 1 d = I3  T 0 0 0 0 0 0 0 0 1 µ 3 3 2 3 3 2 0 0 0  = 6  4 = 2. D-I-D identifies 1 1 E [Y | X] =  I1 + 2 I2 + 2I3 + T + 3I3  T 3 3 but the treatment eect is 2 not 3. Since the test subject lies outside the convex hull of the control subjects’ predicted outcomes, the synthetic control strategy ineectively identifies the treatment eect.   I1 V  I2  Yi WT  = Y33  W T where Yi = 2   Y13 Y23 kt Yit =  1 2  1V fixed  0 0 3  0 1 0  3 0 0 13    0.435 0.565 = 1 V estimated  0.44 0 0  0 0.22 0  0 0 0.34   1 2   = 1 1 3 3  1 4.26 = 2  4 32 = 2 (Yi1 + Yi2 ). Hence, the bias in the treatment eect is t=1 greater with synthetic controls than the bias for D-I-D. 5

1 Synthetic controls

Related documents

Products

Support

1 Synthetic controls

Related documents

Add this document to collection(s)

Add this document to saved

Suggest us how to improve StudyLib