1 Synthetic controls Suppose one has panel data but very limited evidence on the treated and/or untreated subpopulations (for example, perhaps only one individual is treated). Then, standard matching strategies (general or propensity score) are likely of limited utility. On the other hand, perhaps one can construct a match for the treated based on a composite of the controls (untreated). This is an extension of dierence-in-dierences (D-I-D) in which unobservable factors (perhaps correlated with covariates, Xit ) are allowed to vary cross-sectionally whereas such unobservables are a cross-sectional constant for the D-I-D strategy. This is commonly referred to as a factor model since the unobservables can be thought of as factor scores, t , (which vary through time) weighted by factor loadings, µi , (which vary cross-sectionally) both of which are unobservable in a factor analysis. Yit = dit + T Xit + t µi + it where Yit is outcome for individual i during period t, is an unknown vector of coecients common across individuals, it is unobservable (random) noise, dit is an indicator of treatment (1) or nontreatment (0), and is the treatment eect of interest. Synthetic controls are constructed based on a convex combination (with T0 weights wi 0, wi = 1) of the pre-intervention outcomes, Yik = kt Yit , t=1 and covariates, Xi , from the control subpopulation such that T arg min (H1 [H2 · · · Hn ] W ) V (H1 [H2 · · · Hn ] W ) W T where Hi = Yik1 , Yik2 , . . . Yikm , XiT , i = 1 denotes treated while i = 2, . . . , n denotes control, and V is a diagonal matrix controlling potentially dierent weights applied to the components of Hi . Abadie et al [2010, 2011] propose choosing V to minimize the expected mean square error T arg min (Y1 Y0 W (V )) (Y1 Y0 W (V )) V during the pre-intervention period where Y1 refers to outcomes for the treated and Y0 outcomes for the control group. Next, we explore several variations via some highly stylized examples. First, we visit examples with constant factors such that both D-I-D and synthetic controls identify the treatment eect. Then, we visit settings with heterogeneous factors in which D-I-D fails but synthetic controls eectively identify the treatment eect. Finally, we consider settings where neither D-I-D nor synthetic variables eectively identify the treatment eect. The first failure is due to violation of conditional mean independence of potential outcome without treatment while the second failure is due to the treated individuals’ H lying outside the convex hull of the controls. In the latter case, synthetic controls can produce greater selection bias than D-I-D. 1 1.1 Examples Suppose there are three individuals who are untreated during the first two periods but the third individual adopts treatment in the third period. The data generating process (DGP ) is Y = I1 + I2 + I3 + T + 2I3 T + µ where Y is observed outcome, Ij denotes and indicator variable equal to 1 for individual j and 0 otherwise, and T is an indicator equal to one for intervention period three and zero otherwise. In the examples below, Y0 represents potential outcome without treatment and only µ (and, of course, outcome) varies. Example 1 (D-I-D setting) Suppose the DGP is Y0 2 2 3 2 2 3 2 2 3 Y 2 2 3 2 2 3 2 2 5 I1 1 1 1 0 0 0 0 0 0 I2 0 0 0 1 1 1 0 0 0 I3 0 0 0 0 0 0 1 1 1 T 0 0 1 0 0 1 0 0 1 d = I3 T 0 0 0 0 0 0 0 0 1 µ 1 1 1 1 1 1 1 1 1 = 5 3 = 2. D-I-D identifies E [Y | X] = 2I1 + 2I2 + 2I3 + T + 2I3 T That is, even though the nonzero error term biases the coecients on the individual mean eects, the treatment eect is eectively identified. Any weights V and W produce synthetic controls that eectively identify the treatment eect. Example 2 (D-I-D setting with time variation) Suppose the DGP is Y0 2 2 4 2 2 4 2 2 4 Y 2 2 4 2 2 4 2 2 6 I1 1 1 1 0 0 0 0 0 0 I2 0 0 0 1 1 1 0 0 0 I3 0 0 0 0 0 0 1 1 1 T 0 0 1 0 0 1 0 0 1 d = I3 T 0 0 0 0 0 0 0 0 1 µ 3 3 2 3 3 2 3 3 2 = 6 4 = 2. D-I-D identifies E [Y | X] = 2I1 2I2 2I3 + 6T + 2I3 T 2 That is, even though the nonzero error term biases the coecients on the individual and time mean eects, the treatment eect is eectively identified. Again, any weights V and W produce synthetic controls that eectively identify the treatment eect. Example 3 (synthetic control setting W T = 0.1 0.9 ) Suppose the DGP is Y0 Y I1 I2 I3 T d = I3 T µ 2 2 1 0 0 0 0 3 2 2 1 0 0 0 0 3 4 4 1 0 0 1 0 2 4 4 0 1 0 0 0 3 4 4 0 1 0 0 0 3 0 0 0 1 0 1 0 2 3.4 3.4 0 0 1 0 0 2.4 3.4 3.4 0 0 1 0 0 2.4 0.4 2.4 0 0 1 1 1 1.6 = 2.4 0.4 = 2. D-I-D identifies 1 1 2 E [Y | X] = I1 + 2 I2 + 3 I3 + T 2I3 T 3 3 5 Cross-sectional variation in the unobservable factors, µ, produces substantial bias in the D-I-D estimate of the treatment eect. That is, it is so underestimated the sign is reversed. On the other hand, since the test subject lies in a convex hull of the control subjects’ predicted outcomes, the synthetic control strategy eectively identifies the treatment eect. I1 I2 V Yit=1 Y t=2 i WT = Y33 W T where Yit=1 = 2 t=1 1 Identification Y13 Y23 V fixed 0 0 0 0 0 0 0 0 0 0 0.5 0 0 0 0 0.5 0.1 0.9 0.335 0 0 0 2 kt Yit = Yi1 and Yit=2 = is also eective with Yit=1 = 2 V estimated 0 0 0 0.037 0 0 0 0.332 0 0 0 0.296 0.1 0.9 2 kt Yit = Yi2 .1 t=1 2 kt Yit = 1 2 (Yi1 + Yi2 ) (or, for that matter, t=1 any convex combination of Yi1 and Yi2 ) in place of both Yit=1 and Yit=2 . 3 Example 4 (synthetic control setting W T = 0.9 0.1 ) Suppose the DGP is Y0 Y I1 I2 I3 T d = I3 T µ 2 2 1 0 0 0 0 3 2 2 1 0 0 0 0 3 4 4 1 0 0 1 0 2 4 4 0 1 0 0 0 3 4 4 0 1 0 0 0 3 0 0 0 1 0 1 0 2 1.4 1.4 0 0 1 0 0 2.4 1.4 1.4 0 0 1 0 0 2.4 3.6 5.6 0 0 1 1 1 1.6 = 5.6 3.6 = 2. D-I-D identifies 1 1 2 E [Y | X] = 1 I1 + 1 I2 2 I3 + T + 6I3 T 3 3 5 Cross-sectional variation in the unobservable factors, µ, produces substantial (overestimation) bias in the D-I-D estimate of the treatment eect. On the other hand, since the test subject lies in a convex hull of the control subjects’ predicted outcomes, the synthetic control strategy eectively identifies the treatment eect. I1 I2 V Yit=1 Y t=2 0 0 0 0 i WT = Y33 W where Yit=1 = T 2 t=1 Y13 Y23 V fixed 0 0 0 0 0 0 0 0.5 0 0 0 0.5 0.9 0.1 0.037 0 0 0 2 kt Yit = Yi1 and Yit=2 = 2 V estimated 0 0 0 0.335 0 0 0 0.332 0 0 0 0.296 0.9 0.1 2 kt Yit = Yi2 . t=1 Example 5 (both d-i-d and synthetic controls fail) Suppose the DGP is almost identical to example 1 except that potential outcome without treatment is aected by the intervention (not conditionally mean independent). Y0 2 2 3 2 2 3 2 2 5 Y 2 2 3 2 2 3 2 2 5 I1 1 1 1 0 0 0 0 0 0 I2 0 0 0 1 1 1 0 0 0 I3 0 0 0 0 0 0 1 1 1 T 0 0 1 0 0 1 0 0 1 4 d = I3 T 0 0 0 0 0 0 0 0 1 µ 1 1 1 1 1 1 1 1 1 = 5 5 = 0. As before, D-I-D identifies E [Y | X] = 2I1 + 2I2 + 2I3 + T + 2I3 T but the treatment eect is not 2 but rather zero. Further, since mean conditional independence fails, synthetic controls cannot eectively identify the treatment eect either (also identifies 2 rather than zero). Example 6 (treated lie outside convex hull of controls) Suppose the DGP is as example 3 for individuals 1 and 2 but the weights are W T = the same 1 1 (not a convex combination). Y0 2 2 4 4 4 0 2 2 4 Y 2 2 4 4 4 0 2 2 6 I1 1 1 1 0 0 0 0 0 0 I2 0 0 0 1 1 1 0 0 0 I3 0 0 0 0 0 0 1 1 1 T 0 0 1 0 0 1 0 0 1 d = I3 T 0 0 0 0 0 0 0 0 1 µ 3 3 2 3 3 2 0 0 0 = 6 4 = 2. D-I-D identifies 1 1 E [Y | X] = I1 + 2 I2 + 2I3 + T + 3I3 T 3 3 but the treatment eect is 2 not 3. Since the test subject lies outside the convex hull of the control subjects’ predicted outcomes, the synthetic control strategy ineectively identifies the treatment eect. I1 V I2 Yi WT = Y33 W T where Yi = 2 Y13 Y23 kt Yit = 1 2 1V fixed 0 0 3 0 1 0 3 0 0 13 0.435 0.565 = 1 V estimated 0.44 0 0 0 0.22 0 0 0 0.34 1 2 = 1 1 3 3 1 4.26 = 2 4 32 = 2 (Yi1 + Yi2 ). Hence, the bias in the treatment eect is t=1 greater with synthetic controls than the bias for D-I-D. 5