The synthetic control method is an econometric method used to evaluate the effect of large-scale interventions. It was proposed in a series of articles by Alberto Abadie and his coauthors.[2][3][4] A synthetic control is a weighted average of several units (such as regions or companies) combined to recreate the trajectory that the outcome of a treated unit would have followed in the absence of the intervention. The weights are selected in a data-driven manner to ensure that the resulting synthetic control closely resembles the treated unit in terms of key predictors of the outcome variable.[2] Unlike difference in differences approaches, this method can account for the effects of confounders changing over time, by weighting the control group to better match the treatment group before the intervention.[5] Another advantage of the synthetic control method is that it allows researchers to systematically select comparison groups. It has been applied to the fields of economics,[6]political science,[1]health policy,[5]criminology,[7] and others.
The synthetic control method combines elements from matching and difference-in-differences techniques. Difference-in-differences methods are often-used policy evaluation tools that estimate the effect of an intervention at an aggregate level (e.g. state, country, age group etc.) by averaging over a set of unaffected units. Famous examples include studies of the employment effects of a raise in the minimum wage in New Jersey fast food restaurants by comparing them to fast food restaurants just across the border in Philadelphia that were unaffected by a minimum wage raise,[8] and studies that look at crime rates in southern cities to evaluate the impact of the Mariel Boatlift on crime.[9] The control group in this specific scenario can be interpreted as a weighted average, where some units effectively receive zero weight while others get an equal, non-zero weight.
The synthetic control method tries to offer a more systematic way to assign weights to the control group. It typically uses a relatively long time series of the outcome prior to the intervention and estimates weights in such a way that the control group mirrors the treatment group as closely as possible. In particular, assume we have J observations over T time periods where the relevant treatment occurs at time where Let
be the treatment effect for unit at time , where is the outcome in absence of the treatment. Without loss of generality, if unit 1 receives the relevant treatment, only is not observed for . We aim to estimate .
Imposing some structure
and assuming there exist some optimal weights such that
for , the synthetic controls approach suggests using these weights to estimate the counterfactual
for . So under some regularity conditions, such weights would provide estimators for the treatment effects of interest. In essence, the method uses the idea of matching and using the training data pre-intervention to set up the weights and hence a relevant control post-intervention.[2]
Synthetic controls have been used in a number of empirical applications, ranging from studies examining natural catastrophes and growth,[10] studies that examine the effect of vaccine mandates on childhood immunization,[11] and studies linking political murders to house prices.[12]
^ abAbadie, Alberto; Diamond, Alexis; Hainmueller, Jens (February 2015). "Comparative Politics and the Synthetic Control Method". American Journal of Political Science. 59 (2): 495–510. doi:10.1111/ajps.12116.
^Billmeier, Andreas; Nannicini, Tommaso (July 2013). "Assessing Economic Liberalization Episodes: A Synthetic Control Approach". Review of Economics and Statistics. 95 (3): 983–1001. doi:10.1162/REST_a_00324. S2CID57561957.
^Saunders, Jessica; Lundberg, Russell; Braga, Anthony A.; Ridgeway, Greg; Miles, Jeremy (3 June 2014). "A Synthetic Control Approach to Evaluating Place-Based Crime Interventions". Journal of Quantitative Criminology. 31 (3): 413–434. doi:10.1007/s10940-014-9226-5. S2CID254702864.