The Augmented Synthetic Control Method (1811.04170v3)

Published 10 Nov 2018 in stat.ME and econ.EM

Abstract: The synthetic control method (SCM) is a popular approach for estimating the impact of a treatment on a single unit in panel data settings. The "synthetic control" is a weighted average of control units that balances the treated unit's pre-treatment outcomes as closely as possible. A critical feature of the original proposal is to use SCM only when the fit on pre-treatment outcomes is excellent. We propose Augmented SCM as an extension of SCM to settings where such pre-treatment fit is infeasible. Analogous to bias correction for inexact matching, Augmented SCM uses an outcome model to estimate the bias due to imperfect pre-treatment fit and then de-biases the original SCM estimate. Our main proposal, which uses ridge regression as the outcome model, directly controls pre-treatment fit while minimizing extrapolation from the convex hull. This estimator can also be expressed as a solution to a modified synthetic controls problem that allows negative weights on some donor units. We bound the estimation error of this approach under different data generating processes, including a linear factor model, and show how regularization helps to avoid over-fitting to noise. We demonstrate gains from Augmented SCM with extensive simulation studies and apply this framework to estimate the impact of the 2012 Kansas tax cuts on economic growth. We implement the proposed method in the new augsynth R package.

Citations (347)

View on Semantic Scholar

Summary

The paper presents a novel extension to SCM by incorporating a ridge regression outcome model to address pre-treatment imbalance.
It offers theoretical error bounds and demonstrates improved bias reduction and more accurate causal inference through simulations and empirical analysis.
The method expands SCM applicability by effectively leveraging auxiliary covariates and allowing negative weights to enhance model fit.

Overview of the Augmented Synthetic Control Method

The paper on the Augmented Synthetic Control Method (ASCM) presents an advanced methodological extension to the Synthetic Control Method (SCM), a widely utilized technique for causal inference in observational studies. SCM is effective for estimating the impact of interventions using a weighted combination of control units to approximate the counterfactual outcome of treated units. However, SCM's efficacy is constrained by its reliance on the availability of a highly fitting synthetic control, which limits its applicability in situations where exact pre-treatment matching is not achievable.

To address this limitation, the authors propose the Augmented SCM as a robust alternative that combines SCM with additional modeling frameworks to enhance the estimator's performance in scenarios of imperfect pre-treatment fit. The central innovation in ASCM is its incorporation of an outcome model that estimates and corrects the bias resulting from discrepancies in pre-treatment characteristics between the treated unit and the synthetic control. This integration allows ASCM to extend the applicability of SCM by leveraging outcome models such as ridge regression to adjust the weights and improve the accuracy of causal inference.

The primary contribution of the paper is the formulation of the Ridge ASCM, which employs ridge regression as the outcome model to address the issue of pre-treatment imbalance. The authors rigorously derive the mathematical properties of this estimator, demonstrating its ability to enhance fit by permitting extrapolation outside the convex hull of donor units. Unlike traditional SCM, which maintains non-negative weights constrained to a simplex, Ridge ASCM strategically applies negative weights to achieve better pre-treatment balance and reduce bias.

The paper also provides theoretical bounds on estimation errors under specific data-generating processes, including linear models and linear factor models, commonly cited contexts for SCM applications. The analysis reveals that, under a linear model, Ridge ASCM's improved fit mitigates bias effectively, balancing the bias-variance trade-off. Under a linear factor model, the risk of overfitting due to noise is acknowledged, emphasizing the importance of judicious hyperparameter selection.

Furthermore, the research extends the ASCM framework to account for auxiliary covariates beyond mere pre-treatment outcomes, proposing approaches that either incorporate these covariates alongside outcomes in model augmentation or residualize outcomes against covariates to enhance estimator performance.

The paper is enriched by extensive simulations and empirical analyses, including an empirical case paper on the economic impact of Kansas' tax cuts in 2012, which illustrate ASCM's marked improvements over conventional SCM in terms of bias reduction and estimation accuracy.

Practically, ASCM broadens the usability of synthetic control methods in causal inference by allowing researchers to employ SCM in a wider range of historical data scenarios where pre-treatment balance is not attainable with traditional methods alone. Theoretically, it signals an evolution in the methodology that encourages using hybrid approaches combining balance and model-based techniques. As future developments in AI continue to evolve, this method provides a more flexible and resilient approach for policy evaluation and observational research, which is ever-more relevant in data-driven decision-making across various fields.

PDF Markdown

Related Papers

Find Related Papers

Tweets

https://twitter.com/a_strezh/status/1819490725974679681