A joint test of unconfoundedness and common trends (2404.16961v3)
Abstract: This paper introduces an overidentification test of two alternative assumptions to identify the average treatment effect on the treated in a two-period panel data setting: unconfoundedness and common trends. Under the unconfoundedness assumption, treatment assignment and post-treatment outcomes are independent, conditional on control variables and pre-treatment outcomes, which motivates including pre-treatment outcomes in the set of controls. Conversely, under the common trends assumption, the trend and the treatment assignment are independent, conditional on control variables. This motivates employing a Difference-in-Differences (DiD) approach by comparing the differences between pre- and post-treatment outcomes of the treatment and control group. Given the non-nested nature of these assumptions and their often ambiguous plausibility in empirical settings, we propose a joint test using a doubly robust statistic that can be combined with machine learning to control for observed confounders in a data-driven manner. We discuss various causal models that imply the satisfaction of either common trends, unconfoundedness, or both assumptions jointly, and we investigate the finite sample properties of our test through a simulation study. Additionally, we apply the proposed method to five empirical examples using publicly available datasets and find the test to reject the null hypothesis in two cases.
- Semiparametric difference-in-differences estimators. Review of Economic Studies 72, 1–19.
- Estimating the effect of training programms on earnings. The Review of Economics and Statistics 6, 47–57.
- causalsens: Selection Bias Approach to Sensitivity Analysis for Causal Effects. URL: https://CRAN.R-project.org/package=causalsens. r package version 0.1.2.
- Should we combine difference in differences with conditioning on pre-treatment outcomes. working paper, Toulouse School of Economics .
- Double/debiased machine learning for difference-in-differences models. The Econometrics Journal 23, 177–191.
- Double/debiased machine learning for treatment and structural parameters. The Econometrics Journal 21, C1–C68.
- Planning of Experiments. Wiley, New York.
- Causal effects in non-experimental studies: Reevaluating the evaluation of training programmes. Journal of American Statistical Association 94, 1053–1062.
- Propensity-score-matching methods for nonexperimental causal studies. The Review of Economics and Statistics 84, 151–161.
- Selection and parallel trends. arXiv preprint 2203.09001 .
- The common structure of statistical models of truncation, sample selection and limited dependent variables and a simple estimator for such models. Annals of Economic and Social Measurement 5, 475–492.
- A generalization of sampling without replacement from a finite universe. Journal of the American Statistical Association 47, 663–685.
- Testing the identification of causal effects in data. arXiv preprint 2203.15890 .
- Nonparametric estimation of average treatment effects under exogeneity: a review. The Review of Economics and Statistics 86, 4–29.
- Double machine learning based program evaluation under unconfoundedness arXiv:arXiv preprint 2003.03191.
- Algorithm-agnostic significance testing in supervised learning with multimodal data. arXiv preprint 2402.14416 .
- Evaluating the econometric evaluations of training programs with experimental data. American Economic Review 76, 604–620.
- The estimation of causal effects by difference-in-difference methods. Foundations and Trends in Econometrics 4, 165–224.
- On the application of probability theory to agricultural experiments. essay on principles. Statistical Science Reprint, 5, 463–480.
- Probabilistic reasoning in intelligent systems: networks of plausible inference. Morgan Kaufmann, San Mateo.
- Causality: Models, Reasoning, and Inference. Cambridge University Press, Cambridge.
- Consistent significance testing for nonparametric regression. Journal of Business & Economic Statistics 15, 369–378.
- Semiparametric efficiency in multivariate regression models with missing data. Journal of the American Statistical Association 90, 122–129.
- Estimation of regression coefficients when some regressors are not always observed. Journal of the American Statistical Association 90, 846–866.
- Analysis of semiparametric regression models for repeated outcomes in the presence of missing data. Journal of the American Statistical Association 90, 106–121.
- The central role of the propensity score in observational studies for causal effects. Biometrika 70, 41–55.
- Constructing a control group using multivariate matched sampling methods that incorporate the propensity score. The American Statistician 39, 33–38.
- When is parallel trends sensitive to functional form? Econometrica 91, 737–747.
- What’s trending in difference-in-differences? a synthesis of the recent econometrics literature. arXiv preprint 2201.01194 .
- Comment on ’randomization analysis of experimental data: The fisher randomization test’ by d. basu. Journal of American Statistical Association 75, 591–593.
- Estimating causal effects of treatments in randomized and nonrandomized studies. Journal of Educational Psychology 66, 688–701.
- Does matching overcome lalonde’s critique of nonexperimental estimators? Journal of Econometrics 125, 305–353.
- On the Mode of Communication of Cholera.
- Causation, prediction, and search. Springer.
- Regresson shrinkage and selection via the lasso. Journal of the Royal Statistical Society 58, 267–288.
- Econometric Analysis of Cross Section and Panel Data. MIT Press, Cambridge.
- Efficient difference-in-differences estimation with high-dimensional common trend confounding. arXiv preprint 1809.01643 .