Double Robust Bayesian Inference on Average Treatment Effects (2211.16298v5)
Abstract: We propose a double robust Bayesian inference procedure on the average treatment effect (ATE) under unconfoundedness. For our new Bayesian approach, we first adjust the prior distributions of the conditional mean functions, and then correct the posterior distribution of the resulting ATE. Both adjustments make use of pilot estimators motivated by the semiparametric influence function for ATE estimation. We prove asymptotic equivalence of our Bayesian procedure and efficient frequentist ATE estimators by establishing a new semiparametric Bernstein-von Mises theorem under double robustness; i.e., the lack of smoothness of conditional mean functions can be compensated by high regularity of the propensity score and vice versa. Consequently, the resulting Bayesian credible sets form confidence intervals with asymptotically exact coverage probability. In simulations, our method provides precise point estimates of the ATE through the posterior mean and credible intervals that closely align with the nominal coverage probability. Furthermore, our approach achieves a shorter interval length in comparison to existing methods. We illustrate our method in an application to the National Supported Work Demonstration following LaLonde [1986] and Dehejia and Wahba [1999].
- A. Abadie and G. W. Imbens. Bias-corrected matching estimators for average treatment effects. Journal of Business & Economic Statistics, 29(1):1–11, 2011.
- I. Andrews and A. Mikusheva. Optimal decision rules for weak gmm. Econometrica, 90:715–748, 2022.
- T. B. Armstrong and M. Kolesár. Finite-sample optimal estimation and inference on average treatment effects under unconfoundedness. Econometrica, 89(3):1141–1177, 2021.
- Using wasserstein generative adversarial networks for the design of monte carlo simulations. Journal of Econometrics, 2021.
- Doubly robust nonparametric inference on the average treatment effect. Biometrika, 104(4):863–880, 2017.
- I. Castillo. A semiparametric bernstein–von mises theorem for gaussian process priors. Probability Theory and Related Fields, 152:53–99, 2012.
- I. Castillo and J. Rousseau. A bernstein–von mises theorem for smooth functionals in semiparametric models. Annals of Statistics, 43:2353–2383, 2015.
- G. Chamberlain and G. Imbens. Nonparametric applications of bayesian inference. Journal of Business and Economic Statistics, 21:12–18, 2003.
- Semiparametric efficiency in gmm models with auxiliary data. The Annals of Statistics, 36(2):808–843, 2008.
- Monte carlo confidence sets for identified sets. Econometrica, 86:1965–2018, 2018.
- Double/debiased/neyman machine learning of treatment effects. American Economic Review, 107(5):261–265, 2017.
- Double/debiased machine learning for treatment and structural parameters. The Econometrics Journal, 21(1):C1–C68, 2018.
- De-biased machine learning of global and local parameters using regularized riesz representers. arXiv preprint arXiv:1802.08667, 2020.
- Automatic debiased machine learning of causal and structural effects. Econometrica, 90:967–1027, 2022.
- Dealing with limited overlap in estimation of average treatment effects. Biometrika, 96(1):187–199, 2009.
- R. H. Dehejia and S. Wahba. Causal effects in nonexperimental studies: Reevaluating the evaluation of training programs. Journal of the American statistical Association, 94(448):1053–1062, 1999.
- R. H. Dehejia and S. Wahba. Propensity score-matching methods for nonexperimental causal studies. Review of Economics and Statistics, 84(1):151–161, 2002.
- M. H. Farrell. Robust inference on average treatment effects with possibly more covariates than observations. Journal of Econometrics, 189(1):1–23, 2015.
- Deep neural networks for estimation and inference. Econometrica, 89(1):181–213, 2021.
- Regularization paths for generalized linear models via coordinate descent. Journal of statistical software, 33(1):1, 2010.
- S. Ghosal and A. Van der Vaart. Fundamentals of nonparametric Bayesian inference, volume 44. Cambridge University Press, 2017.
- Convergence rates of posterior distributions. The Annals of Statistics, 28:500–531, 2000.
- R. Giacomini and T. Kitagawa. Robust bayesian inference for set-identified models. Econometrica, 89(4):1519–1556, 2021.
- J. Hahn. On the role of the propensity score in efficient semiparametric estimation of average treatment effects. Econometrica, pages 315–331, 1998.
- Q. Han. Set structured global empirical risk minimizers are rate optimal in general dimensions. The Annals of Statistics, 49(5):2642–2671, 2021.
- Efficient estimation of average treatment effects using the estimated propensity score. Econometrica, 71(4):1161–1189, 2003.
- G. W. Imbens. Nonparametric estimation of average treatment effects under exogeneity: A review. Review of Economics and statistics, 86(1):4–29, 2004.
- Causal inference in statistics, social, and biomedical sciences. Cambridge University Press, 2015.
- L. Jonathan. Tutorial: deriving the efficient influence curve for large models. arxiv preprint, arXiv:1903.01706v3, 2019.
- M. Kasy. Optimal taxation and insurance using machine learning—sufficient statistics and beyond. Journal of Public Economics, 167:205–219, 2018.
- R. J. LaLonde. Evaluating the econometric evaluations of training programs with experimental data. The American economic review, pages 604–620, 1986.
- M. Ledoux and M. Talagrand. Probability in Banach Spaces: isoperimetry and processes. Springer Science & Business Media, 1991.
- Statistical analysis with missing data (3rd Edition). John Wiley & Sons, 2019.
- Semiparametric bayesian doubly robust causal estimation. Journal of Statistical Planning and Inference, 225:171–187, 2023.
- C. Rassmusen and C. Williams. Gaussian processes for machine learning. MIT, 2006.
- K. Ray and B. Szabó. Debiased bayesian inference for average treatment effects. Advances in Neural Information Processing Systems, 32, 2019.
- K. Ray and A. van der Vaart. Semiparametric bayesian causal inference. The Annals of Statistics, 48:2999–3020, 2020.
- The bayesian analysis of complex, high-dimensional models: Can it be coda? Statistical Science, 29(4):619–639, 2014.
- J. M. Robins and Y. Ritov. Toward a curse of dimensionality appropriate (coda) asymptotic theory for semi-parametric models. Statistics in Medicine, 16(3):285–319, 1997.
- J. M. Robins and A. Rotnitzky. Semiparametric efficiency in multivariate regression models with missing data. Journal of the American Statistical Association, 90(429):122–129, 1995.
- Reducing bias in observational studies using subclassification on the propensity score. Journal of the American statistical Association, 79(387):516–524, 1984.
- D. Rubin. Bayesian bootstrap. The Annals of Statistics, 9:130–134, 1981.
- D. Rubin. Bayesianly justifiable and relevant frequency calculations for the applied statistician. The Annals of statistics, 12:1151–1172, 1984.
- A bayesian view of doubly robust causal inference. Biometrika, 103:667–681, 2016.
- Targeted learning: causal inference for observational and experimental data. Springer, 2011.
- A. van der Vaart. Asymptotic statistics. Cambridge University Press, 1998.
- Weak convergence and empirical processes. Springer, 1996.
- A. W. van der Vaart and J. H. van Zanten. Rates of contraction of posterior distributions based on gaussian process priors. The Annals of Statistics, 36:1435–1463, 2008.
- A. W. van der Vaart and J. H. van Zanten. Adaptive bayesian estimation using a gaussian random field with inverse gamma bandwidth. The Annals of Statistics, 37:2655–2675, 2009.
- L. Wasserman. All of statistics: a concise course in statistical inference, volume 26. Springer, 2004.
- Inference under unequal probability sampling with the bayesian exponentially tilted empirical likelihood. Biometrika, 107:857–873, 2020.