Generalization Bounds for Causal Regression: Insights, Guarantees and Sensitivity Analysis
Abstract: Many algorithms have recently been proposed for causal machine learning. Yet there is little to no theory on their quality, especially in finite samples. In this work, we propose a theory based on generalization bounds that provides such guarantees. By introducing a novel change-of-measure inequality, we are able to tightly bound the model loss in terms of the deviation of the treatment propensities over the population, a quantity that we show can be bounded empirically. Our theory is fully rigorous and holds even in the face of hidden confounding and violations of positivity. We demonstrate our bounds on semi-synthetic and real data, showcasing their remarkable tightness and practical utility.