Neural Networks with Causal Graph Constraints: A New Approach for Treatment Effects Estimation (2404.12238v1)
Abstract: In recent years, there has been a growing interest in using machine learning techniques for the estimation of treatment effects. Most of the best-performing methods rely on representation learning strategies that encourage shared behavior among potential outcomes to increase the precision of treatment effect estimates. In this paper we discuss and classify these models in terms of their algorithmic inductive biases and present a new model, NN-CGC, that considers additional information from the causal graph. NN-CGC tackles bias resulting from spurious variable interactions by implementing novel constraints on models, and it can be integrated with other representation learning methods. We test the effectiveness of our method using three different base models on common benchmarks. Our results indicate that our model constraints lead to significant improvements, achieving new state-of-the-art results in treatment effects estimation. We also show that our method is robust to imperfect causal graphs and that using partial causal information is preferable to ignoring it.
- Representation learning: A review and new perspectives. IEEE transactions on pattern analysis and machine intelligence, 35(8):1798–1828, 2013.
- The causal cookbook: Recipes for propensity scores, g-computation, and doubly robust standardization. 2023.
- Causalml: Python package for causal machine learning. arXiv preprint arXiv:2002.11631, 2020.
- A crash course in good and bad controls. Sociological Methods & Research, page 00491241221099552, 2022.
- Alicia Curth and Mihaela van der Schaar. Nonparametric estimation of heterogeneous treatment effects: From theory to learning algorithms. In International Conference on Artificial Intelligence and Statistics, pages 1810–1818. PMLR, 2021.
- Really doing great at estimating cate? a critical look at ml benchmarking practices in treatment effect estimation. In Thirty-fifth conference on neural information processing systems datasets and benchmarks track (round 2), 2021.
- Propensity score-matching methods for nonexperimental causal studies. Review of Economics and statistics, 84(1):151–161, 2002.
- Vincent Dorie. Npci: Non-parametrics for causal inference. URL: https://github. com/vdorie/npci, 11:23, 2016.
- How to select predictive models for decision making or causal inference. Available at SSRN 4467871, 2023.
- Assessing spurious interaction effects in structural equation modeling: A cautionary note. Educational and psychological measurement, 75(5):721–738, 2015.
- Counterfactual regression with importance sampling weights. In IJCAI, pages 5880–5887, 2019.
- Graphical criteria for efficient total effect estimation via adjustment in causal linear models. Journal of the Royal Statistical Society Series B: Statistical Methodology, 84(2):579–599, 2022.
- Jennifer L Hill. Bayesian nonparametric modeling for causal inference. Journal of Computational and Graphical Statistics, 20(1):217–240, 2011.
- Learning representations for counterfactual inference. In International conference on machine learning, pages 3020–3029. PMLR, 2016.
- Causal machine learning: A survey and open problems. arXiv preprint arXiv:2206.15475, 2022.
- Probabilistic graphical models: principles and techniques. MIT press, 2009.
- Metalearners for estimating heterogeneous treatment effects using machine learning. Proceedings of the national academy of sciences, 116(10):4156–4165, 2019.
- Robert J LaLonde. Evaluating the econometric evaluations of training programs with experimental data. The American economic review, pages 604–620, 1986.
- Estimating treatment effects under heterogeneous interference. In Joint European Conference on Machine Learning and Knowledge Discovery in Databases, pages 576–592. Springer, 2023.
- Causal effect inference with deep latent-variable models. Advances in neural information processing systems, 30, 2017.
- What can be estimated? identifiability, estimability, causal inference and ill-posed inverse problems. arXiv preprint arXiv:1904.02826, 2019.
- Brady Neal. Introduction to causal inference. Course Lecture Notes (draft), 2020.
- Quasi-oracle estimation of heterogeneous treatment effects. Biometrika, 108(2):299–319, 2021.
- B-learner: Quasi-oracle bounds on heterogeneous causal effects under hidden confounding. arXiv preprint arXiv:2304.10577, 2023.
- Estimand-agnostic causal query estimation with deep causal graphs. IEEE Access, 10:71370–71386, 2022.
- Judea Pearl. Bayesian analysis in expert systems: comment: graphical models, causality and intervention. Statistical Science, 8(3):266–269, 1993.
- Judea Pearl. Causality. Cambridge university press, 2009.
- Efficient adjustment sets for population average causal treatment effect estimation in graphical models. The Journal of Machine Learning Research, 21(1):7642–7727, 2020.
- Donald B Rubin. Estimating causal effects of treatments in randomized and nonrandomized studies. Journal of educational Psychology, 66(5):688, 1974.
- Toward causal representation learning. Proceedings of the IEEE, 109(5):612–634, 2021.
- Estimating individual treatment effect: generalization bounds and algorithms. In Doina Precup and Yee Whye Teh, editors, Proceedings of the 34th International Conference on Machine Learning, volume 70 of Proceedings of Machine Learning Research, pages 3076–3085. PMLR, 06–11 Aug 2017.
- Adapting neural networks for the estimation of treatment effects. In H. Wallach, H. Larochelle, A. Beygelzimer, F. d'Alché-Buc, E. Fox, and R. Garnett, editors, Advances in Neural Information Processing Systems, volume 32. Curran Associates, Inc., 2019.
- A linear non-gaussian acyclic model for causal discovery. Journal of Machine Learning Research, 7(10), 2006.
- Learning end-to-end patient representations through self-supervised covariate balancing for causal treatment effect estimation. Journal of Biomedical Informatics, 140:104339, 2023.
- Magne Thoresen. Spurious interaction as a result of categorization. BMC medical research methodology, 19(1):1–8, 2019.
- The causal-neural connection: Expressiveness, learnability, and inference. Advances in Neural Information Processing Systems, 34:10823–10836, 2021.
- Ganite: Estimation of individualized treatment effects using generative adversarial nets. In International conference on learning representations, 2018.
- gcastle: A python toolbox for causal discovery. arXiv preprint arXiv:2111.15155, 2021.