Causal Inference with Conditional Front-Door Adjustment and Identifiable Variational Autoencoder (2310.01937v1)
Abstract: An essential and challenging problem in causal inference is causal effect estimation from observational data. The problem becomes more difficult with the presence of unobserved confounding variables. The front-door adjustment is a practical approach for dealing with unobserved confounding variables. However, the restriction for the standard front-door adjustment is difficult to satisfy in practice. In this paper, we relax some of the restrictions by proposing the concept of conditional front-door (CFD) adjustment and develop the theorem that guarantees the causal effect identifiability of CFD adjustment. Furthermore, as it is often impossible for a CFD variable to be given in practice, it is desirable to learn it from data. By leveraging the ability of deep generative models, we propose CFDiVAE to learn the representation of the CFD adjustment variable directly from data with the identifiable Variational AutoEncoder and formally prove the model identifiability. Extensive experiments on synthetic datasets validate the effectiveness of CFDiVAE and its superiority over existing methods. The experiments also show that the performance of CFDiVAE is less sensitive to the causal strength of unobserved confounding variables. We further apply CFDiVAE to a real-world dataset to demonstrate its potential application.
- Mostly harmless econometrics: An empiricist’s companion. Princeton university press, 2009.
- Susan Athey. Beyond prediction: Using big data for policy problems. Science, 355(6324):483–485, 2017.
- Recursive partitioning for heterogeneous causal effects. Proceedings of the National Academy of Sciences, 113(27):7353–7360, 2016.
- Generalized random forests. The Annals of Statistics, 47(2):1148–1178, 2019.
- Iain Barr. Causal inference with python part 3 - frontdoor adjustment, Sep 2018. URL http://www.degeneratestate.org/posts/2018/Sep/03/causal-inference-with-python-part-3-frontdoor-adjustment/.
- Deep generalized method of moments for instrumental variable analysis. In Advances in Neural Information Processing Systems 32, NIPS, pp. 3559–3569, 2019.
- Pyro: Deep universal probabilistic programming. The Journal of Machine Learning Research, 20(1):973–978, 2019.
- Use of allele scores as instrumental variables for mendelian randomization. International Journal of Epidemiology, 42(4):1134–1144, 2013.
- Causal mediation analysis with hidden confounders. In The Fifteenth ACM International Conference on Web Search and Data Mining, WSDM, pp. 113–122, 2022.
- Double/debiased machine learning for treatment and structural parameters: Double/debiased machine learning. The Econometrics Journal, 21(1), 2018.
- Understanding and misunderstanding randomized controlled trials. Social Science & Medicine, 210:2–21, 2018.
- UCI machine learning repository, 2017. URL http://archive.ics.uci.edu/ml.
- Ronald Aylmer Fisher. Design of experiments. British Medical Journal, 1(3923):554, 1936.
- Confidence intervals for causal effects with invalid instruments by using two-stage hard thresholding with voting. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 80(4):793–815, 2018.
- Deep IV: A flexible approach for counterfactual prediction. In Proceedings of the 34th International Conference on Machine Learning, ICML, pp. 1414–1423, 2017.
- Valid causal inference with (some) invalid instruments. In Proceedings of the 38th International Conference on Machine Learning, ICML, pp. 4096–4106, 2021.
- Counterfactual regression with importance sampling weights. In Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, IJCAI, pp. 5880–5887, 2019.
- Causal Inference in Statistics, Social, and Biomedical Sciences. Cambridge University Press, 2015.
- Finding and listing front-door adjustment sets. arXiv preprint arXiv:2210.05816, 2022.
- Learning representations for counterfactual inference. In Proceedings of the 33nd International Conference on Machine Learning, ICML, pp. 3020–3029, 2016.
- Maggie Hei Greg Lewis Paul Oka Miruna Oprescu Vasilis Syrgkanis Keith Battocchi, Eleanor Dillon. EconML: A Python Package for ML-Based Heterogeneous Treatment Effects Estimation. https://github.com/microsoft/EconML, 2019. Version 0.13.
- Variational autoencoders and nonlinear ICA: A unifying framework. In The 23rd International Conference on Artificial Intelligence and Statistics, AISTATS 2020, pp. 2207–2217, 2020.
- Ivy: Instrumental variable synthesis for causal inference. In The 23rd International Conference on Artificial Intelligence and Statistics, AISTATS, pp. 398–410, 2020.
- Metalearners for estimating heterogeneous treatment effects using machine learning. Proceedings of the National Academy of Sciences, 116(10):4156–4165, 2019.
- Causal effect inference with deep latent-variable models. In Advances in Neural Information Processing Systems 30, NIPS, pp. 6446–6456, 2017.
- Measuring living standards with proxy variables. Demography, 37(2):155–174, 2000.
- Quasi-oracle estimation of heterogeneous treatment effects. Biometrika, 108(2):299–319, 2021.
- Pytorch: An imperative style, high-performance deep learning library. In Advances in Neural Information Processing Systems 32, NIPS, pp. 8024–8035, 2019.
- Judea Pearl. Causality. Cambridge university press, 2009.
- Maya L Petersen and Mark J van der Laan. Causal models and learning from data: Integrating causal modeling and statistical estimation. Epidemiology (Cambridge, Mass.), 25(3):418, 2014.
- R Core Team. R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria, 2021. URL https://www.R-project.org/.
- Estimating individual treatment effect: Generalization bounds and algorithms. In Proceedings of the 33nd International Conference on Machine Learning, ICML, pp. 3076–3085, 2017.
- Kernel instrumental variable regression. In Advances in Neural Information Processing Systems 32, NIPS, pp. 4595–4607, 2019.
- Causation, Prediction, and Search. MIT press, 2000.
- Subgroup analysis via recursive partitioning. Journal of Machine Learning Research, 10(2), 2009.
- Eric J Tchetgen Tchetgen and Ilya Shpitser. Semiparametric theory for causal mediation analysis: efficiency bounds, multiple robustness, and sensitivity analysis. Annals of statistics, 40(3):1816, 2012.
- What is the most effective intervention to increase job retention for this disabled worker? In Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, KDD, pp. 3981–3991, 2022.
- Python reference manual. Centrum voor Wiskunde en Informatica Amsterdam, 1995.
- Estimation and inference of heterogeneous treatment effects using random forests. Journal of the American Statistical Association, 113(523):1228–1242, 2018.
- Finding front-door adjustment sets in linear time. arXiv preprint arXiv:2211.16468, 2022.
- Auto iv: Counterfactual prediction via automatic instrumental variable decomposition. ACM Transactions on Knowledge Discovery from Data (TKDD), 16(4):1–20, 2022.
- A causal framework for discovering and removing direct and indirect discrimination. In Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, IJCAI 2017, pp. 3929–3935, 2017a.
- Causal modeling-based discrimination discovery and removal: criteria, bounds, and algorithms. IEEE Transactions on Knowledge and Data Engineering, 31(11):2035–2050, 2018.
- Mining heterogeneous causal effects for personalized cancer treatment. Bioinformatics, 33(15):2372–2378, 2017b.
- Treatment effect estimation with disentangled latent factors. In Thirty-Fifth AAAI Conference on Artificial Intelligence, AAAI, pp. 10923–10930, 2021.