Recovering Latent Confounders from High-dimensional Proxy Variables
Abstract: Detecting latent confounders from proxy variables is an essential problem in causal effect estimation. Previous approaches are limited to low-dimensional proxies, sorted proxies, and binary treatments. We remove these assumptions and present a novel Proxy Confounder Factorization (PCF) framework for continuous treatment effect estimation when latent confounders manifest through high-dimensional, mixed proxy variables. On synthetic datasets in the high sample size regime, both our two-step PCF implementation, which uses Independent Component Analysis (ICA-PCF), and our end-to-end implementation, which uses Gradient Descent (GD-PCF), achieve high correlation with the latent confounder and low absolute error in causal effect estimation. On real climate data, ICA-PCF recovers four components that explain $75.9\%$ of the variance in the North Atlantic Oscillation, a known confounder of precipitation patterns in Europe. Code for our PCF implementations and experiments is available at https://github.com/IPL-UV/confound_it. The proposed methodology is a stepping stone towards discovering latent confounders and can be applied to problems in disciplines that deal with high-dimensional observed proxies, e.g., spatiotemporal fields.