Proximal Causal Inference With Text Data (2401.06687v3)
Abstract: Recent text-based causal methods attempt to mitigate confounding bias by estimating proxies of confounding variables that are partially or imperfectly measured from unstructured text data. These approaches, however, assume analysts have supervised labels of the confounders given text for a subset of instances, a constraint that is sometimes infeasible due to data privacy or annotation costs. In this work, we address settings in which an important confounding variable is completely unobserved. We propose a new causal inference method that uses two instances of pre-treatment text data, infers two proxies using two zero-shot models on the separate instances, and applies these proxies in the proximal g-formula. We prove, under certain assumptions about the instances of text and accuracy of the zero-shot predictions, that our method of inferring text-based proxies satisfies identification conditions of the proximal g-formula while other seemingly reasonable proposals do not. To address untestable assumptions associated with our method and the proximal g-formula, we further propose an odds ratio falsification heuristic that flags when to proceed with downstream effect estimation using the inferred proxies. We evaluate our method in synthetic and semi-synthetic settings -- the latter with real-world clinical notes from MIMIC-III and open LLMs for zero-shot prediction -- and find that our method produces estimates with low bias. We believe that this text-based design of proxies allows for the use of proximal causal inference in a wider range of scenarios, particularly those for which obtaining suitable proxies from structured data is difficult.
- Identification of causal effects using instrumental variables. Journal of the American Statistical Association, 91(434):444–455.
- Rohit Bhattacharya and Razieh Nabi. 2022. On testability of the front-door model via Verma constraints. In Proceedings of the 38th Conference On Uncertainty in Artificial Intelligence, pages 202–212. PMLR.
- Language models are few-shot learners. Advances in Neural Information Processing Systems, 33:1877–1901.
- Hua Yun Chen. 2007. A semiparametric odds ratio model for measuring association. Biometrics, 63:413–421.
- Causal inference with outcome-dependent missingness and self-censoring. In Proceedings of the 39th Conference on Uncertainty in Artificial Intelligence, pages 358–368. PMLR.
- Scaling instruction-finetuned language models. arXiv preprint arXiv:2210.11416.
- How to make causal inferences using texts. Science Advances, 8(42):eabg2652.
- Using imperfect surrogates for downstream inference: Design-based supervised learning for social science applications of large language models. In Thirty-seventh Conference on Neural Information Processing Systems.
- Causal inference in natural language processing: Estimation, prediction, interpretation and beyond. Transactions of the Association for Computational Linguistics, 10:1138–1158.
- Christian Fong and Matthew Tyler. 2021. Machine learning predictions as regression covariates. Political Analysis, 29(4):467–484.
- The case for evaluating causal models using interventional measures and empirical data. Advances in Neural Information Processing Systems, 32.
- Paul W Holland. 1986. Statistics and causal inference. Journal of the American Statistical Association, 81(396):945–960.
- MIMIC-III, a freely accessible critical care database. Scientific Data, 3(1):1–9.
- Text and causal inference: A review of using text to remove confounding from causal estimates. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pages 5332–5344.
- RCT rejection sampling for causal estimation evaluation. (Forthcoming) Transactions of Machine Learning Research.
- Using longitudinal social media analysis to understand the effects of early college alcohol use. In Proceedings of the International AAAI Conference on Web and Social Media, volume 12.
- Testing causal theories with learned proxies. Annual Review of Political Science, 25:419–441.
- Manabu Kuroki and Judea Pearl. 2014. Measurement bias and effect restoration in causal inference. Biometrika, 101(2):423–437.
- Kalle Leppälä. 2023. Sensitivity analysis on odds ratios. American Journal of Epidemiology, page kwad137.
- An introduction to sensitivity analysis for unobserved confounding in nonexperimental prevention research. Prevention Science, 14:570–580.
- Proximal causal learning with kernels: Two-stage estimation and moment restriction. In International conference on machine learning, pages 7512–7523. PMLR.
- Identifying causal effects with proxy variables of an unmeasured confounder. Biometrika, 105(4):987–993.
- Leveraging text data for causal inference using electronic health records. arXiv preprint arXiv:2307.03687.
- Distilling the outcomes of personal experiences: A propensity-scored analysis of social media. In Proceedings of the 2017 ACM Conference on Computer Supported Cooperative Work and Social Computing, pages 370–386.
- Judea Pearl. 1995. Causal diagrams for empirical research. Biometrika, 82(4):669–688.
- Judea Pearl. 2009. Causality: Models, Reasoning, and Inference. Cambridge University Press.
- Judea Pearl. 2010. On measurement bias in causal inference. In Proceedings of the Twenty-Sixth Conference on Uncertainty in Artificial Intelligence, pages 425–432.
- Causal effects of linguistic properties. In Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pages 4095–4109.
- Adjusting for confounding with text matching. American Journal of Political Science, 64(4):887–903.
- Multitask prompted training enables zero-shot task generalization. In International Conference on Learning Representations.
- The proximal ID algorithm. Journal of Machine Learning Research, 24(188):1–46.
- Causation, Prediction, and Search. MIT press.
- Dhanya Sridhar and Lise Getoor. 2019. Estimating causal effects of tone in online debates. In Proceedings of the 28th International Joint Conference on Artificial Intelligence, pages 1872–1878.
- Elijah Tamarchenko. 2023. Combining optimal adjustment set selection and post selection inference in unknown causal graphs. Undergraduate thesis, Williams College.
- On doubly robust estimation in a semiparametric odds ratio model. Biometrika, 97(1):171–180.
- An introduction to proximal causal learning. arXiv preprint arXiv:2009.10982.
- Tyler J VanderWeele. 2008. The sign of the bias of unmeasured confounding. Biometrics, 64(3):702–706.
- Adapting text embeddings for causal inference. In Conference on Uncertainty in Artificial Intelligence, pages 919–928. PMLR.
- On falsification of the binary instrumental variable model. Biometrika, 104(1):229–236.
- Uncertainty estimation and reduction of pre-trained models for text regression. Transactions of the Association for Computational Linguistics, 10:680–696.
- Larry Wasserman. 2004. All of statistics: a concise course in statistical inference, volume 26. Springer.
- Finetuned language models are zero-shot learners. In International Conference on Learning Representations.
- Challenges of using text classifiers for causal inference. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, pages 4586–4598.
- Benchmarking zero-shot text classification: Datasets, evaluation and entailment approach. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), pages 3914–3923.
- Thrombolysis in acute ischaemic stroke: An update. Therapeutic Advances in Chronic Disease, 2(2):119–131.
- Uncovering interpretable potential confounders in electronic medical records. Nature Communications, 13(1):1014.
- Causal matching with text embeddings: A case study in estimating the causal effects of peer review policies. In Findings of the Association for Computational Linguistics: ACL 2023, pages 1284–1297.
- Can large language models transform computational social science? arXiv preprint arXiv:2305.03514.