Leveraging text data for causal inference using electronic health records (2307.03687v2)
Abstract: In studies that rely on data from electronic health records (EHRs), unstructured text data such as clinical progress notes offer a rich source of information about patient characteristics and care that may be missing from structured data. Despite the prevalence of text in clinical research, these data are often ignored for the purposes of quantitative analysis due their complexity. This paper presents a unified framework for leveraging text data to support causal inference with electronic health data at multiple stages of analysis. In particular, we consider how natural language processing and statistical text analysis can be combined with standard inferential techniques to address common challenges due to missing data, confounding bias, and treatment effect heterogeneity. Through an application to a recent EHR study investigating the effects of a non-randomized medical intervention on patient outcomes, we show how incorporating text data in a traditional matching analysis can help strengthen the validity of an estimated treatment effect and identify patient subgroups that may benefit most from treatment. We believe these methods have the potential to expand the scope of secondary analysis of clinical data to domains where structured EHR data is limited, such as in developing countries. To this end, we provide code and open-source replication materials to encourage adoption and broader exploration of these techniques in clinical research.
- Personalized medicine and the power of electronic health records. Cell, 177(1):58–69.
- Understanding of a convolutional neural network. In 2017 international conference on engineering and technology (ICET), pages 1–6. Ieee.
- Publicly available clinical bert embeddings. arXiv preprint arXiv:1904.03323.
- Ashley, E. A. (2016). Towards precision medicine. Nature Reviews Genetics, 17(9):507–522.
- The revival of the notes field: leveraging the unstructured content in electronic health records. Frontiers in medicine, 6:66.
- Austin, P. C. (2008). A critical appraisal of propensity-score matching in the medical literature between 1996 and 2003. Statistics in medicine, 27(12):2037–2049.
- Multiple imputation by chained equations: what is it and how does it work? International journal of methods in psychiatric research, 20(1):40–49.
- Double dipping in machine learning: problems and solutions. Biological psychiatry. Cognitive neuroscience and neuroimaging, 5(3):261.
- On the adaptive control of the false discovery rate in multiple testing with independent statistics. Journal of educational and Behavioral Statistics, 25(1):60–83.
- How to make causal inferences with time-series cross-sectional data under selection on observables. American Political Science Review, 112(4):1067–1082.
- Latent dirichlet allocation. Journal of Machine Learning Research, 3:993–1022.
- mice: Multivariate imputation by chained equations in r. Journal of statistical software, pages 1–68.
- Automatic identification of heart failure diagnostic criteria, using text analysis of clinical notes from electronic health records. International journal of medical informatics, 83(12):983–992.
- Some practical guidance for the implementation of propensity score matching. Journal of economic surveys, 22(1):31–72.
- Electronic health records to facilitate clinical research. Clinical Research in Cardiology, 106(1):1–9.
- Critical Data, M. (2016). Secondary analysis of electronic health records. Springer Nature.
- Text preprocessing for unsupervised learning: Why it matters, when it misleads, and what to do about it. Political Analysis, 26(2):168–189.
- How to make causal inferences using texts. Science Advances, 8(42):eabg2652.
- Evans, R. S. (2016). Electronic health records: then, now, and in the future. Yearbook of medical informatics, 25(S 01):S48–S61.
- Transthoracic echocardiography and mortality in sepsis: analysis of the mimic-iii database. Intensive care medicine, 44(6):884–892.
- Testing for heterogeneous treatment effects in experimental data: false discovery risks and correction procedures. Journal of Development Effectiveness, 6(1):44–57.
- Natural language processing: state of the art and prospects for significant progress, a workshop sponsored by the national library of medicine. Journal of biomedical informatics, 46(5):765–773.
- Machine learning for social science: An agnostic approach. Annual Review of Political Science, 24:395–419.
- A second chance to get causal inference right: a classification of data science tasks. Chance, 32(1):42–49.
- Clinicalbert: Modeling clinical notes and predicting hospital readmission. arXiv preprint arXiv:1904.05342.
- Fusion of medical imaging and electronic health records using deep learning: a systematic review and implementation guidelines. NPJ digital medicine, 3(1):1–9.
- Mining electronic health records: towards better research applications and clinical care. Nature Reviews Genetics, 13(6):395–405.
- Mimic-iii, a freely accessible critical care database. Scientific data, 3:160035.
- Personalized evidence based medicine: predictive approaches to heterogeneous treatment effects. Bmj, 363.
- A review of causal inference for biomedical informatics. Journal of biomedical informatics, 44(6):1102–1112.
- Unpredictable bias when using the missing indicator method or complete case analysis for missing confounder values: an empirical example. Journal of clinical epidemiology, 63(7):728–736.
- What is precision medicine? European respiratory journal, 50(4).
- Natural language processing of symptoms documented in free-text narratives of electronic health records: a systematic review. Journal of the American Medical Informatics Association, 26(4):364–379.
- The artificial intelligence clinician learns optimal treatment strategies for sepsis in intensive care. Nature medicine, 24(11):1716–1720.
- Propensity score analysis with partially observed covariates: how should multiple imputation be used? Statistical methods in medical research, 28(1):3–19.
- The prevention and treatment of missing data in clinical trials. New England Journal of Medicine, 367(14):1355–1360.
- An introduction to sensitivity analysis for unobserved confounding in nonexperimental prevention research. Prevention science, 14(6):570–580.
- Appropriate use and clinical impact of transthoracic echocardiography. JAMA internal medicine, 173(17):1600–1607.
- Matching with text data: An experimental evaluation of methods for matching documents and of measuring match quality. Political Analysis, 28(4):445–468.
- Insights on variance estimation for blocked and matched pairs designs. Journal of Educational and Behavioral Statistics, 46(3):271–296.
- Comments on propensity score matching following multiple imputation.
- Some methods for heterogeneous treatment effect estimation in high dimensions. Statistics in medicine, 37(11):1767–1787.
- Scalable and accurate deep learning with electronic health records. NPJ Digital Medicine, 1(1):1–10.
- News from the nih: potential contributions of the behavioral and social sciences to the precision medicine initiative. Translational behavioral medicine, 5(3):243–246.
- Adjusting for confounding with text matching. American Journal of Political Science, 64(4):887–903.
- Structural topic models for open-ended survey responses. American Journal of Political Science, 58(4):1064–1082.
- Rosenbaum, P. R. (2002). Observational studies. In Observational Studies, pages 1–17. Springer.
- Rosenbaum, P. R. (2009). Design of Observational Studies. Springer Science & Business Media. Springer Science & Business Media.
- The central role of the propensity score in observational studies for causal effects. Biometrika, 70(1):41–55.
- Rubin, D. B. (1976). Inference and missing data. Biometrika, 63(3):581–592.
- Rubin, D. B. (2004). Multiple imputation for nonresponse in surveys, volume 81. John Wiley & Sons.
- Multiple imputation in health-care databases: An overview and some applications. Statistics in medicine, 10(4):585–598.
- On the application of probability theory to agricultural experiments. Essay on principles. Section 9. Statistical Science, 5(4):465 – 472.
- Stuart, E. A. (2010). Matching methods for causal inference: A review and a look forward. Statistical science: a review journal of the Institute of Mathematical Statistics, 25(1):1.
- Taddy, M. (2013). Multinomial inverse regression for text analysis. Journal of the American Statistical Association, 108(503):755–770.
- Taddy, M. (2015). Distributed multinomial regression. The Annals of Applied Statistics, 9(3):1394–1414.
- Challenges and opportunities beyond structured data in analysis of electronic health records. Wiley Interdisciplinary Reviews: Computational Statistics, 13(6):e1549.
- Comparing methods for estimation of heterogeneous treatment effects using observational data from health care databases. Statistics in medicine, 37(23):3309–3324.
- Bias and efficiency of multiple imputation compared with complete-case analysis for missing covariate values. Statistics in medicine, 29(28):2920–2931.
- Multiple imputation using chained equations: issues and guidance for practice. Statistics in medicine, 30(4):377–399.
- Youden, W. J. (1950). Index for rating diagnostic tests. Cancer, 3(1):32–35.
- Reagan Mozer (4 papers)
- Aaron R. Kaufman (3 papers)
- Leo A. Celi (9 papers)
- Luke Miratrix (33 papers)