Learning Optimal Dynamic Treatment Regimes Using Causal Tree Methods in Medicine (2204.07124v2)
Abstract: Dynamic treatment regimes (DTRs) are used in medicine to tailor sequential treatment decisions to patients by considering patient heterogeneity. Common methods for learning optimal DTRs, however, have shortcomings: they are typically based on outcome prediction and not treatment effect estimation, or they use linear models that are restrictive for patient data from modern electronic health records. To address these shortcomings, we develop two novel methods for learning optimal DTRs that effectively handle complex patient data. We call our methods DTR-CT and DTR-CF. Our methods are based on a data-driven estimation of heterogeneous treatment effects using causal tree methods, specifically causal trees and causal forests, that learn non-linear relationships, control for time-varying confounding, are doubly robust, and explainable. To the best of our knowledge, our paper is the first that adapts causal tree methods for learning optimal DTRs. We evaluate our proposed methods using synthetic data and then apply them to real-world data from intensive care units. Our methods outperform state-of-the-art baselines in terms of cumulative regret and percentage of optimal decisions by a considerable margin. Our work improves treatment recommendations from electronic health record and is thus of direct relevance for personalized medicine.
- Analyzing patient trajectories with artificial intelligence. Journal of Medical Internet Research, 23(12):e29812, 2021.
- Explainability for artificial intelligence in healthcare: a multidisciplinary perspective. BMC Medical Informatics and Decision Making, 20(1):1–9, 2020.
- Recursive partitioning for heterogeneous causal effects. PNAS, 113(27):7353–7360, 2016.
- Estimating treatment effects with causal forests: An application. Observational Studies, 5(2):37–51, 2019.
- Generalized random forests. The Annals of Statistics, 47(2):1148–1178, 2018.
- Alina Beygelzimer et al. FNN: Fast nearest neighbor search algorithms and applications. R package, 2013.
- Classification and regression trees. Wadsworth, 1984.
- Deep jump learning for off-policy evaluation in continuous treatment settings. Advances in Neural Information Processing Systems, 34, 2021.
- Bibhas Chakraborty and EE Moodie. Statistical methods for dynamic treatment regimes. Springer, 2013.
- Machine intelligence in healthcare: Perspectives on trustworthiness, explainability, usability, and transparency. npj Digital Medicine, 3(1), 2020.
- Sequential deconfounding for causal inference with unobserved confounders. arXiv:2104.09323, 2021.
- Doubly-robust dynamic treatment regimen estimation for binary outcomes. arXiv preprint arXiv:2203.08269, 2022.
- Alistair Johnson et al. MIMIC-III, A freely accessible critical care database. Scientific Data, 3(1), 2016.
- Precision medicine. Annual Review of Statistics and its Application, 6:263–286, 2019.
- Deconfounding temporal autoencoder: Estimating treatment effects over time using noisy proxies. In ML4H, 2021.
- Reinforcement learning for clinical decision support in critical care: comprehensive review. Journal of Medical Internet Research, 22(7):e18477, 2020.
- Ying Liu et al. Deep reinforcement learning for dynamic treatment regimes on medical registry data. In ICHI, 2017.
- Demystifying optimal dynamic treatment regimes. Biometrics, 63(2):447–455, 2007.
- Q-learning for estimating optimal dynamic treatment rules from observational data. Canadian Journal of Statistics, 40(4):629–645, 2012.
- Susan Murphy. Optimal dynamic treatment regimes. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 65(2):331–355, 2003.
- Marginal mean models for dynamic regimes. Journal of the American Statistical Association, 96(456):1410–1423, 2001.
- Jersey Neyman. Sur les applications de la théorie des probabilités aux experiences agricoles: Essai des principes. Roczniki Nauk Rolniczych, 10:1–51, 1923.
- AttDMM: An attentive deep Markov model for risk scoring in intensive care units. KDD, 2021.
- A reinforcement learning approach to weaning of mechanical ventilation in intensive care units. Conference on Uncertainty in Artificial Intelligence, 2017.
- Deep reinforcement learning for sepsis treatment. arXiv preprint arXiv:1711.09602, 2017.
- ”Why should I trust you?” explaining the predictions of any classifier. In ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2016.
- James Robins. Causal inference from complex longitudinal data. In Latent variable modeling and applications to causality, pages 69–117. New York: Springer, 1997.
- James Robins. Optimal structural nested models for optimal sequential decisions. In Proceedings of the Second Seattle Symposium in Biostatistics, 2004.
- Estimation of the causal effects of time-varying exposures. In Garrett Fitzmaurice, Marie Davidian, Geert Verbeke, and Geert Molenberghs, editors, Advances in Longitudinal Data Analysis, pages 553–599. Boca Raton, FL: CRC press, 2009.
- The central role of the propensity score in observational studies for causal effects. Biometrika, 70(1):41–55, 1983.
- Donald Rubin. Estimating causal effects of treatments in randomized and nonrandomized studies. Journal of Educational Psychology, 66(5):688, 1974.
- Cynthia Rudin. Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead. Nature Machine Intelligence, 1(5):206–215, 2019.
- Kelly Speth and Lu Wang. Restricted sub-tree learning to estimate an optimal dynamic treatment regime using observational data. Statistics in Medicine, 40(26):5796–5812, 2021.
- Assessment of tree-based statistical learning to estimate optimal personalized treatment decision rules for traumatic finger amputations. JAMA Network Open, 3(2):e1921626–e1921626, 2020.
- Yilun Sun and Lu Wang. Stochastic tree search for estimating optimal dynamic treatment regimes. Journal of the American Statistical Association, 116(533):421–432, 2021.
- Tree-based reinforcement learning for estimating optimal dynamic treatment regimes. The Annals of Applied Statistics, 12(3):1914–1938, 2018.
- Anastasios Tsiatis et al. Dynamic treatment regimes: Statistical methods for precision medicine. Boca Raton, FL: CRC Press, 2019.
- Estimation and inference of heterogeneous treatment effects using random forests. Journal of the American Statistical Association, 113(523):1228–1242, 2018.
- Doubly-robust dynamic treatment regimen estimation via weighted least squares. Biometrics, 71(3):636–644, 2015.
- Dynamic treatment regimen estimation via regression-based techniques: Introducing R package DTRreg. Journal of Statistical Software, 80(2):1–20, 2017. URL https://www.jstatsoft.org/index.php/jss/article/view/v080i02.
- Supervised reinforcement learning with recurrent neural network for dynamic treatment recommendation. In ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2018.
- Shirly Wang et al. MIMIC-extract: A data extraction, preprocessing, and representation pipeline for MIMIC-III. In CHIL, 2020.
- Reinforcement learning with action-derived rewards for chemotherapy and clinical trial dosing regimen selection. In Machine Learning for Healthcare Conference, pages 161–226. PMLR, 2018.
- Reinforcement learning in healthcare: A survey. ACM Computing Surveys (CSUR), 55(1):1–36, 2021.
- Yichi Zhang et al. Interpretable dynamic treatment regimes. Journal of the American Statistical Association, 113(524):1541–1549, 2018.
- Ying-Qi Zhao et al. New statistical learning methods for estimating optimal dynamic treatment regimes. Journal of the American Statistical Association, 110(510):583–598, 2015.
- Recent development on statistical methods for personalized medicine discovery. Frontiers of Medicine, 7(1):102–110, 2013.
- Theresa Blümlein (1 paper)
- Joel Persson (5 papers)
- Stefan Feuerriegel (117 papers)