What Makes Forest-Based Heterogeneous Treatment Effect Estimators Work? (2206.10323v2)
Abstract: Estimation of heterogeneous treatment effects (HTE) is of prime importance in many disciplines, ranging from personalized medicine to economics among many others. Random forests have been shown to be a flexible and powerful approach to HTE estimation in both randomized trials and observational studies. In particular "causal forests", introduced by Athey, Tibshirani and Wager (2019), along with the R implementation in package grf were rapidly adopted. A related approach, called "model-based forests", that is geared towards randomized trials and simultaneously captures effects of both prognostic and predictive variables, was introduced by Seibold, Zeileis and Hothorn (2018) along with a modular implementation in the R package model4you. Here, we present a unifying view that goes beyond the theoretical motivations and investigates which computational elements make causal forests so successful and how these can be blended with the strengths of model-based forests. To do so, we show that both methods can be understood in terms of the same parameters and model assumptions for an additive model under L2 loss. This theoretical insight allows us to implement several flavors of "model-based causal forests" and dissect their different elements in silico. The original causal forests and model-based forests are compared with the new blended versions in a benchmark study exploring both randomized trials and observational settings. In the randomized setting, both approaches performed akin. If confounding was present in the data generating process, we found local centering of the treatment indicator with the corresponding propensities to be the main driver for good performance. Local centering of the outcome was less important, and might be replaced or enhanced by simultaneous split selection with respect to both prognostic and predictive effects.
- “Machine Learning Approach for the Prediction of Postpartum Hemorrhage in Vaginal Birth.” Scientific Reports, 11, 22620. 10.1038/s41598-021-02198-y.
- Antoine C, Young BK (2021). “Cesarean Section one Hundred Years 1920–2020: the Good, the Bad and the Ugly.” Journal of Perinatal Medicine, 49(1), 5–16. doi:10.1515/jpm-2020-0305.
- “Generalized Random Forests.” The Annals of Statistics, 47(2), 1148–1178. 10.1214/18-aos1709.
- Athey S, Wager S (2019). “Estimating Treatment Effects with Causal Forests: An Application.” Observational Studies, 5(2), 37–51. 10.1353/obs.2019.0001.
- Breiman L (2001). “Random Forests.” Machine Learning, 45(1), 5–32. 10.1023/a:1010933404324.
- \pkgglmmTMB: Generalized Linear Mixed Models Using Template Model Builder. \proglangR package version 1.1.2, URL https://CRAN.R-project.org/package=glmmTMB.
- Buri M, Hothorn T (2020). “Model-Based Random Forests for Ordinal Regression.” International Journal of Biostatistics, 16(2), 20190063. 10.1515/ijb-2019-0063.
- “Double/Debiased Machine Learning for Treatment and Structural Parameters.” The Econometrics Journal, 21(1), C1–C68. 10.1111/ectj.12097.
- “BART: Bayesian Additive Regression Trees.” The Annals of Applied Statistics, 4(1), 266–298. 10.1214/09-aoas285.
- “Estimating Heterogeneous Treatment Effects with Right-Censored Data via Causal Survival Forests.” arXiv 2001.09887 v3, arXiv.org E-Print Archive. 10.48550/arXiv.2001.09887. URL https://arxiv.org/abs/2001.09887.
- “Risk Estimation Using Probability Machines.” BioData Mining, 7(2), 2. 10.1186/1756-0381-7-2.
- Ende HB (2022). “Risk Assessment Tools to Predict Postpartum Hemorrhage.” Best Practice & Research Clinical Anaesthesiology. 10.1016/j.bpa.2022.08.003. Online first.
- Erickson EN, Carlson NS (2020). “Predicting Postpartum Hemorrhage After Low-Risk Vaginal Birth by Labor Characteristics and Oxytocin Administration.” Journal of Obstetric, Gynecologic & Neonatal Nursing, 49(6), 549–563. 10.1016/j.jogn.2020.08.005.
- Farouki RT (2012). “The Bernstein Polynomial Basis: A Centennial Retrospective.” Computer Aided Geometric Design, 29(6), 379–419. 10.1016/j.cagd.2012.03.001.
- “Generalized Propensity Score for Estimating the Average Treatment Effect of Multiple Treatments.” Statistics in Medicine, 31(7), 681–697. 10.1002/sim.4168.
- “Detecting Treatment-Subgroup Interactions in Clustered Data with Generalized Linear Mixed-Effects Model Trees.” Behavior Research Methods, 50(6), 2016–2034. 10.3758/s13428-017-0971-x.
- “Subgroup Identification from Randomized Clinical Trial Data.” Statistics in Medicine, 30(24), 2867–2880. 10.1002/sim.4322.
- “The Impact of Prepartum Factor XIII Activity on Postpartum Blood Loss.” Journal of Thrombosis and Haemostasis, 18, 1310–1319. 10.1111/jth.14795.
- Hill JL (2011). “Bayesian Nonparametric Modeling for Causal Inference.” Journal of Computational and Graphical Statistics, 20(1), 217–240. 10.1198/jcgs.2010.08162.
- Hothorn T (2021). \pkgtrtf: Transformation Trees and Forests. \proglangR package version 0.3-8, URL http://ctm.R-forge.R-project.org.
- \pkgmultcomp: Simultaneous Inference in General Parametric Models. \proglangR package version 1.4-17, URL https://CRAN.R-project.org/package=multcomp.
- ‘‘Unbiased Recursive Partitioning: A Conditional Inference Framework.” Journal of Computational and Graphical Statistics, 15(3), 651–674. 10.1198/106186006x133933.
- “Bagging Survival Trees.” Statistics in Medicine, 23(1), 77–91. 10.1002/sim.1593.
- “Most Likely Transformations.” Scandinavian Journal of Statistics, 45(1), 110–134. 10.1111/sjos.12291.
- Hothorn T, Zeileis A (2015). “\pkgpartykit: A Modular Toolkit for Recursive Partytioning in \proglangR.” Journal of Machine Learning Research, 16, 3905–3909. URL https://jmlr.org/papers/v16/hothorn15a.html.
- Hothorn T, Zeileis A (2021a). \pkgpartykit: A Toolkit for Recursive Partytioning. \proglangR package version 1.2-15, URL http://partykit.r-forge.r-project.org/partykit/.
- Hothorn T, Zeileis A (2021b). “Predictive Distribution Modelling Using Transformation Forests.” Journal of Computational and Graphical Statistics, 14, 144–148. 10.1080/10618600.2021.1872581.
- Imbens G, Athey S (2016). “Recursive Partitioning for Heterogeneous Causal Effects.” Proceedings of the National Academy of Sciences of the United States of America, 113(27), 7353–7360. 10.1073/pnas.1510489113.
- Imbens GW, Rubin DW (2015). Causal Inference for Statistics, Social, and Biomedical Sciences. Cambridge University Press, New York.
- “Random Survival Forests.” The Annals of Applied Statistics, 2(3), 841–860. 10.1214/08-aoas169.
- Ishwaran H, Malley JD (2014). “Synthetic Learning Machines.” BioData Mining, 7(28). 10.1186/s13040-014-0028-y.
- “Validation of a Quantitative System for Real-time Measurement of Postpartum Blood Loss.” Archives of Gynecology and Obstetrics, 298, 1071–1077. 10.1007/s00404-018-4896-0.
- “Evaluation of Risk-Assessment Tools for Severe Postpartum Hemorrhage in Women Undergoing Cesarean Delivery.” Obstetrics & Gynecology, 134(6), 1308–1316. 10.1097/AOG.0000000000003574.
- “Survival Forests under Test: Impact of the Proportional Hazards Assumption on Prognostic and Predictive Forests for ALS Survival.” Statistical Methods in Medical Research, 29(5), 1403–1419. 10.1177/0962280219862586.
- “Metalearners for Estimating Heterogeneous Treatment Effects Using Machine Learning.” Proceedings of the National Academy of Sciences of the United States of America, 116(10), 4156–4165. 10.1073/pnas.1804597116.
- Lin Y, Jeon Y (2006). “Random Forests and Adaptive Nearest Neighbors.” Journal of the American Statistical Association, 101(474), 578–590. 10.1198/016214505000001230.
- “Estimating Individual Treatment Effect in Observational Data Using Random Forest Methods.” Journal of Computational and Graphical Statistics, 27(1), 209–219. 10.1080/10618600.2017.1356325.
- “Recent Increases in the U.S. Maternal Mortality Rate: Disentangling Trends From Measurement Issues.” Obstetrics & Gynecology, 128(3), 447–455. 10.1097/AOG.0000000000001556.
- “Doubly Robust Treatment Effect Estimation with Missing Attributes.” The Annals of Applied Statistics, 14(3), 1409–1431. 10.1214/20-aoas1356.
- Meinshausen N (2006). “Quantile Regression Forests.” Journal of Machine Learning Research, 7, 983–999. 10.1007/s10994-014-5452-1.
- Nie X, Wager S (2021). “Quasi-Oracle Estimation of Heterogeneous Treatment Effects.” Biometrika, 108(2), 299–319. 10.1093/biomet/asaa076.
- “Some Methods for Heterogeneous Treatment Effect Estimation in High Dimensions.” Statistics in Medicine, 37(11), 1767–1787. 10.1002/sim.7623.
- “Tunability: Importance of Hyperparameters of Machine Learning Algorithms.” Journal of Machine Learning Research, 20(1), 1934–1965.
- “Hyperparameters and tuning strategies for random forest.” WIREs Data Mining and Knowledge Discovery, 9(3). 10.1002/widm.1301.
- \proglangR Core Team (2021). \proglangR: A Language and Environment for Statistical Computing. \proglangR Foundation for Statistical Computing, Vienna, Austria. URL https://www.R-project.org/.
- Robinson PM (1988). “Root-N-Consistent Semiparametric Regression.” Econometrica, 56(4), 931–954. 10.2307/1912705.
- Rosenbaum PR, Rubin DB (1983). “The Central Role of the Propensity Score in Observational Studies for Causal Effects.” Biometrika, 70(1), 41–55. 10.1093/biomet/70.1.41.
- “Global Causes of Maternal Death: a WHO Systematic Analysis.” The Lancet Global Health, 2(6), e323–e333. 10.1016/S2214-109X(14)70227-X.
- “Distributional Regression Forests for Probabilistic Precipitation Forecasting in Complex Terrain.” The Annals of Applied Statistics, 13(3), 1564–1589. 10.1214/19-AOAS1247.
- “Subgroup Inference for Multiple Treatments and Multiple Endpoints in an Alzheimer’s Disease Treatment Trial.” The Annals of Applied Statistics, 11(2), 949–966. 10.1214/17-aoas1024.
- “Model-Based Recursive Partitioning for Subgroup Analyses.” International Journal of Biostatistics, 12(1), 45–63. 10.1515/ijb-2015-0032.
- ‘‘Individual Treatment Effect Prediction for Amyotrophic Lateral Sclerosis Patients.” Statistical Methods in Medical Research, 27(10), 3104–3125. 10.1177/0962280217693034.
- “\pkgmodel4you: An \proglangR Package for Personalised Treatment Effect Estimation.” Journal of Open Research Software, 7(17), 1–6. 10.5334/jors.219.
- \pkgmodel4you: Stratified and Personalised Models Based on Model-Based Trees and Forests. \proglangR package version 0.9-7, URL https://CRAN.R-project.org/package=model4you.
- ‘‘Targeted Smooth Bayesian Causal Forests: An Analysis of Heterogeneous Treatment Effects for Simultaneous vs. Interval Medical Abortion Regimens Over Gestation.” The Annals of Applied Statistics, 15(3), 1194–1219. 10.1214/20-AOAS1438.
- Tang F, Ishwaran H (2017). “Random Forest Missing Data Algorithms.” Statistical Analysis and Data Mining, 10(6), 363–377. 10.1002/sam.11348.
- \pkggrf: Generalized Random Forests. \proglangR package version 2.0.2, URL https://CRAN.R-project.org/package=grf.
- ‘‘Machine Learning and Statistical Models to Predict Postpartum Hemorrhage.” Obstetrics & Gynecology, 135(4), 935–944. 10.1097/AOG.0000000000003759.
- Wager S, Athey S (2018). “Estimation and Inference of Heterogeneous Treatment Effects Using Random Forests.” Journal of the American Statistical Association, 113(523), 1228–1242. 10.1080/01621459.2017.1319839.
- WHO (2012). “WHO Recommendations for the Prevention and Treatment of Postpartum Haemorrhage.” World Health Organization, Geneva, Switzerland.
- “Using Propensity Score Subclassification for Multiple Treatment Doses to Evaluate a National Antidrug Media Campaign.” Journal of Educational and Behavioral Statistics, 30(1), 59–73. 10.3102/10769986030001059.
- “Model-Based Recursive Partitioning.” Journal of Computational and Graphical Statistics, 17(2), 492–514. 10.1198/106186008x319331.
- Susanne Dandl (12 papers)
- Torsten Hothorn (32 papers)
- Heidi Seibold (12 papers)
- Erik Sverdrup (9 papers)
- Stefan Wager (72 papers)
- Achim Zeileis (30 papers)