Multi-Study R-Learner for Estimating Heterogeneous Treatment Effects Across Studies Using Statistical Machine Learning
Abstract: Estimating heterogeneous treatment effects (HTEs) is crucial for precision medicine. While multiple studies can improve the generalizability of results, leveraging them for estimation is statistically challenging. Existing approaches often assume identical HTEs across studies, but this may be violated due to various sources of between-study heterogeneity, including differences in study design, study populations, and data collection protocols, among others. To this end, we propose a framework for multi-study HTE estimation that accounts for between-study heterogeneity in the nuisance functions and treatment effects. Our approach, the multi-study R-learner, extends the R-learner to obtain principled statistical estimation with ML in the multi-study setting. It involves a data-adaptive objective function that links study-specific treatment effects with nuisance functions through membership probabilities, which enable information to be borrowed across potentially heterogeneous studies. The multi-study R-learner framework can combine data from randomized controlled trials, observational studies, or a combination of both. It's easy to implement and flexible in its ability to incorporate ML for estimating HTEs, nuisance functions, and membership probabilities. In the series estimation framework, we show that the multi-study R-learner is asymptotically normal and more efficient than the R-learner when there is between-study heterogeneity in the propensity score model under homoscedasticity. We illustrate using cancer data that the proposed method performs favorably compared to existing approaches in the presence of between-study heterogeneity.
- All of Us Research Program Investigators (2019). The ”all of us” research program. New England Journal of Medicine 381, 668–676.
- Some new asymptotic theory for least squares series: Pointwise and uniform results. Journal of Econometrics 186, 345–366.
- Biobank, U. (2014). About uk biobank.
- Methods for integrating trials and non-experimental data to examine treatment effect heterogeneity. arXiv preprint arXiv:2302.13428 .
- Generalizing evidence from randomized trials using inverse probability of sampling weights. Journal of the Royal Statistical Society: Series A (Statistics in Society) 181, 1193–1209.
- Double/debiased machine learning for treatment and structural parameters.
- A new initiative on precision medicine. New England journal of medicine 372, 793–795.
- Causal inference methods for combining randomized trials and observational studies: a review. Statistical science 39, 165–191.
- Extending inferences from a randomized trial to a target population. European journal of epidemiology 34, 719–722.
- Toward personalizing care: assessing heterogeneity of treatment effects in randomized trials. JAMA 329, 1063–1065.
- Efficient and robust methods for causally interpretable meta-analysis: transporting inferences from multiple randomized trials to a target population. arXiv preprint arXiv:1908.09230 .
- A review of generalizability and transportability. Annual Review of Statistics and Its Application 10, 501–524.
- Covariate selection for generalizing experimental results: application to a large-scale development program in uganda. Journal of the Royal Statistical Society Series A: Statistics in Society 184, 1524–1548.
- curatedovariandata: clinically annotated data for the ovarian cancer transcriptome. Database 2013,.
- Merging versus ensembling in multi-study machine learning: Theoretical insight from random effects. arXiv preprint arXiv:1905.07382 .
- The use of targeted therapies for precision medicine in oncology. Clinical Chemistry 62, 1556–1564.
- From sample average treatment effect to population average treatment effect on the treated: combining experimental with observational studies to estimate population treatment effects. Journal of the Royal Statistical Society Series A: Statistics in Society 178, 757–778.
- A genomic predictor of response and survival following taxane-anthracycline chemotherapy for invasive breast cancer. Jama 305, 1873–1881.
- Predicting the efficacy of future training programs using past experiences at other locations. Journal of econometrics 125, 241–270.
- Impact of breast cancer subtypes on prognosis of women with operable invasive breast cancer: a population-based study using seer database. Clinical Cancer Research 25, 1970–1979.
- Estimating treatment effect heterogeneity in randomized program evaluation.
- Removing hidden confounding by experimental grounding. Advances in neural information processing systems 31,.
- Annual review of statistics and its application. Precis Med 6, 263–286.
- Genome, transcriptome and proteome: the rise of omics data and their integration in biomedical sciences. Briefings in bioinformatics 19, 286–302.
- Design of a national distributed health data network. Annals of internal medicine 151, 341–344.
- Genomic predictors of response to doxorubicin versus docetaxel in primary breast cancer. Breast cancer research and treatment 128, 127–136.
- Communication-efficient learning of deep networks from decentralized data. In Artificial intelligence and statistics, pages 1273–1282. PMLR.
- Quasi-oracle estimation of heterogeneous treatment effects. Biometrika 108, 299–319.
- Training replicable predictors in multiple studies. Proceedings of the National Academy of Sciences 115, 2578–2583.
- Package ‘curatedbreastdata’.
- Some methods for heterogeneous treatment effect estimation in high dimensions. Statistics in medicine 37, 1767–1787.
- Tree-weighting for multi-study ensemble learners. In Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing, volume 25, page 451. NIH Public Access.
- Cross-study learning for generalist and specialist predictions. arXiv preprint arXiv:2007.12807 .
- Robinson, P. M. (1988). Root-n-consistent semiparametric regression. Econometrica: Journal of the Econometric Society pages 931–954.
- Rubin, D. B. (1974). Estimating causal effects of treatments in randomized and nonrandomized studies. Journal of educational Psychology 66, 688.
- Schick, A. (1986). On asymptotically efficient estimation in semiparametric models. The Annals of Statistics pages 1139–1151.
- Estimating individual treatment effect: generalization bounds and algorithms. In International Conference on Machine Learning, pages 3076–3085. PMLR.
- Multi-study boosting: Theoretical considerations for merging vs. ensembling. arXiv preprint arXiv:2207.04588 .
- Development of the 21-gene assay and its application in clinical practice and clinical trials. Journal of Clinical Oncology 26, 721–728.
- The use of propensity scores to assess the generalizability of results from randomized trials. Journal of the Royal Statistical Society Series A: Statistics in Society 174, 369–386.
- A tree-based model averaging approach for personalized treatment effect estimation from heterogeneous data sources. In International Conference on Machine Learning, pages 21013–21036. PMLR.
- Tipton, E. (2013). Improving generalizations from experiments using propensity score subclassification: Assumptions, properties, and contexts. Journal of Educational and Behavioral Statistics 38, 239–266.
- An adaptive kernel approach to federated learning of heterogeneous causal effects. Advances in Neural Information Processing Systems 35, 24459–24473.
- Estimation and inference of heterogeneous treatment effects using random forests. Journal of the American Statistical Association 113, 1228–1242.
- Wasserman, L. (2006). All of nonparametric statistics. Springer Science & Business Media.
- Integrative r𝑟ritalic_r-learner of heterogeneous treatment effects combining experimental and observational studies. In First Conference on Causal Learning and Reasoning.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.