Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
126 tokens/sec
GPT-4o
28 tokens/sec
Gemini 2.5 Pro Pro
42 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Multiply Robust Federated Estimation of Targeted Average Treatment Effects (2309.12600v1)

Published 22 Sep 2023 in stat.ML, cs.LG, math.ST, stat.ME, and stat.TH

Abstract: Federated or multi-site studies have distinct advantages over single-site studies, including increased generalizability, the ability to study underrepresented populations, and the opportunity to study rare exposures and outcomes. However, these studies are challenging due to the need to preserve the privacy of each individual's data and the heterogeneity in their covariate distributions. We propose a novel federated approach to derive valid causal inferences for a target population using multi-site data. We adjust for covariate shift and covariate mismatch between sites by developing multiply-robust and privacy-preserving nuisance function estimation. Our methodology incorporates transfer learning to estimate ensemble weights to combine information from source sites. We show that these learned weights are efficient and optimal under different scenarios. We showcase the finite sample advantages of our approach in terms of efficiency and robustness compared to existing approaches.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (31)
  1. Generalizing evidence from randomized trials using inverse probability of sampling weights. Journal of the Royal Statistical Society. Series A,(Statistics in Society), 181(4):1193, 2018.
  2. K. C. G. Chan. A simple multiply robust estimator for missing response problem. Stat, 2(1):143–149, 2013.
  3. Oracle, multiple robust and multipurpose calibration in a missing response problem. Statistical Science, 29(3):380–396, 2014.
  4. S. Chen and D. Haziza. Multiply robust imputation procedures for the treatment of item nonresponse in surveys. Biometrika, 104(2):439–453, 2017.
  5. Generalizing evidence from randomized clinical trials to target populations: the actg 320 trial. American journal of epidemiology, 172(1):107–115, 2010.
  6. Extending inferences from a randomized trial to a new target population. Statistics in Medicine, 39(14):1999–2014, 2020.
  7. Generalizing causal inferences from individuals in randomized trials to all trial-eligible individuals. Biometrics, 75(2):685–694, 2019.
  8. Learning from local to global-an efficient distributed algorithm for modeling time-to-event data. bioRxiv, 2020.
  9. A fast score test for generalized mixture models. Biometrics, 76:811–820, 2021.
  10. Y. Gu and H. Zou. Aggregated expectile regression by exponential weighting. Statistica Sinica, 29(2):671–692, 2019.
  11. Multi-source causal inference using control variates. arXiv preprint arXiv:2103.16689, 2021.
  12. Federated adaptive causal estimation (face) of target treatment effects. arXiv preprint arXiv:2112.09313, 2021.
  13. Privacy-preserving and communication-efficient causal inference for hospital quality measurement. arXiv preprint arXiv:2203.00768, 2022.
  14. P. Han. A further study of the multiply robust estimator in missing data analysis. Journal of Statistical Planning and Inference, 148:101–110, 2014.
  15. P. Han. Multiply robust estimation in regression analysis with missing data. Journal of the American Statistical Association, 109(507):1159–1173, 2014.
  16. P. Han and L. Wang. Estimation with missing data: beyond double robustness. Biometrika, 100(2):417–430, 2013.
  17. Demystifying double robustness: A comparison of alternative strategies for estimating a population mean from incomplete data. Statistical Science, 22(4):523–539, 2007.
  18. Collaborative causal inference on distributed data. arXiv preprint arXiv:2208.07898, 2022.
  19. Minimax rates for heterogeneous causal effect estimation. arXiv preprint arXiv:2203.00837, 2022.
  20. Demystifying a class of multiply robust estimators. Biometrika, 107(4):919–933, 2020.
  21. J. Neyman. On the application of probability theory to agricultural experiments. Statistical Science, 5(5):463–480, 1923.
  22. J. Qin. Inferences for case-control and semiparametric two-sample density ratio models. Biometrika, 85(3):619–630, 1998.
  23. D. B. Rubin. Estimating causal effects of treatments in randomized and nonrandomized studies. Journal of Educational Psychology, 66(5):688, 1974.
  24. D. B. Rubin. Randomization analysis of experimental data: the fisher randomization test comment. Journal of the American Statistical Association, 75(371):591–593, 1980.
  25. E. Tipton. Improving generalizations from experiments using propensity score subclassification: Assumptions, properties, and contexts. Journal of Educational and Behavioral Statistics, 38(3):239–266, 2013.
  26. Federated estimation of causal effects from observational data. arXiv preprint arXiv:2106.00456, 2021.
  27. Federated causal inference in heterogeneous observational data. arXiv preprint arXiv:2107.11732, 2021.
  28. S. Yang and P. Ding. Combining multiple observational data sources to estimate causal effects. Journal of the American Statistical Association, 115(531):1540–1554, 2020.
  29. Y. Yang. Adaptive estimation in pattern recognition by combining different procedures. Statistica Sinica, pages 1069–1089, 2000.
  30. Y. Yang. Combining forecasting procedures: Some theoretical results. Econometric Theory, 20(1):176–222, 2004.
  31. Efficient generalization and transportation. arXiv preprint arXiv:2302.00092, 2023.
Citations (8)

Summary

We haven't generated a summary for this paper yet.