
Federated Learning for Estimating Heterogeneous Treatment Effects (2402.17705v2)

Published 27 Feb 2024 in cs.LG

Abstract: Machine learning methods for estimating heterogeneous treatment effects (HTE) facilitate large-scale personalized decision-making across domains such as healthcare, policy making, and education. Current machine learning approaches for HTE require access to substantial amounts of data per treatment, and the high costs associated with interventions make centrally collecting that much data for each intervention a formidable challenge. To overcome this obstacle, we propose a novel framework for collaborative learning of HTE estimators across institutions via Federated Learning. We show that even under a diversity of interventions and subject populations across clients, one can jointly learn a common feature representation while concurrently and privately learning the specific predictive functions for outcomes under distinct interventions across institutions. Our framework and the associated algorithm are based on this insight, and leverage tabular transformers to map multiple input data to feature representations, which are then used for outcome prediction via multi-task learning. We also propose a novel method for federated training of personalized transformers that can work with heterogeneous input feature spaces. Experimental results on real-world clinical trial data demonstrate the effectiveness of our method.
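The split described in the abstract, a jointly learned feature representation with privately learned outcome functions per intervention, can be illustrated with a minimal sketch. This is not the paper's algorithm: it swaps in a linear encoder for the tabular transformer, uses plain sample-weighted averaging of the encoder on the server, and assumes every client shares the same input feature dimension and a binary treatment. All names here (`local_update`, `federated_round`, `estimate_hte`, the `clients` dictionaries) are hypothetical and chosen for illustration only.

```python
import numpy as np


def local_update(W, heads, X, t, y, lr=0.05, steps=20):
    """Run a few gradient steps on one client's data.

    W:     (d, k) shared linear encoder (assumption: stands in for a transformer)
    heads: (2, k) private outcome heads, row 0 = control, row 1 = treated
    X:     (n, d) covariates, t: (n,) integer treatment in {0, 1}, y: (n,) outcomes
    Returns updated copies; the inputs are not modified.
    """
    W = W.copy()
    heads = heads.copy()
    n = len(y)
    for _ in range(steps):
        Z = X @ W                              # (n, k) shared representation
        pred = np.sum(Z * heads[t], axis=1)    # each row uses the head of its arm
        err = pred - y
        # Gradients of 0.5 * mean squared error.
        grad_W = X.T @ (err[:, None] * heads[t]) / n
        grad_heads = np.zeros_like(heads)
        for arm in (0, 1):
            mask = t == arm
            grad_heads[arm] = (err[mask, None] * Z[mask]).sum(axis=0) / n
        W -= lr * grad_W
        heads -= lr * grad_heads
    return W, heads


def federated_round(W, clients):
    """One communication round: only the encoder leaves each client."""
    local_encoders = []
    for c in clients:
        W_c, c["heads"] = local_update(W, c["heads"], c["X"], c["t"], c["y"])
        local_encoders.append(W_c)             # heads stay on the client
    weights = np.array([len(c["y"]) for c in clients], dtype=float)
    weights /= weights.sum()
    # Sample-size-weighted average of the client encoders.
    return sum(w * W_c for w, W_c in zip(weights, local_encoders))


def estimate_hte(W, heads, X):
    """Estimated effect per row: treated-head prediction minus control-head prediction."""
    return (X @ W) @ (heads[1] - heads[0])
```

In this sketch each client would call `estimate_hte` with the averaged encoder and its own private heads, so outcome models for a client's specific intervention are never shared. Handling heterogeneous input feature spaces, which the abstract highlights, is outside what this linear stand-in covers.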

