FedMM: Federated Multi-Modal Learning with Modality Heterogeneity in Computational Pathology (2402.15858v1)
Abstract: The fusion of complementary multimodal information is crucial in computational pathology for accurate diagnostics. However, existing multimodal learning approaches necessitate access to users' raw data, posing substantial privacy risks. While Federated Learning (FL) serves as a privacy-preserving alternative, it falls short in addressing the challenges posed by heterogeneous (yet possibly overlapped) modalities data across various hospitals. To bridge this gap, we propose a Federated Multi-Modal (FedMM) learning framework that federatedly trains multiple single-modal feature extractors to enhance subsequent classification performance instead of existing FL that aims to train a unified multimodal fusion model. Any participating hospital, even with small-scale datasets or limited devices, can leverage these federated trained extractors to perform local downstream tasks (e.g., classification) while ensuring data privacy. Through comprehensive evaluations of two publicly available datasets, we demonstrate that FedMM notably outperforms two baselines in accuracy and AUC metrics.
- “Multimodal integration of radiology, pathology and genomics for prediction of response to PD-(L) 1 blockade in patients with non-small cell lung cancer” In Nature cancer 3.10 Nature Publishing Group US New York, 2022, pp. 1151–1164
- “Artificial intelligence for multimodal data integration in oncology” In Cancer cell 40.10 Elsevier, 2022, pp. 1095–1110
- “Enhancing the prediction of disease–gene associations with multimodal deep learning” In Bioinformatics 35.19 Oxford University Press, 2019, pp. 3735–3742
- “Multimodal genomic features predict outcome of immune checkpoint blockade in non-small-cell lung cancer” In Nature cancer 1.1 Nature Publishing Group US New York, 2020, pp. 99–111
- “Lung cancer subtype diagnosis by fusing image-genomics data and hybrid deep networks” In IEEE/ACM Transactions on Computational Biology and Bioinformatics IEEE, 2021
- “MGNN: A multimodal graph neural network for predicting the survival of cancer patients” In Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2020, pp. 1697–1700
- “Federated learning for smart healthcare: A survey” In ACM Computing Surveys (CSUR) 55.3 ACM New York, NY, 2022, pp. 1–37
- “Communication-efficient learning of deep networks from decentralized data” In Artificial intelligence and statistics, 2017, pp. 1273–1282 PMLR
- “Federated learning and differential privacy for medical image analysis” In Scientific reports 12.1 Nature Publishing Group UK London, 2022, pp. 1953
- “Rethinking architecture design for tackling data heterogeneity in federated learning” In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022, pp. 10061–10071
- “Prospective assessment of breast cancer risk from multimodal multiview ultrasound images via clinically applicable deep learning” In Nature biomedical engineering 5.6 Nature Publishing Group UK London, 2021, pp. 522–532
- “Multimodal analysis of cell-free DNA whole-genome sequencing for pediatric cancers with low mutational burden” In Nature communications 12.1 Nature Publishing Group UK London, 2021, pp. 3230
- “Pan-cancer integrative histology-genomic analysis via multimodal deep learning” In Cancer Cell 40.8 Elsevier, 2022, pp. 865–878
- “Multimodal biomedical AI” In Nature Medicine 28.9 Nature Publishing Group US New York, 2022, pp. 1773–1784
- “Decentralized federated learning through proxy model sharing” In Nature communications 14.1 Nature Publishing Group UK London, 2023, pp. 2899
- “Federated learning for predicting histological response to neoadjuvant chemotherapy in triple-negative breast cancer” In Nature medicine 29.1 Nature Publishing Group US New York, 2023, pp. 135–146
- “Federated learning for computational pathology on gigapixel whole slide images” In Medical image analysis 76 Elsevier, 2022, pp. 102298
- “Multi-site fMRI analysis using privacy-preserving federated learning and domain adaptation: ABIDE results” In Medical Image Analysis 65 Elsevier, 2020, pp. 101765
- “Fedbn: Federated learning on non-iid features via local batch normalization” In arXiv preprint arXiv:2102.07623, 2021
- “Federated learning on non-iid data silos: An experimental study” In 2022 IEEE 38th International Conference on Data Engineering (ICDE), 2022, pp. 965–978 IEEE
- “Rethinking data heterogeneity in federated learning: Introducing a new notion and standard benchmarks” In IEEE Transactions on Artificial Intelligence IEEE, 2023
- “Multimodal Federated Learning via Contrastive Representation Ensemble” In arXiv preprint arXiv:2302.08888, 2023
- “FedMSplit: Correlation-adaptive federated multi-task learning across multimodal split networks” In Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2022, pp. 87–96
- “Harmony: Heterogeneous Multi-Modal Federated Learning through Disentangled Model Training” In Proceedings of the 21st Annual International Conference on Mobile Systems, Applications and Services, 2023, pp. 530–543
- “Federated optimization in heterogeneous networks” In Proceedings of Machine learning and systems 2, 2020, pp. 429–450
- “Fedproto: Federated prototype learning across heterogeneous clients” In Proceedings of the AAAI Conference on Artificial Intelligence 36.8, 2022, pp. 8432–8440
- “Rethinking Federated Learning With Domain Shift: A Prototype View” In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023, pp. 16312–16322
- Philip Chikontwe, Soopil Kim and Sang Hyun Park “CAD: Co-Adapting Discriminative Features for Improved Few-Shot Classification” In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022, pp. 14554–14563
- “Multiple instance learning: A survey of problem characteristics and applications” In Pattern Recognition 77 Elsevier, 2018, pp. 329–353
- “Deep Residual Learning for Image Recognition” In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016
- “Imagenet: A large-scale hierarchical image database” In 2009 IEEE conference on computer vision and pattern recognition, 2009, pp. 248–255 Ieee
- Maximilian Ilse, Jakub Tomczak and Max Welling “Attention-based deep multiple instance learning” In International conference on machine learning, 2018, pp. 2127–2136 PMLR
- Yun Wang, Juncheng Li and Florian Metze “A comparison of five multiple instance learning pooling functions for sound event detection with weak labeling” In ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2019, pp. 31–35 IEEE
- “GISTIC2. 0 facilitates sensitive and confident localization of the targets of focal somatic copy-number alteration in human cancers” In Genome biology 12 Springer, 2011, pp. 1–14
- “Self-normalizing neural networks” In Advances in neural information processing systems 30, 2017
- “Pathomic fusion: an integrated framework for fusing histopathology and genomic features for cancer diagnosis and prognosis” In IEEE Transactions on Medical Imaging 41.4 IEEE, 2020, pp. 757–770
- Yuanzhe Peng (4 papers)
- Jieming Bian (17 papers)
- Jie Xu (467 papers)