Path-GPTOmic: A Balanced Multi-modal Learning Framework for Survival Outcome Prediction (2403.11375v1)
Abstract: For predicting cancer survival outcomes, standard approaches in clinical research are often based on two main modalities: pathology images for observing cell morphology features, and genomic (e.g., bulk RNA-seq) for quantifying gene expressions. However, existing pathology-genomic multi-modal algorithms face significant challenges: (1) Valuable biological insights regarding genes and gene-gene interactions are frequently overlooked; (2) one modality often dominates the optimization process, causing inadequate training for the other modality. In this paper, we introduce a new multi-modal ``Path-GPTOmic" framework for cancer survival outcome prediction. First, to extract valuable biological insights, we regulate the embedding space of a foundation model, scGPT, initially trained on single-cell RNA-seq data, making it adaptable for bulk RNA-seq data. Second, to address the imbalance-between-modalities problem, we propose a gradient modulation mechanism tailored to the Cox partial likelihood loss for survival prediction. The contributions of the modalities are dynamically monitored and adjusted during the training process, encouraging that both modalities are sufficiently trained. Evaluated on two TCGA(The Cancer Genome Atlas) datasets, our model achieves substantially improved survival prediction accuracy.
- “scGPT: Towards building a foundation model for single-cell multi-omics using generative AI,” bioRxiv, pp. 2023–04, 2023.
- “From bulk, single-cell to spatial RNA sequencing,” International Journal of Oral Science, vol. 13, no. 1, pp. 36, 2021.
- “Pathomic fusion: An integrated framework for fusing histopathology and genomic features for cancer diagnosis and prognosis,” IEEE Transactions on Medical Imaging, vol. 41, no. 4, pp. 757–770, 2020.
- “Pan-cancer integrative histology-genomic analysis via multimodal deep learning,” Cancer Cell, vol. 40, no. 8, pp. 865–878, 2022.
- “Multimodal co-attention Transformer for survival prediction in gigapixel whole slide images,” in Proceedings of the IEEE/CVF International Conference on Computer Vision, 2021, pp. 4015–4025.
- “Pathology-and-genomics multimodal Transformer for survival outcome prediction,” in International Conference on Medical Image Computing and Computer-Assisted Intervention. Springer, 2023, pp. 622–631.
- “Deep learning with multimodal representation for pancancer prognosis prediction,” Bioinformatics, vol. 35, no. 14, pp. i446–i454, 2019.
- “Balanced multimodal learning via on-the-fly gradient modulation,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022, pp. 8238–8247.
- “MMCosine: Multi-modal cosine loss towards balanced audio-visual fine-grained learning,” in ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2023, pp. 1–5.
- “Science forum: The human cell atlas,” eLife, vol. 6, pp. e27041, 2017.
- “Smooth image-to-image translations with latent space interpolations,” arXiv preprint arXiv:2210.00841, 2022.
- “Unlabeled data guided semi-supervised histopathology image segmentation,” in 2020 IEEE International Conference on Bioinformatics and Biomedicine (BIBM). IEEE, 2020, pp. 815–820.
- “On adversarial mixup resynthesis,” Advances in Neural Information Processing Systems, vol. 32, 2019.
- “Cox-nnet: An artificial neural network method for prognosis prediction of high-throughput omics data,” PLoS Computational Biology, vol. 14, no. 4, pp. e1006076, 2018.
- “Tokens-to-Token ViT: Training Vision Transformers from scratch on ImageNet,” in Proceedings of the IEEE/CVF International Conference on Computer Vision, 2021, pp. 558–567.
- “Smoothing the disentangled latent style space for unsupervised image-to-image translation,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2021, pp. 10785–10794.
- “Latent space smoothing for individually fair representations,” in European Conference on Computer Vision. Springer, 2022, pp. 535–554.
- “MixMatch: A holistic approach to semi-supervised learning,” Advances in Neural Information Processing Systems, vol. 32, 2019.
- “Predicting cancer outcomes from histology and genomics using convolutional networks,” Proceedings of the National Academy of Sciences, vol. 115, no. 13, pp. E2970–E2979, 2018.
- Hongxiao Wang (10 papers)
- Yang Yang (883 papers)
- Zhuo Zhao (12 papers)
- Pengfei Gu (20 papers)
- Nishchal Sapkota (10 papers)
- Danny Z. Chen (72 papers)