Joint Analysis of Single-Cell Data across Cohorts with Missing Modalities
Abstract: Joint analysis of multi-omic single-cell data across cohorts has significantly enhanced the comprehensive analysis of cellular processes. However, most of the existing approaches for this purpose require access to samples with complete modality availability, which is impractical in many real-world scenarios. In this paper, we propose (Single-Cell Cross-Cohort Cross-Category) integration, a novel framework that learns unified cell representations under domain shift without requiring full-modality reference samples. Our generative approach learns rich cross-modal and cross-domain relationships that enable imputation of these missing modalities. Through experiments on real-world multi-omic datasets, we demonstrate that offers a robust solution to single-cell tasks such as cell type clustering, cell type classification, and feature imputation.
- MultiVI: deep generative model for the integration of multimodal data. Nature Methods 20, 8 (2023), 1222–1231.
- The technological landscape and applications of single-cell multi-omics. Nature Reviews Molecular Cell Biology (2023), 1–19.
- Johannes Braams. 1991. Babel, a Multilingual Style-Option System for Use with LaTeX’s Standard Document Styles. TUGboat 12, 2 (June 1991), 291–301.
- Single-cell chromatin accessibility reveals principles of regulatory variation. Nature 523, 7561 (2015), 486–490.
- Integrating single-cell transcriptomic data across different conditions, technologies, and species. Nature biotechnology 36, 5 (2018), 411–420.
- Manifold alignment for heterogeneous single-cell multi-omics data integration using Pamona. Bioinformatics 38, 1 (2022), 211–219.
- Scotv2: Single-cell multiomic alignment with disproportionate cell-type representation. Journal of Computational Biology 29, 11 (2022), 1213–1228.
- Missing Modality Transfer Learning via Latent Low-Rank Constraint. IEEE Transactions on Image Processing 24, 11 (2015), 4322–4334. https://doi.org/10.1109/TIP.2015.2462023
- Joint probabilistic modeling of single-cell multi-omic data with totalVI. Nature methods 18, 3 (2021), 272–282.
- Cobolt: integrative analysis of multimodal single-cell sequencing data. Genome biology 22, 1 (2021), 1–21.
- Batch effects in single-cell RNA-sequencing data are corrected by matching mutual nearest neighbors. Nature biotechnology 36, 5 (2018), 421–427.
- Diederik P Kingma and Max Welling. 2013. Auto-encoding variational bayes. arXiv preprint arXiv:1312.6114 (2013).
- Enhancing modality-agnostic representations via meta-learning for brain tumor segmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 21415–21425.
- April R Kriebel and Joshua D Welch. 2022. UINMF performs mosaic integration of single-cell multi-omic datasets using nonnegative matrix factorization. Nature communications 13, 1 (2022), 780.
- Deep learning enables accurate clustering with batch effect removal in single-cell RNA-seq analysis. Nature communications 11, 1 (2020), 2338.
- Multigrate: single-cell multi-omic data integration. bioRxiv (2022). https://doi.org/10.1101/2022.03.16.484643 arXiv:https://www.biorxiv.org/content/early/2022/03/17/2022.03.16.484643.full.pdf
- A sandbox for prediction and integration of DNA, RNA, and proteins in single cells. In 35th Conference on Neural Information Processing Systems (NeurIPS 2021) Track on Datasets and Benchmarks.
- Benchmarking atlas-level data integration in single-cell genomics. Nature methods 19, 1 (2022), 41–50.
- Smil: Multimodal learning with severely missing modality. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 35. 2302–2310.
- UMAP: Uniform Manifold Approximation and Projection. Journal of Open Source Software 3, 29 (2018), 861.
- Learning across diverse biomedical data modalities and cohorts: Challenges and opportunities for innovation. Patterns (2024).
- Patchwork Learning: A Paradigm Towards Integrative Analysis across Diverse Biomedical Data Sources. arXiv preprint arXiv:2305.06217 (2023).
- Simultaneous epitope and transcriptome measurement in single cells. Nature methods 14, 9 (2017), 865–868.
- MultiModN- Multimodal, Multi-Task, Interpretable Modular Networks. (2023).
- Mike Wu and Noah Goodman. 2018. Multimodal generative models for scalable weakly-supervised learning. Advances in neural information processing systems 31 (2018).
- SMILE: mutual information learning for integration of single-cell omics data. Bioinformatics 38, 2 (2022), 476–486.
- scMoMaT jointly performs single cell mosaic integration and multi-modal bio-marker detection. Nature Communications 14, 1 (2023), 384.
- Single-cell multi-omic topic embedding reveals cell-type-specific and COVID-19 severity-related immune signatures. Cell Reports Methods. https://pubmed.ncbi.nlm.nih.gov/36778483/
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.