
CAVACHON: a hierarchical variational autoencoder to integrate multi-modal single-cell data (2405.18655v1)

Published 28 May 2024 in cs.LG, cs.AI, and q-bio.GN

Abstract: Paired single-cell sequencing technologies enable the simultaneous measurement of complementary modalities of molecular data at single-cell resolution. Along with the advances in these technologies, many methods based on variational autoencoders have been developed to integrate these data. However, these methods do not explicitly incorporate prior biological relationships between the data modalities, which could significantly enhance modeling and interpretation. We propose a novel probabilistic learning framework that explicitly incorporates conditional independence relationships between multi-modal data as a directed acyclic graph using a generalized hierarchical variational autoencoder. We demonstrate the versatility of our framework across various applications pertinent to single-cell multi-omics data integration. These include the isolation of common and distinct information from different modalities, modality-specific differential analysis, and integrated cell clustering. We anticipate that the proposed framework can facilitate the construction of highly flexible graphical models that can capture the complexities of biological hypotheses and unravel the connections between different biological data types, such as different modalities of paired single-cell multi-omics data. The implementation of the proposed framework can be found in the repository https://github.com/kuijjerlab/CAVACHON.
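To make the phrase "conditional independence relationships ... as a directed acyclic graph" concrete, the block below sketches one generic way a hierarchical variational autoencoder can factorize over a graph of modalities. It is an illustration under stated assumptions, not the formulation from the paper: the symbols x_m (observed data for modality m), z_m (its latent variable), pa(m) (the parents of m in the graph), and the choice to condition each modality on both its own latent and its parents' latents are all assumed here for the sake of the example.

```latex
% Illustrative sketch only; notation and conditioning structure are assumptions.
% Generative model: each latent depends on the latents of its parent modalities.
p_\theta(x_{1:M}, z_{1:M})
  = \prod_{m=1}^{M} p_\theta\big(z_m \mid z_{\mathrm{pa}(m)}\big)\,
                    p_\theta\big(x_m \mid z_m, z_{\mathrm{pa}(m)}\big)

% Approximate posterior factorized along the same graph.
q_\phi(z_{1:M} \mid x_{1:M})
  = \prod_{m=1}^{M} q_\phi\big(z_m \mid x_{1:M}, z_{\mathrm{pa}(m)}\big)

% Resulting evidence lower bound: one reconstruction term and one KL term per modality.
\mathcal{L}(\theta, \phi)
  = \mathbb{E}_{q_\phi(z_{1:M} \mid x_{1:M})}
      \Big[ \sum_{m=1}^{M} \log p_\theta\big(x_m \mid z_m, z_{\mathrm{pa}(m)}\big) \Big]
  - \sum_{m=1}^{M} \mathbb{E}_{q_\phi(z_{\mathrm{pa}(m)} \mid x_{1:M})}
      \Big[ \mathrm{KL}\big( q_\phi(z_m \mid x_{1:M}, z_{\mathrm{pa}(m)})
            \,\big\|\, p_\theta(z_m \mid z_{\mathrm{pa}(m)}) \big) \Big]
```

Under this assumed structure, a modality with no parents gets an unconditional prior p(z_m), so its latent can only carry information shared downstream, while the latent of a child modality is pushed to encode what its parents do not already explain. That is one plausible reading of how "common and distinct information" could be separated; the repository linked in the abstract is the authority on what the framework actually does.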

