Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
156 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Factorized Discriminant Analysis for Genetic Signatures of Neuronal Phenotypes (2010.02171v7)

Published 5 Oct 2020 in q-bio.QM and cs.LG

Abstract: Navigating the complex landscape of single-cell transcriptomic data presents significant challenges. Central to this challenge is the identification of a meaningful representation of high-dimensional gene expression patterns that sheds light on the structural and functional properties of cell types. Pursuing model interpretability and computational simplicity, we often look for a linear transformation of the original data that aligns with key phenotypic features of cells. In response to this need, we introduce factorized linear discriminant analysis (FLDA), a novel method for linear dimensionality reduction. The crux of FLDA lies in identifying a linear function of gene expression levels that is highly correlated with one phenotypic feature while minimizing the influence of others. To augment this method, we integrate it with a sparsity-based regularization algorithm. This integration is crucial as it selects a subset of genes pivotal to a specific phenotypic feature or a combination thereof. To illustrate the effectiveness of FLDA, we apply it to transcriptomic datasets from neurons in the Drosophila optic lobe. We demonstrate that FLDA not only captures the inherent structural patterns aligned with phenotypic features but also uncovers key genes associated with each phenotype.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (41)
  1. Highly parallel genome-wide expression profiling of individual cells using nanoliter droplets. Cell, 161(5):1202–1214, May 2015.
  2. Adult mouse cortical cell taxonomy revealed by single cell transcriptomics. Nat. Neurosci., 19(2):335–346, February 2016.
  3. COMPREHENSIVE CLASSIFICATION OF RETINAL BIPOLAR NEURONS BY SINGLE-CELL TRANSCRIPTOMICS. Cell, 166(5):1308–1323.e30, August 2016.
  4. Shared and distinct transcriptomic cell types across neocortical areas. Nature, 563(7729):72–78, November 2018.
  5. Molecular Classification and Comparative Taxonomics of Foveal and Peripheral Cells in Primate Retina. Cell, 176(5):1222–1237.e22, February 2019.
  6. The types of retinal ganglion cells: Current status and implications for neuronal classification. Annu. Rev. Neurosci., 38:221–246, July 2015.
  7. Neuronal cell-type classification: Challenges, opportunities and the path forward. Nature Reviews Neuroscience, 18(9):530–546, September 2017.
  8. Electrophysiological, transcriptomic and morphologic profiling of single neurons using Patch-seq. Nature Biotechnology, 34(2):199–203, February 2016.
  9. Placing RNA in context and space – methods for spatially resolved transcriptomics. The FEBS Journal, 286(8):1468–1481, 2019.
  10. Classification of electrophysiological and morphological neuron types in the mouse visual cortex. Nat. Neurosci., 22(7):1182–1195, July 2019.
  11. Ronald Aylmer Fisher. The Correlation between Relatives on the Supposition of Mendelian Inheritance. Royal Society of Edinburgh], 1918.
  12. Modular transcriptional programs separately define axon and dendrite connectivity. eLife, 8:e50822, November 2019.
  13. Eigenvalue and Generalized Eigenvalue Problems: Tutorial. arXiv:1903.11240 [cs, stat], March 2019. Comment: 8 pages, Tutorial paper.
  14. Jerome H. Friedman. Regularized Discriminant Analysis. Journal of the American Statistical Association, 84(405):165–175, March 1989.
  15. Comparison of Discrimination Methods for the Classification of Tumors Using Gene Expression Data. Journal of the American Statistical Association, 97(457):77–87, March 2002.
  16. Some theory for Fisher’s linear discriminant function, ‘naive Bayes’, and some alternatives when there are many more variables than observations. Bernoulli, 10(6):989–1010, December 2004.
  17. Class Prediction by Nearest Shrunken Centroids, with Applications to DNA Microarrays. Statist. Sci., 18(1):104–117, February 2003.
  18. Integrating single-cell transcriptomic data across different conditions, technologies, and species. Nature Biotechnology, 36(5):411–420, May 2018.
  19. Comprehensive Integration of Single-Cell Data. Cell, 177(7):1888–1902.e21, June 2019.
  20. Evolution of neuronal cell classes and types in the vertebrate retina. bioRxiv, 2023.
  21. Sparse generalized eigenvalue problem: Optimal statistical rates via truncated Rayleigh flow. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 80(5):1057–1086, 2018.
  22. Linear Dimensionality Reduction: Survey, Insights, and Generalizations. Journal of Machine Learning Research, 16(89):2859–2900, 2015.
  23. A comparison for dimensionality reduction methods of single-cell rna-seq data. Frontiers in Genetics, 12:646936, 2021.
  24. The dynamics and regulators of cell fate decisions are revealed by pseudotemporal ordering of single cells. Nature Biotechnology, 32(4):381–386, 2014.
  25. I. T. Jolliffe. Principal Component Analysis. Springer Series in Statistics. Springer-Verlag, New York, second edition, 2002.
  26. Independent Component Analysis. Wiley-Interscience, New York, 1st edition edition, May 2001.
  27. C. Spearman. "General Intelligence," Objectively Determined and Measured. The American Journal of Psychology, 15(2):201–292, 1904.
  28. R. A. Fisher. The Use of Multiple Measurements in Taxonomic Problems. Annals of Eugenics, 7(2):179–188, 1936.
  29. Geoffrey McLachlan. Discriminant Analysis and Statistical Pattern Recognition. Wiley-Interscience, Hoboken, N.J, August 2004.
  30. Harold Hotelling. RELATIONS BETWEEN TWO SETS OF VARIATES. Biometrika, 28(3-4):321–377, December 1936.
  31. On Deep Multi-View Representation Learning: Objectives and Optimization. arXiv:1602.01024 [cs], February 2016.
  32. Laurens van der Maaten and Geoffrey Hinton. Visualizing data using t-sne. Journal of Machine Learning Research, 9(Nov):2579–2605, 2008.
  33. Umap: Uniform manifold approximation and projection for dimension reduction. arXiv preprint arXiv:1802.03426, 2018.
  34. Peng Qiu. Embracing the dropouts in single-cell rna-seq analysis. Nature Communications, 11:1169, 2020.
  35. Single-cell profiles of retinal neurons differing in resilience to injury reveal neuroprotective genes. bioRxiv, page 711762, July 2019.
  36. Detection of high variability in gene expression from single-cell RNA-seq profiling. BMC Genomics, 17 Suppl 7:508, August 2016.
  37. Comprehensive Identification and Spatial Mapping of Habenular Neuronal Types Using Single-Cell RNA-Seq. Curr. Biol., 28(7):1052–1065.e7, April 2018.
  38. Learning deep disentangled embeddings with the F-statistic loss. In Proceedings of the 32nd International Conference on Neural Information Processing Systems, NIPS’18, pages 185–194, Red Hook, NY, USA, December 2018. Curran Associates Inc.
  39. Development of Concurrent Retinotopic Maps in the Fly Motion Detection Circuit. Cell, 173(2):485–498.e11, April 2018.
  40. D. Mendelejew. Über die beziehungen der eigenschaften zu den atomgewichten der elemente. Zeitschrift für Chemie, 12:405–406, 1869.
  41. Bayesian representation learning with oracle constraints. arXiv:1506.05011 [cs, stat], March 2016. Comment: 16 pages, publishes in ICLR 16.
Citations (1)

Summary

We haven't generated a summary for this paper yet.