WGTDA: A Topological Perspective to Biomarker Discovery in Gene Expression Data
Abstract: Discovering prognostic cancer biomarkers is crucial for understanding disease mechanisms, refining treatment plans, and improving patient outcomes. This study introduces Weighted Gene Topological Data Analysis (WGTDA), a framework that uses topological principles to identify gene interactions and distinctive biomarker features. We evaluate WGTDA against Weighted Gene Co-expression Network Analysis (WGCNA) and find that topology-based biomarkers are more reliable predictors of survival probability than WGCNA's hub genes. Furthermore, WGTDA identifies gene signatures that are significantly associated with survival probability, irrespective of whether expression is above or below the median. WGTDA thus offers a new perspective on biomarker discovery: it uncovers intricate gene-to-gene relationships that conventional correlation-based analyses often overlook, which highlights the potential advantage of using topological concepts to extract information about gene-gene interactions.
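The abstract refers to survival probability for patients whose expression of a gene lies above or below the median. As a minimal illustration of that kind of comparison, and not of the WGTDA method itself, the sketch below splits a toy cohort at the median expression of a single hypothetical gene and computes a Kaplan-Meier survival estimate for each group. The function names, the expression values, the follow-up times, and the event flags are all invented for illustration.

```python
from statistics import median


def median_split(expression):
    """Split sample indices into (high, low) groups at the median expression."""
    cutoff = median(expression)
    high = [i for i, x in enumerate(expression) if x > cutoff]
    low = [i for i, x in enumerate(expression) if x <= cutoff]
    return high, low


def kaplan_meier(times, events):
    """Kaplan-Meier estimate as a list of (time, survival probability).

    times  -- follow-up time per patient
    events -- 1 if the event (death) was observed, 0 if censored
    """
    at_risk = len(times)
    survival = 1.0
    curve = []
    for t in sorted(set(times)):
        deaths = sum(1 for ti, e in zip(times, events) if ti == t and e)
        leaving = sum(1 for ti in times if ti == t)  # deaths plus censored at t
        if deaths:
            survival *= 1 - deaths / at_risk
            curve.append((t, survival))
        at_risk -= leaving
    return curve


# Toy data (hypothetical): one gene's expression, follow-up times, event flags.
expression = [2.0, 5.0, 1.0, 7.0, 3.0, 6.0]
times = [5, 12, 3, 20, 8, 15]
events = [1, 1, 1, 0, 1, 1]

high, low = median_split(expression)
curve_high = kaplan_meier([times[i] for i in high], [events[i] for i in high])
curve_low = kaplan_meier([times[i] for i in low], [events[i] for i in low])

print("high expression:", curve_high)
print("low expression:", curve_low)
```

In practice the two curves would be compared with a significance test such as the log-rank test, typically through an established survival-analysis library rather than hand-written code.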