GENEVIC: GENetic data Exploration and Visualization via Intelligent interactive Console (2404.04299v1)
Abstract: Summary: The vast generation of genetic data poses a significant challenge in efficiently uncovering valuable knowledge. Introducing GENEVIC, an AI-driven chat framework that tackles this challenge by bridging the gap between genetic data generation and biomedical knowledge discovery. Leveraging generative AI, notably ChatGPT, it serves as a biologist's 'copilot'. It automates the analysis, retrieval, and visualization of customized domain-specific genetic information, and integrates functionalities to generate protein interaction networks, enrich gene sets, and search scientific literature from PubMed, Google Scholar, and arXiv, making it a comprehensive tool for biomedical research. In its pilot phase, GENEVIC is assessed using a curated database that ranks genetic variants associated with Alzheimer's disease, schizophrenia, and cognition, based on their effect weights from the Polygenic Score Catalog, thus enabling researchers to prioritize genetic variants in complex diseases. GENEVIC's operation is user-friendly, accessible without any specialized training, secured by Azure OpenAI's HIPAA-compliant infrastructure, and evaluated for its efficacy through real-time query testing. As a prototype, GENEVIC is set to advance genetic research, enabling informed biomedical decisions. Availability and implementation: GENEVIC is publicly accessible at https://genevic-anath2024.streamlit.app. The underlying code is open-source and available via GitHub at https://github.com/anath2110/GENEVIC.git.
- GPT-4 Technical Report, December 2023. URL https://arxiv.org/abs/2303.08774v4.
- A guide to performing Polygenic Risk Score analyses. Nature Protocols, 15(9):2759, September 2020. doi: 10.1038/S41596-020-0353-1. URL https://www.nature.com/articles/s41596-020-0353-1.
- J. Fraenkel and B. Grofman. Strategic Voting and Coalitions. Public Choice, 28(1):1–15, 2014. doi: doi.org/10.1007/BF01718454. URL https://doi.org/10.1007/BF01718454.
- Toward a Conversational Agent to Support the Self-Management of Adults and Young Adults With Sickle Cell Disease: Usability and Usefulness Study. Frontiers in Digital Health, 3:600333, January 2021. doi: 10.3389/FDGTH.2021.600333. URL https://pubmed.ncbi.nlm.nih.gov/34713087/.
- GeneGPT: Augmenting Large Language Models with Domain Tools for Improved Access to Biomedical Information, May 2023. URL http://arxiv.org/abs/2304.09667.
- The Polygenic Score Catalog as an open database for reproducibility and systematic evaluation. Nature Genetics, 53(4):420–425, March 2021. doi: 10.1038/s41588-021-00783-5. URL https://www.nature.com/articles/s41588-021-00783-5.
- quincunx: an R package to query, download and wrangle PGS Catalog data, journal = Bioinformatics. 38(1):294–296, December 2021. doi: 10.1093/BIOINFORMATICS/BTAB522. URL https://pubmed.ncbi.nlm.nih.gov/34270693/.
- The evaluation of chatbot as a tool for health literacy education among undergraduate students. Education and Information Technologies, 26(5):6033–6049, May 2021. doi: 10.1007/S10639-021-10542-Y. URL https://pubmed.ncbi.nlm.nih.gov/34054328/.
- Object Hallucination in Image Captioning. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, pages 4035–4045, October–November 2018. doi: 10.18653/v1/d18-1437. URL https://aclanthology.org/D18-1437/.
- ANNOVAR: Functional annotation of genetic variants from next-generation sequencing data Nucleic Acids Research. Nucleic Acids Research, 38(16):e164, September 2010. doi: doi.org/10.1007/BF01718454. URL https://doi.org/10.1007/BF01718454.
- dplyr: A Grammar of Data Manipulation, November 2023. URL https://dplyr.tidyverse.org. R package version 1.1.4, https://github.com/tidyverse/dplyr.
- Y. Xiao and W. Y. Wang. On Hallucination and Predictive Uncertainty in Conditional Language Generation. In Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics, pages 2734–2744, April 2021. doi: 10.18653/v1/2021.eacl-main.236. URL https://aclanthology.org/2021.eacl-main.236.pdf.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.