A clustering and graph deep learning-based framework for COVID-19 drug repurposing (2306.13995v1)
Abstract: Drug repurposing (or repositioning) is the process of finding new therapeutic uses for drugs already approved by drug regulatory authorities (e.g., the Food and Drug Administration (FDA) and Therapeutic Goods Administration (TGA)) for other diseases. This involves analyzing the interactions between different biological entities, such as drug targets (genes/proteins and biological pathways) and drug properties, to discover novel drug-target or drug-disease relations. Artificial intelligence methods such as machine learning and deep learning have successfully analyzed complex heterogeneous data in the biomedical domain and have also been used for drug repurposing. This study presents a novel unsupervised machine learning framework that utilizes a graph-based autoencoder for multi-feature type clustering on heterogeneous drug data. The dataset consists of 438 drugs, of which 224 are under clinical trials for COVID-19 (category A). The rest are systematically filtered to ensure the safety and efficacy of the treatment (category B). The framework solely relies on reported drug data, including its pharmacological properties, chemical/physical properties, interaction with the host, and efficacy in different publicly available COVID-19 assays. Our machine-learning framework reveals three clusters of interest and provides recommendations featuring the top 15 drugs for COVID-19 drug repurposing, which were shortlisted based on the predicted clusters that were dominated by category A drugs. The anti-COVID efficacy of the drugs should be verified by experimental studies. Our framework can be extended to support other datasets and drug repurposing studies, given open-source code and data availability.
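The shortlisting step described in the abstract, keeping only the predicted clusters that are dominated by category A drugs, can be illustrated with a minimal sketch. The abstract does not state how "dominated" is defined, so the majority threshold of 0.5, the function name `category_a_dominated_clusters`, and the toy labels below are all assumptions made for illustration; this is not the authors' implementation.

```python
from collections import defaultdict


def category_a_dominated_clusters(cluster_labels, categories, threshold=0.5):
    """Return {cluster_id: share_of_category_A} for clusters whose share of
    category A drugs is strictly greater than `threshold`.

    `threshold=0.5` (simple majority) is an assumed definition of "dominated".
    """
    totals = defaultdict(int)    # number of drugs per cluster
    a_counts = defaultdict(int)  # number of category A drugs per cluster
    for cluster, category in zip(cluster_labels, categories):
        totals[cluster] += 1
        if category == "A":
            a_counts[cluster] += 1
    return {
        c: a_counts[c] / totals[c]
        for c in sorted(totals)
        if a_counts[c] / totals[c] > threshold
    }


# Hypothetical toy data: one predicted cluster label and one category per drug.
cluster_labels = [0, 0, 0, 1, 1, 1, 2, 2]
categories = ["A", "A", "B", "B", "B", "A", "A", "A"]

dominated = category_a_dominated_clusters(cluster_labels, categories)
print(dominated)
```

In this toy example, clusters 0 and 2 pass the assumed majority rule, so category B drugs falling in those clusters would be the candidates to inspect further. A stricter threshold would shrink the shortlist.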
Authors: Chaarvi Bansal, Rohitash Chandra, Vinti Agarwal, P. R. Deepa