Dynamics-based Feature Augmentation of Graph Neural Networks for Variant Emergence Prediction (2401.03390v2)
Abstract: During the COVID-19 pandemic, a major driver of new surges has been the emergence of new variants. When a new variant emerges in one or more countries, other nations monitor its spread in preparation for its potential arrival. The impact of the new variant and the timings of epidemic peaks in a country highly depend on when the variant arrives. The current methods for predicting the spread of new variants rely on statistical modeling, however, these methods work only when the new variant has already arrived in the region of interest and has a significant prevalence. Can we predict when a variant existing elsewhere will arrive in a given region? To address this question, we propose a variant-dynamics-informed Graph Neural Network (GNN) approach. First, we derive the dynamics of variant prevalence across pairs of regions (countries) that apply to a large class of epidemic models. The dynamics motivate the introduction of certain features in the GNN. We demonstrate that our proposed dynamics-informed GNN outperforms all the baselines, including the currently pervasive framework of Physics-Informed Neural Networks (PINNs). To advance research in this area, we introduce a benchmarking tool to assess a user-defined model's prediction performance across 87 countries and 36 variants.
- “World health organization covid-19 dashboard,” 2023.
- T. L. Wiemken, F. Khan, L. Puzniak, W. Yang, J. Simmering, P. Polgreen, J. L. Nguyen, L. Jodar, and J. M. McLaughlin, “Seasonal trends in covid-19 cases, hospitalizations, and mortality in the united states and europe,” Scientific Reports, vol. 13, Mar. 2023.
- “Tracking sars-cov-2 variants,” 2023.
- P. V. Markov, M. Ghafari, M. Beer, K. Lythgoe, P. Simmonds, N. I. Stilianakis, and A. Katzourakis, “The evolution of sars-cov-2,” Nature Reviews Microbiology, vol. 21, p. 361–379, Apr. 2023.
- A. S. Lambrou, P. Shirk, M. K. Steele, P. Paul, C. R. Paden, B. Cadwell, H. E. Reese, Y. Aoki, N. Hassell, X.-Y. Zheng, et al., “Genomic surveillance for sars-cov-2 variants: predominance of the delta (b. 1.617. 2) and omicron (b. 1.1. 529) variants—united states, june 2021–january 2022,” Morbidity and Mortality Weekly Report, vol. 71, no. 6, p. 206, 2022.
- J. Sun, X. Chen, Z. Zhang, S. Lai, B. Zhao, H. Liu, S. Wang, W. Huan, R. Zhao, M. T. A. Ng, et al., “Forecasting the long-term trend of covid-19 epidemic using a dynamic model,” Scientific reports, vol. 10, no. 1, p. 21122, 2020.
- H. Hu, H. Du, J. Li, Y. Wang, X. Wu, C. Wang, Y. Zhang, G. Zhang, Y. Zhao, W. Kang, et al., “Early prediction and identification for severe patients during the pandemic of covid-19: a severe covid-19 risk model constructed by multivariate logistic regression analysis,” Journal of Global Health, vol. 10, no. 2, 2020.
- J. T. Wu, K. Leung, and G. M. Leung, “Nowcasting and forecasting the potential domestic and international spread of the 2019-ncov outbreak originating in wuhan, china: a modelling study,” The lancet, vol. 395, no. 10225, pp. 689–697, 2020.
- D. A. Shah, E. D. De Wolf, P. A. Paul, and L. V. Madden, “Accuracy in the prediction of disease epidemics when ensembling simple but highly correlated models,” PLOS Computational Biology, vol. 17, pp. 1–23, 03 2021.
- S. Palaniappan, R. V, and B. David, “Prediction of epidemic disease dynamics on the infection risk using machine learning algorithms,” SN computer science, vol. 3, no. 1, p. 47, 2022.
- M. Raissi, P. Perdikaris, and G. E. Karniadakis, “Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations,” Journal of Computational Physics, vol. 378, pp. 686–707, 2019.
- E. B. Hodcroft, “Covariants: Sars-cov-2 mutations and variants of interest,” 2021.
- “Cdc variant classifications.” https://www.cdc.gov/coronavirus/2019-ncov/variants/variant-classifications.html, 2023. Centers for Disease Control and Prevention.
- L. J. Beesley, K. R. Moran, K. Wagh, L. A. Castro, J. Theiler, H. Yoon, W. Fischer, N. W. Hengartner, B. Korber, and S. Y. Del Valle, “Sars-cov-2 variant transition dynamics are associated with vaccination rates, number of co-circulating variants, and convalescent immunity,” eBioMedicine, vol. 91, p. 104534, May 2023.
- “Covid 19 forecast hub us.”
- K. Sherratt, H. Gruson, R. Grah, H. Johnson, R. Niehus, B. Prasse, F. Sandmann, J. Deuschel, D. Wolffram, S. Abbott, and et al., “Predictive performance of multi-model ensemble forecasts of covid-19 across european nations,” Apr 2023.
- “The german and polish covid-19 forecasthub.”
- F. Scarselli, M. Gori, A. C. Tsoi, M. Hagenbuchner, and G. Monfardini, “The graph neural network model,” IEEE transactions on neural networks, vol. 20, no. 1, pp. 61–80, 2008.
- M. Raissi, P. Perdikaris, and G. E. Karniadakis, “Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations,” Journal of Computational physics, vol. 378, pp. 686–707, 2019.
- S. Nagpal, R. Pal, Ashima, A. Tyagi, S. Tripathi, A. Nagori, S. Ahmad, H. P. Mishra, R. Malhotra, R. Kutum, and T. Sethi, “Genomic surveillance of COVID-19 variants with language models and machine learning,” Frontiers in Genetics, vol. 13, Apr. 2022.
- S. Basu and R. H. Campbell, “Classifying COVID-19 variants based on genetic sequences using deep learning models,” June 2021.
- E. Kharazmi, M. Cai, X. Zheng, Z. Zhang, G. Lin, and G. E. Karniadakis, “Identifiability and predictability of integer-and fractional-order epidemiological models using physics-informed neural networks,” Nature Computational Science, vol. 1, no. 11, pp. 744–753, 2021.
- I. Kiselev, I. Akberdin, and F. Kolpakov, “Delay-differential seir modeling for improved modelling of infection dynamics,” Scientific Reports, vol. 13, no. 1, p. 13439, 2023.
- A. Rodríguez, J. Cui, N. Ramakrishnan, B. Adhikari, and B. A. Prakash, “Einns: Epidemiologically-informed neural networks,” in Proceedings of the AAAI Conference on Artificial Intelligence, vol. 37, pp. 14453–14460, 2023.
- X. Ning, L. Jia, Y. Wei, X.-A. Li, and F. Chen, “Epi-dnns: Epidemiological priors informed deep neural networks for modeling covid-19 dynamics,” Computers in biology and medicine, vol. 158, p. 106693, 2023.
- M. R. Davahli, K. Fiok, W. Karwowski, A. M. Aljuaid, and R. Taiar, “Predicting the dynamics of the covid-19 pandemic in the united states using graph theory-based neural networks,” International Journal of Environmental Research and Public Health, vol. 18, p. 3834, Apr 2021.
- J. Gao, R. Sharma, C. Qian, L. M. Glass, J. Spaeder, J. Romberg, J. Sun, and C. Xiao, “STAN: spatio-temporal attention network for pandemic prediction using real-world evidence,” Journal of the American Medical Informatics Association, vol. 28, pp. 733–743, 01 2021.
- G. Panagopoulos, G. Nikolentzos, and M. Vazirgiannis, “Transfer graph neural networks for pandemic forecasting,” 2021.
- I. Sung, S. Lee, M. Pak, Y. Shin, and S. Kim, “AutoCoV: tracking the early spread of COVID-19 in terms of the spatial and temporal patterns from embedding space by k-mer based deep learning,” BMC Bioinformatics, vol. 23, Mar. 2022.
- S. Ganesan and D. Subramani, “Spatio-temporal predictive modeling framework for infectious disease spread,” Scientific Reports, vol. 11, no. 1, p. 6741, 2021.
- S. Seo, C. Meng, and Y. Liu, “Physics-aware difference graph networks for sparsely-observed dynamics,” in International Conference on Learning Representations, 2019.
- S. Seo and Y. Liu, “Differentiable physics-informed graph networks,” arXiv preprint arXiv:1902.02950, 2019.
- A. Chopra, A. Rodríguez, J. Subramanian, A. Quera-Bofarull, B. Krishnamurthy, B. A. Prakash, and R. Raskar, “Differentiable agent-based epidemiology,” arXiv preprint arXiv:2207.09714, 2022.
- A. Chopra, E. Gel, J. Subramanian, B. Krishnamurthy, S. Romero-Brufau, K. S. Pasupathy, T. C. Kingsley, and R. Raskar, “Deepabm: scalable, efficient and differentiable agent-based simulations via graph neural networks,” arXiv preprint arXiv:2110.04421, 2021.
- S. Deng, S. Wang, H. Rangwala, L. Wang, and Y. Ning, “Graph message passing with cross-location attentions for long-term ili prediction,” arXiv preprint arXiv:1912.10202, 2019.
- A. Kapoor, X. Ben, L. Liu, B. Perozzi, M. Barnes, M. Blais, and S. O’Banion, “Examining covid-19 forecasting using spatio-temporal graph neural networks,” arXiv preprint arXiv:2007.03113, 2020.
- A. Srivastava, “The variations of sikjalpha model for covid-19 forecasting and scenario projections,” Epidemics, vol. 45, p. 100729, 2023.
- J. Chen, C. Gu, Z. Ruan, and M. Tang, “Competition of sars-cov-2 variants on the pandemic transmission dynamics,” Chaos, Solitons; Fractals, vol. 169, p. 113193, Apr. 2023.
- T. Hale, N. Angrist, R. Goldszmidt, B. Kira, A. Petherick, T. Phillips, S. Webster, E. Cameron-Blake, L. Hallas, S. Majumdar, et al., “A global panel database of pandemic policies (oxford covid-19 government response tracker),” Nature human behaviour, vol. 5, no. 4, pp. 529–538, 2021.
- K. Cho, B. Van Merriënboer, C. Gulcehre, D. Bahdanau, F. Bougares, H. Schwenk, and Y. Bengio, “Learning phrase representations using rnn encoder-decoder for statistical machine translation,” arXiv preprint arXiv:1406.1078, 2014.
- T. Cai, S. Luo, K. Xu, D. He, T.-Y. Liu, and L. Wang, “Graphnorm: A principled approach to accelerating graph neural network training,” 2021.
- Y. Cui, M. Jia, T.-Y. Lin, Y. Song, and S. Belongie, “Class-balanced loss based on effective number of samples,” 2019.
- S. Khare, C. Gurry, L. Freitas, M. B. Schultz, G. Bach, A. Diallo, N. Akite, J. Ho, R. T. Lee, W. Yeo, G. C. C. Team, and S. Maurer-Stroh, “Gisaid’s role in pandemic response,” China CDC Weekly, vol. 3, p. 1049, 2021.
- I. Aksamentov, C. Roemer, E. B. Hodcroft, and R. A. Neher, “Nextclade: clade assignment, mutation calling and quality control for viral genomes,” Journal of Open Source Software, vol. 6, no. 67, p. 3773, 2021.
- P1sec, “Country adjacency,” 2013.
- OpenFlights, “OpenFlights dataset.” https://openflights.org/, 2023. Accessed: August 10, 2023.
- J. Opitz and S. Burst, “Macro f1 and macro f1,” arXiv preprint arXiv:1911.03347, 2019.
- Z. C. Lipton, C. Elkan, and B. Naryanaswamy, “Optimal thresholding of classifiers to maximize f1 measure,” in Machine Learning and Knowledge Discovery in Databases: European Conference, ECML PKDD 2014, Nancy, France, September 15-19, 2014. Proceedings, Part II 14, pp. 225–239, Springer, 2014.
- “A timeline of covid-19 variants,” 2023.
- Majd Al Aawar (3 papers)
- Srikar Mutnuri (2 papers)
- Mansooreh Montazerin (8 papers)
- Ajitesh Srivastava (33 papers)