
Advancing Real-time Pandemic Forecasting Using Large Language Models: A COVID-19 Case Study (2404.06962v1)

Published 10 Apr 2024 in cs.LG and cs.AI

Abstract: Forecasting the short-term spread of an ongoing disease outbreak is a formidable challenge due to the complexity of contributing factors, some of which can be characterized through interlinked, multi-modality variables such as epidemiological time series data, viral biology, population demographics, and the intersection of public policy and human behavior. Existing forecasting frameworks struggle with the multifaceted nature of the relevant data and with robustly translating results, which hinders their performance and their ability to provide actionable insights to public health decision-makers. Our work introduces PandemicLLM, a novel framework with multi-modal LLMs that reformulates real-time forecasting of disease spread as a text reasoning problem, with the ability to incorporate real-time, complex, non-numerical information that was previously unattainable in traditional forecasting models. This approach encodes multi-modal data for LLMs through a unique AI-human cooperative prompt design and time series representation learning. The model is applied to the COVID-19 pandemic, trained to utilize textual public health policies, genomic surveillance, spatial, and epidemiological time series data, and subsequently tested across all 50 states of the U.S. Empirically, PandemicLLM is shown to be a high-performing pandemic forecasting framework that effectively captures the impact of emerging variants and can provide timely and accurate predictions. The proposed PandemicLLM opens avenues for incorporating various pandemic-related data in heterogeneous formats and exhibits performance benefits over existing models. This study illuminates the potential of adapting LLMs and representation learning to enhance pandemic forecasting, illustrating how AI innovations can strengthen pandemic responses and crisis management in the future.
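
To make the idea of treating forecasting as a text reasoning problem more concrete, the following is a minimal illustrative sketch of how numerical and textual inputs for one state could be combined into a single prompt. It is not the authors' implementation: the `StateWeek` container, the `describe_trend` and `build_prompt` helpers, the prompt wording, and the example values are all assumptions made here for illustration. It also serializes the time series as plain text, whereas the abstract describes time series representation learning.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class StateWeek:
    """Hypothetical container for one state's inputs in a given week."""
    state: str
    weekly_hospitalizations: List[int]  # oldest first, most recent last
    policy_note: str                    # free-text public health policy
    dominant_variant: str               # from genomic surveillance


def describe_trend(series: List[int]) -> str:
    """Summarize the change between the last two values in words."""
    if len(series) < 2 or series[-2] == 0:
        return "trend unavailable"
    change = (series[-1] - series[-2]) / series[-2]
    direction = "up" if change > 0 else "down" if change < 0 else "flat"
    return f"{direction} {abs(change):.0%} week over week"


def build_prompt(record: StateWeek) -> str:
    """Combine numeric and textual inputs into one text prompt (illustrative)."""
    history = ", ".join(str(v) for v in record.weekly_hospitalizations)
    return (
        f"State: {record.state}\n"
        f"Weekly hospitalizations (oldest to newest): {history}\n"
        f"Recent trend: {describe_trend(record.weekly_hospitalizations)}\n"
        f"Dominant variant: {record.dominant_variant}\n"
        f"Policy: {record.policy_note}\n"
        "Question: How will hospitalizations change next week?"
    )


# Example values below are invented for illustration only.
example = StateWeek(
    state="Maryland",
    weekly_hospitalizations=[100, 120, 150],
    policy_note="Indoor mask mandate lifted.",
    dominant_variant="XBB.1.5",
)
prompt = build_prompt(example)
print(prompt)
```

The resulting string could be passed to any text-generation model; how the prompt is actually designed and how the time series is encoded in PandemicLLM is described in the paper itself, not here.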

Authors (8)
  1. Hongru Du
  2. Jianan Zhao
  3. Yang Zhao
  4. Shaochong Xu
  5. Xihong Lin
  6. Yiran Chen
  7. Lauren M. Gardner
  8. Hao Frank Yang
