Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
41 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

SIR-RL: Reinforcement Learning for Optimized Policy Control during Epidemiological Outbreaks in Emerging Market and Developing Economies (2404.08423v2)

Published 12 Apr 2024 in cs.LG, physics.soc-ph, and q-bio.PE

Abstract: The outbreak of COVID-19 has highlighted the intricate interplay between public health and economic stability on a global scale. This study proposes a novel reinforcement learning framework designed to optimize health and economic outcomes during pandemics. The framework leverages the SIR model, integrating both lockdown measures (via a stringency index) and vaccination strategies to simulate disease dynamics. The stringency index, indicative of the severity of lockdown measures, influences both the spread of the disease and the economic health of a country. Developing nations, which bear a disproportionate economic burden under stringent lockdowns, are the primary focus of our study. By implementing reinforcement learning, we aim to optimize governmental responses and strike a balance between the competing costs associated with public health and economic stability. This approach also enhances transparency in governmental decision-making by establishing a well-defined reward function for the reinforcement learning agent. In essence, this study introduces an innovative and ethical strategy to navigate the challenge of balancing public health and economic stability amidst infectious disease outbreaks.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (73)
  1. Baker, R. E. et al. Infectious disease in an era of global change. \JournalTitleNature Reviews Microbiology 20, 193–205 (2022).
  2. Tan, M. K. Covid-19 in an inequitable world: the last, the lost and the least (2021).
  3. Who coronavirus (covid-19) dashboard. https://covid19.who.int/. Accessed: 2024-01-12.
  4. World economic outlook, april 2020: The great lockdown. https://www.imf.org/en/Publications/WEO/Issues/2020/04/14/World-Economic-Outlook-April-2020-The-Great-Lockdown-49306. Accessed: 2024-01-12.
  5. Nicola, M. et al. The socio-economic implications of the coronavirus pandemic (covid-19): A review. \JournalTitleInternational journal of surgery 78, 185–193 (2020).
  6. The impact of the covid-19 pandemic on global gdp growth. \JournalTitleJournal of the Japanese and International Economies 68, 101258 (2023).
  7. How will country-based mitigation measures influence the course of the covid-19 epidemic? \JournalTitleThe lancet 395, 931–934 (2020).
  8. Pandemic policy assessment by artificial intelligence. \JournalTitleScientific Reports 12, 13843 (2022).
  9. Chinazzi, M. et al. The effect of travel restrictions on the spread of the 2019 novel coronavirus (covid-19) outbreak. \JournalTitleScience 368, 395–400 (2020).
  10. Nguyen, T. et al. Covid-19 vaccine strategies for aotearoa new zealand: a mathematical modelling study. \JournalTitleThe Lancet Regional Health–Western Pacific 15 (2021).
  11. The balancing role of distribution speed against varying efficacy levels of covid-19 vaccines under variants. \JournalTitleScientific reports 12, 7493 (2022).
  12. Jalloh, M. F. et al. Drivers of covid-19 policy stringency in 175 countries and territories: Covid-19 cases and deaths, gross domestic products per capita, and health expenditures. \JournalTitleJournal of Global Health 12 (2022).
  13. Caldwell, J. M. et al. Understanding covid-19 dynamics and the effects of interventions in the philippines: A mathematical modelling study. \JournalTitleThe Lancet Regional Health–Western Pacific 14 (2021).
  14. Ferguson, N. M. et al. Report 9: Impact of non-pharmaceutical interventions (NPIs) to reduce COVID19 mortality and healthcare demand, vol. 16 (Imperial College London London, 2020).
  15. De Foo, C. et al. Health financing policies during the covid-19 pandemic and implications for universal health care: a case study of 15 countries. \JournalTitleThe Lancet Global Health 11, e1964–e1977 (2023).
  16. Mitigation strategies for pandemic influenza a: balancing conflicting policy objectives. \JournalTitlePLoS computational biology 7, e1001076 (2011).
  17. Pangallo, M. et al. The unequal effects of the health–economy trade-off during the covid-19 pandemic. \JournalTitleNature Human Behaviour 1–12 (2023).
  18. Disease-economy trade-offs under alternative epidemic control strategies. \JournalTitleNature communications 13, 3319 (2022).
  19. Exploring optimal control of epidemic spread using reinforcement learning. \JournalTitleScientific reports 10, 22106 (2020).
  20. Reinforcement learning-based decision support system for covid-19. \JournalTitleBiomedical Signal Processing and Control 68, 102676 (2021).
  21. A simple planning problem for covid-19 lock-down, testing, and tracing. \JournalTitleAmerican Economic Review: Insights 3, 367–382 (2021).
  22. Lukasz, R. An analytical model of covid-19 lockdowns (2020).
  23. Redlin, M. Differences in npi strategies against covid-19. \JournalTitleJournal of Regulatory Economics 62, 1–23 (2022).
  24. Covid-19 case doubling time associated with non-pharmaceutical interventions and vaccination: A global experience. \JournalTitleJournal of global health 11 (2021).
  25. Patel, M. D. et al. The joint impact of covid-19 vaccination and non-pharmaceutical interventions on infections, hospitalizations, and mortality: an agent-based simulation. \JournalTitleMedRxiv (2021).
  26. How did korea’s fiscal accounts fare during the covid-19 pandemic? \JournalTitlePeterson Institute for International Economics Policy Brief 23–8 (2023).
  27. The economic effects of covid-19 containment measures (2020).
  28. The macroeconomics of epidemics. \JournalTitleThe Review of Financial Studies 34, 5149–5187 (2021).
  29. How to cope with emerging viral diseases: Lessons from south korea’s strategy for covid-19, and collateral damage to cardiometabolic health. \JournalTitleThe Lancet Regional Health–Western Pacific 30 (2023).
  30. Coronavirus: South korea seeing a “stabilising trend”. https://www.bbc.com/news/av/world-asia-51897979. Accessed: 2024-01-12.
  31. Covid-19 coronavirus pandemic. https://www.worldometers.info/coronavirus/. Accessed: 2024-01-12.
  32. Hethcote, H. W. Three basic epidemiological models. In Applied mathematical ecology, 119–144 (Springer, 1989).
  33. Hethcote, H. W. The basic epidemiology models: models, expressions for r0, parameter estimation, and applications. In Mathematical understanding of infectious disease dynamics, 1–61 (World Scientific, 2009).
  34. Allen, L. J. A primer on stochastic epidemic models: Formulation, numerical simulation, and analysis. \JournalTitleInfectious Disease Modelling 2, 128–142 (2017).
  35. A sir model assumption for the spread of covid-19 in different communities. \JournalTitleChaos, Solitons & Fractals 139, 110057 (2020).
  36. The seirs model for infectious disease dynamics. \JournalTitleNature methods 17, 557–559 (2020).
  37. Seir model for covid-19 dynamics incorporating the environment and social distancing. \JournalTitleBMC Research Notes 13, 352 (2020).
  38. Adaptive sir model with vaccination: Simultaneous identification of rates and functions illustrated with covid-19. \JournalTitleScientific Reports 12, 15688 (2022).
  39. Sir model with vaccination: bifurcation analysis. \JournalTitleQualitative theory of dynamical systems 22, 105 (2023).
  40. Optimal vaccination strategies for an seir model of infectious diseases with logistic growth. \JournalTitleMathematical Biosciences & Engineering 15, 485–505 (2017).
  41. Turkyilmazoglu, M. An extended epidemic model with vaccination: Weak-immune sirvi. \JournalTitlePhysica A: Statistical Mechanics and its Applications 598, 127429 (2022).
  42. Modelling the impact of perfect and imperfect vaccination strategy against sars cov-2 by assuming varied vaccine efficacy over india. \JournalTitleClinical Epidemiology and Global Health 15, 101052 (2022).
  43. Hale, T. et al. A global panel database of pandemic policies (oxford covid-19 government response tracker). \JournalTitleNature human behaviour 5, 529–538 (2021).
  44. Lockdowns in sir models (2020).
  45. Atkeson, A. What will be the economic impact of covid-19 in the us? rough estimates of disease scenarios. Tech. Rep., National Bureau of Economic Research (2020).
  46. A time-dependent sir model for covid-19 with undetectable infected persons. \JournalTitleIeee transactions on network science and engineering 7, 3279–3294 (2020).
  47. Covid-19 pandemic–related policy stringency and economic decline: was it really inevitable? \JournalTitleEconomic research-Ekonomska istraživanja 36, 499–515 (2023).
  48. Cilloni, L. et al. The potential impact of the covid-19 pandemic on the tuberculosis epidemic a modelling analysis. \JournalTitleEClinicalMedicine 28 (2020).
  49. Health in financial crises: economic recession and tuberculosis in central and eastern europe. \JournalTitleJournal of the Royal Society Interface 7, 1559–1569 (2010).
  50. A general framework for optimising cost-effectiveness of pandemic response under partial intervention measures. \JournalTitleScientific Reports 12, 19482 (2022).
  51. Bastani, H. et al. Efficient and targeted covid-19 border testing via reinforcement learning. \JournalTitleNature 599, 108–113 (2021).
  52. Reinforcement learning: An introduction (MIT press, 2018).
  53. Dunn, W. N. Public policy analysis (routledge, 2015).
  54. Policy communities. In Handbook of Public Policy Analysis, 137–147 (CRC Press, 2006).
  55. Mnih, V. et al. Human-level control through deep reinforcement learning. \JournalTitlenature 518, 529–533 (2015).
  56. Francois-Lavet, V. et al. An introduction to deep reinforcement learning. \JournalTitleFoundations and Trends in Machine Learning 11, 219–354 (2018).
  57. Deep reinforcement learning: A brief survey. \JournalTitleIEEE Signal Processing Magazine 34, 26–38 (2017).
  58. Henderson, P. et al. Deep reinforcement learning that matters. In Proceedings of the AAAI conference on artificial intelligence, vol. 32 (2018).
  59. Bakker, B. Reinforcement learning with long short-term memory. \JournalTitleAdvances in neural information processing systems 14 (2001).
  60. Long short-term memory. \JournalTitleNeural computation 9, 1735–1780 (1997).
  61. Hens, N. et al. Seventy-five years of estimating the force of infection from current status data. \JournalTitleEpidemiology & Infection 138, 802–812 (2010).
  62. Massad, E. Ethical and transborder issues. In Global Health Informatics, 232–263 (Elsevier, 2017).
  63. Huber, P. J. Robust estimation of a location parameter. In Breakthroughs in statistics: Methodology and distribution, 492–518 (Springer, 1992).
  64. Implementing the nelder-mead simplex algorithm with adaptive parameters. \JournalTitleComputational Optimization and Applications 51, 259–277 (2012).
  65. Lockdowns in sir models (code) (2020).
  66. Mathieu, E. et al. Coronavirus pandemic (covid-19). \JournalTitleOur world in data (2020).
  67. Tw-sir: time-window based sir for covid-19 forecasts. \JournalTitleScientific reports 10, 22454 (2020).
  68. Covid-19 vaccine launch in india. https://www.unicef.org/india/stories/covid-19-vaccine-launch-india. Accessed: 2024-01-12.
  69. Oecd system of composite leading indicators. https://www.oecd.org/sdd/41629509.pdf. Accessed: 2024-01-12.
  70. Oecd system of composite leading indicators. https://www.oecd.org/sdd/leading-indicators/oecd-composite-leading-indicators-clis.htm. Accessed: 2024-01-12.
  71. Aws deepracer. https://aws.amazon.com/deepracer/league/. Accessed: 2024-01-12.
  72. Internet archive. https://archive.org. Accessed: 2024-01-12.
  73. OECD. Main economic indicators - complete database (2015).
User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (3)
  1. Maeghal Jain (1 paper)
  2. Ziya Uddin (7 papers)
  3. Wubshet Ibrahim (1 paper)