Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
149 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Learning epidemic trajectories through Kernel Operator Learning: from modelling to optimal control (2404.11130v2)

Published 17 Apr 2024 in math.NA, cs.LG, cs.NA, math.OC, and q-bio.PE

Abstract: Since infectious pathogens start spreading into a susceptible population, mathematical models can provide policy makers with reliable forecasts and scenario analyses, which can be concretely implemented or solely consulted. In these complex epidemiological scenarios, machine learning architectures can play an important role, since they directly reconstruct data-driven models circumventing the specific modelling choices and the parameter calibration, typical of classical compartmental models. In this work, we discuss the efficacy of Kernel Operator Learning (KOL) to reconstruct population dynamics during epidemic outbreaks, where the transmission rate is ruled by an input strategy. In particular, we introduce two surrogate models, named KOL-m and KOL-$\partial$, which reconstruct in two different ways the evolution of the epidemics. Moreover, we evaluate the generalization performances of the two approaches with different kernels, including the Neural Tangent Kernels, and compare them with a classical neural network model learning method. Employing synthetic but semi-realistic data, we show how the two introduced approaches are suitable for realizing fast and robust forecasts and scenario analyses, and how these approaches are competitive for determining optimal intervention strategies with respect to specific performance measures.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (42)
  1. Martcheva M. An introduction to mathematical epidemiology. vol. 61. Springer; 2015.
  2. Mathematical models in epidemiology. vol. 32. Springer; 2019.
  3. Modelling the COVID-19 epidemic and implementation of population-wide interventions in Italy. Nature medicine. 2020;26(6):855-60.
  4. Modelling the COVID-19 epidemic and the vaccination campaign in Italy by the SUIHTER model. Infectious Disease Modelling. 2022;7(2):45-63.
  5. The geography of COVID-19 spread in Italy and implications for the relaxation of confinement measures. Nature communications. 2020;11(1):42-64.
  6. Anatomy of the first six months of COVID-19 vaccination campaign in Italy. PLoS Computational Biology. 2022;18(5):e1010146.
  7. Combining the dynamic model and deep neural networks to identify the intensity of interventions during COVID-19 pandemic. PLOS Computational Biology. 2023;19(10):e1011535.
  8. Optimized numerical solutions of SIRDVW multiage model controlling SARS-CoV-2 vaccine roll out: An application to the Italian scenario. Infectious Disease Modelling. 2023;8(3):672-703.
  9. Optimal control of the spatial allocation of COVID-19 vaccines: Italy as a case study. PLoS computational biology. 2022;18(7):e1010237.
  10. Britton T, Leskelä L. Optimal intervention strategies for minimizing total incidence during an epidemic. SIAM Journal on Applied Mathematics. 2023;83(2):354-73.
  11. Optimal control of epidemic spreading in the presence of social heterogeneity. Philosophical Transactions of the Royal Society A. 2022;380(2224):20210160.
  12. Optimal, near-optimal, and robust epidemic control. Communications Physics. 2021;4(1):78.
  13. Time-optimal control strategies in SIR epidemic models. Mathematical biosciences. 2017;292:86-96.
  14. Learning nonlinear operators via DeepONet based on the universal approximation theorem of operators. Nature machine intelligence. 2021;3(3):218-29.
  15. A physics-informed variational DeepONet for predicting crack path in quasi-brittle materials. Computer Methods in Applied Mechanics and Engineering. 2022;391:114587.
  16. Fourier neural operator for parametric partial differential equations. arXiv preprint arXiv:201008895. 2020.
  17. Kernel methods are competitive for operator learning. Journal of Computational Physics. 2024;496:112549.
  18. Genton MG. Classes of kernels for machine learning: a statistics perspective. Journal of machine learning research. 2001;2(Dec):299-312.
  19. Finite versus infinite neural networks: an empirical study. Advances in Neural Information Processing Systems. 2020;33:15156-72.
  20. Neural tangent kernel: Convergence and generalization in neural networks. Advances in neural information processing systems. 2018;31.
  21. On exact computation with an infinitely wide neural net. Advances in neural information processing systems. 2019;32.
  22. Kernel regression with infinite-width neural networks on millions of examples. arXiv preprint arXiv:230305420. 2023.
  23. Vector valued reproducing kernel Hilbert spaces of integrable functions and Mercer theorem. Analysis and Applications. 2006;4(04):377-408.
  24. Fleming WH, Rishel RW. Deterministic and stochastic optimal control. vol. 1. Springer Science & Business Media; 2012.
  25. SUIHTER: A new mathematical model for COVID-19. Application to the analysis of the second epidemic outbreak in Italy. Proceedings of the Royal Society A. 2021;477(2253):20210027.
  26. The effect of COVID-19 vaccination in Italy and perspectives for living with the virus. Nature communications. 2021;12(1):7272.
  27. Yong J, Zhou XY. Stochastic controls: Hamiltonian systems and HJB equations. vol. 43. Springer Science & Business Media; 2012.
  28. Predictive performance of multi-model ensemble forecasts of COVID-19 across European nations. Elife. 2023;12:e81916.
  29. Kernel methods through the roof: handling billions of points efficiently. Advances in Neural Information Processing Systems. 2020;33:14410-22.
  30. Owhadi H, Yoo GR. Kernel flows: From learning kernels from data into the abyss. Journal of Computational Physics. 2019;389:22-47.
  31. Shape-driven interpolation with discontinuous kernels: Error analysis, edge extraction, and applications in magnetic particle imaging. SIAM Journal on Scientific Computing. 2020;42(2):B472-91.
  32. Machine learning of space-fractional differential equations. SIAM Journal on Scientific Computing. 2019;41(4):A2485-509.
  33. Neural Tangents: Fast and Easy Infinite Neural Networks in Python. In: International Conference on Learning Representations; 2019. .
  34. Machine learning for fast and reliable solution of time-dependent differential equations. Journal of Computational physics. 2019;397:108852.
  35. A machine learning method for real-time numerical simulations of cardiac electromechanics. Computer methods in applied mechanics and engineering. 2022;393:114825.
  36. Machine learning of multiscale active force generation models for the efficient simulation of cardiac electromechanics. Computer Methods in Applied Mechanics and Engineering. 2020;370:113268.
  37. Nocedal J, Wright SJ. Numerical optimization. Springer; 1999.
  38. SciPy 1.0: fundamental algorithms for scientific computing in Python. Nature methods. 2020;17(3):261-72.
  39. Gradient descent provably optimizes over-parameterized neural networks. arXiv preprint arXiv:181002054. 2018.
  40. Wide neural networks of any depth evolve as linear models under gradient descent. Advances in neural information processing systems. 2019;32.
  41. A generalized representer theorem. In: International conference on computational learning theory. Springer; 2001. p. 416-26.
  42. Kernels for vector-valued functions: A review. Foundations and Trends® in Machine Learning. 2012;4(3):195-266.
Citations (1)

Summary

We haven't generated a summary for this paper yet.