Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

GenCast: Diffusion-based ensemble forecasting for medium-range weather (2312.15796v2)

Published 25 Dec 2023 in cs.LG and physics.ao-ph

Abstract: Weather forecasts are fundamentally uncertain, so predicting the range of probable weather scenarios is crucial for important decisions, from warning the public about hazardous weather, to planning renewable energy use. Here, we introduce GenCast, a probabilistic weather model with greater skill and speed than the top operational medium-range weather forecast in the world, the European Centre for Medium-Range Forecasts (ECMWF)'s ensemble forecast, ENS. Unlike traditional approaches, which are based on numerical weather prediction (NWP), GenCast is a machine learning weather prediction (MLWP) method, trained on decades of reanalysis data. GenCast generates an ensemble of stochastic 15-day global forecasts, at 12-hour steps and 0.25 degree latitude-longitude resolution, for over 80 surface and atmospheric variables, in 8 minutes. It has greater skill than ENS on 97.4% of 1320 targets we evaluated, and better predicts extreme weather, tropical cyclones, and wind power production. This work helps open the next chapter in operational weather forecasting, where critical weather-dependent decisions are made with greater accuracy and efficiency.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (67)
  1. Deep learning for day forecasts from sparse observations. arXiv preprint arXiv:2306.06079, 2023.
  2. Precipitation nowcasting with generative diffusion models. arXiv preprint arXiv:2308.06733, 2023.
  3. Conditional image generation with score-based diffusion models. arXiv preprint arXiv:2111.13606, 2021.
  4. Pangu-Weather: A 3D high-resolution model for fast and accurate global weather forecast. arXiv preprint arXiv:2211.02556, 2022.
  5. Accurate medium-range global weather forecasting with 3d neural networks. Nature, 619(7970):533–538, 2023.
  6. Align your latents: High-resolution video synthesis with latent diffusion models. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 22563–22575, 2023.
  7. The thorpex interactive grand global ensemble. Bulletin of the American Meteorological Society, 91(8):1059 – 1072, 2010. https://doi.org/10.1175/2010BAMS2853.1. URL https://journals.ametsoc.org/view/journals/bams/91/8/2010bams2853_1.xml.
  8. Stochastic representation of model uncertainties in the ECMWF ensemble prediction system. Quarterly Journal of the Royal Meteorological Society, 125(560):2887–2908, 1999.
  9. Potential use of an ensemble of analyses in the ECMWF ensemble prediction system. Quarterly Journal of the Royal Meteorological Society, 134(637):2051–2066, 2008.
  10. Fengwu: Pushing the skillful global medium-range weather forecast beyond 10 days lead. arXiv preprint arXiv:2304.02948, 2023.
  11. Adaspeech: Adaptive text to speech for custom voice, 2021.
  12. Computing fourier transforms and convolutions on the 2-sphere. Adv. Appl. Math., 15(2):202–250, June 1994.
  13. ECMWF. IFS Documentation CY46R1 - Part V: Ensemble Prediction System. 2019. 10.21957/38yug0cev. URL https://www.ecmwf.int/node/19309.
  14. L. Edward. The essence of chaos. 1993.
  15. Why should ensemble spread match the rmse of the ensemble mean? Journal of Hydrometeorology, 15(4):1708–1713, 2014.
  16. Prediff: Precipitation nowcasting with latent diffusion models. arXiv preprint arXiv:2307.10422, 2023.
  17. T. Gneiting and A. E. Raftery. Strictly proper scoring rules, prediction, and estimation. Journal of the American statistical Association, pages 359–378, 2007.
  18. Intercomparison of global model precipitation forecast skill in 2010/11 using the seeps score. Monthly Weather Review, 140(8):2720–2733, 2012.
  19. T. M. Hamill. Interpretation of rank histograms for verifying ensemble forecasts. Monthly Weather Review, 129(3):550–560, 2001.
  20. Flexible diffusion modeling of long videos. Advances in Neural Information Processing Systems, 35:27953–27965, 2022.
  21. The ERA5 global reanalysis. Quarterly Journal of the Royal Meteorological Society, 146(730):1999–2049, 2020.
  22. Denoising diffusion probabilistic models. Advances in neural information processing systems, 33:6840–6851, 2020.
  23. Imagen video: High definition video generation with diffusion models. arXiv preprint arXiv:2210.02303, 2022a.
  24. Video diffusion models, 2022b.
  25. Towards unifying diffusion models for probabilistic spatio-temporal graph learning, 2023.
  26. Tpu v4: An optically reconfigurable supercomputer for machine learning with hardware support for embeddings. In Proceedings of the 50th Annual International Symposium on Computer Architecture, pages 1–14, 2023.
  27. E. Kalnay. Atmospheric modeling, data assimilation and predictability. Cambridge university press, 2003.
  28. Elucidating the design space of diffusion-based generative models. Advances in Neural Information Processing Systems, 35:26565–26577, 2022.
  29. R. Keisler. Forecasting global weather with graph neural networks. arXiv preprint arXiv:2202.07575, 2022.
  30. Neural general circulation models. arXiv preprint arXiv:2311.07222, 2023.
  31. Turbulent flow simulation using autoregressive conditional diffusion models. arXiv preprint arXiv:2309.01745, 2023.
  32. FourCastNet: Accelerating global high-resolution weather forecasting using adaptive fourier neural operators. arXiv preprint arXiv:2208.05419, 2022.
  33. Graphcast: Learning skillful medium-range global weather forecasting. arXiv preprint arXiv:2212.12794, 2022.
  34. Learning skillful medium-range global weather forecasting. Science, page eadi2336, 2023.
  35. Continuous data assimilation for the IFS. ECMWF Newsletter, pages 21–26, Jan. 2019. 10.21957/9pl5fc37it. URL https://www.ecmwf.int/node/18882.
  36. M. Leutbecher. Diagnosis of ensemble forecasting systems. In Seminar on diagnosis of forecasting and data assimilation systems, pages 235–266, 2009.
  37. Fuxi: A cascade machine learning forecasting system for 15-day global weather forecast. npj Climate and Atmospheric Science, 6(190), 2023a.
  38. Seeds: Emulation of weather forecast ensembles with diffusion models. arXiv preprint arXiv:2306.14066, 2023b.
  39. I. Loshchilov and F. Hutter. Decoupled weight decay regularization. In International Conference on Learning Representations, 2018.
  40. DPM-Solver++: Fast solver for guided sampling of diffusion probabilistic models. arXiv preprint arXiv:2211.01095, 2022.
  41. Scaling transformer neural networks for skillful and reliable medium-range weather forecasting. arXiv preprint arXiv:2312.03876, 2023.
  42. T. Q. Nguyen and J. Salazar. Transformers without tears: Improving the normalization of self-attention. In Proceedings of the 16th International Conference on Spoken Language Translation, 2019.
  43. An assessment of the seeps and sedi metrics for the verification of 6 h forecast precipitation accumulations. Meteorological Applications, 20(2):164–175, 2013.
  44. T. Palmer. The ecmwf ensemble prediction system: Looking back (more than) 25 years and projecting forward 25 years. Quarterly Journal of the Royal Meteorological Society, 145:12–24, 2019.
  45. T. Palmer and R. Hagedorn. Predictability of weather and climate. Cambridge University Press, 2006.
  46. Fourcastnet: A global data-driven high-resolution weather model using adaptive fourier neural operators. arXiv preprint arXiv:2202.11214, 2022.
  47. Spatial verification of ensemble precipitation: An ensemble version of SAL. Weather Forecast., 33(4):1001–1020, Aug. 2018.
  48. WeatherBench: a benchmark data set for data-driven weather forecasting. Journal of Advances in Modeling Earth Systems, 12(11):e2020MS002203, 2020.
  49. Weatherbench 2: A benchmark for the next generation of data-driven global weather models. arXiv preprint arXiv:2308.15560, 2023.
  50. Autoregressive denoising diffusion models for multivariate probabilistic time series forecasting. In International Conference on Machine Learning, pages 8857–8868. PMLR, 2021.
  51. Skilful precipitation nowcasting using deep generative models of radar. Nature, 597(7878):672–677, 2021.
  52. Improver: The new probabilistic postprocessing system at the met office. Bulletin of the American Meteorological Society, 104(3):E680–E697, 2023.
  53. A new equitable score suitable for verifying precipitation in numerical weather prediction. Quarterly Journal of the Royal Meteorological Society, 136(650):1344–1363, 2010.
  54. Deep unsupervised learning using nonequilibrium thermodynamics. In International conference on machine learning, pages 2256–2265. PMLR, 2015.
  55. Score-based generative modeling through stochastic differential equations. In International Conference on Learning Representations, 2021. URL https://openreview.net/forum?id=PxTIG12RRHS.
  56. O. Talagrand. Evaluation of probabilistic prediction systems. In Workshop Proceedings" Workshop on Predictability", 20-22 October 1997, ECMWF, Reading, UK, 1999.
  57. Attention is all you need. Advances in neural information processing systems, 30, 2017.
  58. A. Weaver and P. Courtier. Correlation modelling on the sphere using a generalized diffusion equation. Quarterly Journal of the Royal Meteorological Society, 127(575):1815–1846, 2001.
  59. Diffstg: Probabilistic spatio-temporal graph forecasting with denoising diffusion models. arXiv preprint arXiv:2301.13629, 2023.
  60. Wikipedia. 2019 atlantic hurricane season, Dec 2023. URL https://en.wikipedia.org/wiki/2019_Atlantic_hurricane_season.
  61. D. Wilks. On the reliability of the rank histogram. Monthly Weather Review, 139(1):311–316, 2011.
  62. W.M.O. Manual on the Global Data-processing and Forecasting System. World Meteorological Organization, 2019. ISBN 978-92-63-10485-4. URL https://library.wmo.int/idurl/4/35703.
  63. Introduction to jma’s new global ensemble prediction system. CAS/JSC WGNE, Research Activities in Atmospheric and Oceanic Modelling, 42:6–13, 2018.
  64. Diffusion probabilistic modeling for video generation. Entropy, 25(10):1469, 2023.
  65. M. Zamo and P. Naveau. Estimation of the continuous ranked probability score with limited information and applications to ensemble weather forecasts. Mathematical Geosciences, 50(2):209–234, 2018.
  66. Skilful nowcasting of extreme precipitation with nowcastnet. Nature, 619(7970):526–532, 2023.
  67. May 2006 upgrade of the GEFS and first implementation of naefs systems, 2012b.
User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (12)
  1. Ilan Price (9 papers)
  2. Alvaro Sanchez-Gonzalez (25 papers)
  3. Ferran Alet (14 papers)
  4. Timo Ewalds (7 papers)
  5. Andrew El-Kadi (3 papers)
  6. Jacklynn Stott (4 papers)
  7. Shakir Mohamed (42 papers)
  8. Peter Battaglia (40 papers)
  9. Remi Lam (5 papers)
  10. Matthew Willson (14 papers)
  11. Tom R. Andersson (7 papers)
  12. Dominic Masters (11 papers)
Citations (83)

Summary

Overview of GenCast Model

GenCast is a machine learning-based generative model designed specifically for ensemble weather forecasting. It aims to produce a set of possible future weather scenarios, known as ensemble forecasts, that are not only accurate but also coherent over time and space. GenCast outperforms its counterparts by generating high-resolution global weather predictions with exceptional skill and reliability, even for rare and extreme events.

Methodology and Architecture

GenCast employs a diffusion model, a type of neural network that generates forecasts by refining an initial state comprised mostly of noise. The model is fed historical weather data and trained to predict future states. Its architecture is similar to previous models like GraphCast, but notably, it transitions from a spatial transformer network to a sparse transformer, enhancing its ability to preserve high-frequency content.

Results and Performance

Through an extensive evaluation using multiple metrics, GenCast demonstrates superior forecasting skill compared to both the European Centre for Medium-range Weather Forecasts (ECMWF)’s top operational ensemble forecast, ENS, and a leading ML model GraphCast-Perturbed. Notably, GenCast maintains high performance across all examined weather variables and up to 15 days into the future. It even excels at forecasting extreme weather events, a critical capability for many applications.

Physical Consistency and Reliability

One of the strengths of GenCast is its ability to generate physically consistent forecasts. It preserves the high-frequency spatial structure and produces reliable ensemble forecasts with spread and rank histograms comparable to or better than operational models. This ability is especially important for ensuring that forecasts remain plausible and useful for real-world decision-making.

Implications and Future Work

With its rapid computation on a single Cloud TPU device, GenCast opens new opportunities for producing large ensemble weather forecasts efficiently. However, it still requires high-quality initial conditions from traditional weather prediction systems. Moving forward, GenCast signifies a substantial advancement in the applicability and utility of ML for weather forecasting, potentially revolutionizing how weather predictions are made and used.