Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
126 tokens/sec
GPT-4o
47 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

DiffDA: a Diffusion Model for Weather-scale Data Assimilation (2401.05932v3)

Published 11 Jan 2024 in cs.CE and cs.AI

Abstract: The generation of initial conditions via accurate data assimilation is crucial for weather forecasting and climate modeling. We propose DiffDA as a denoising diffusion model capable of assimilating atmospheric variables using predicted states and sparse observations. Acknowledging the similarity between a weather forecast model and a denoising diffusion model dedicated to weather applications, we adapt the pretrained GraphCast neural network as the backbone of the diffusion model. Through experiments based on simulated observations from the ERA5 reanalysis dataset, our method can produce assimilated global atmospheric data consistent with observations at 0.25 deg (~30km) resolution globally. This marks the highest resolution achieved by ML data assimilation models. The experiments also show that the initial conditions assimilated from sparse observations (less than 0.96% of gridded data) and 48-hour forecast can be used for forecast models with a loss of lead time of at most 24 hours compared to initial conditions from state-of-the-art data assimilation in ERA5. This enables the application of the method to real-world applications, such as creating reanalysis datasets with autoregressive data assimilation.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (29)
  1. page 35–144. Cambridge University Press, July 2023. ISBN 9781009157896. doi: 10.1017/9781009157896.002. URL http://dx.doi.org/10.1017/9781009157896.002.
  2. G. Andry et al. Data assimilation as simulation-based inference. 2023.
  3. Deep learning for day forecasts from sparse observations. arXiv preprint arXiv:2306.06079, 2023.
  4. Ens-10: A dataset for post-processing ensemble weather forecasts. Advances in Neural Information Processing Systems, 35:21974–21987, 2022.
  5. R. N. Bannister. A review of operational methods of variational and ensemble-variational data assimilation. Quarterly Journal of the Royal Meteorological Society, 143(703):607–633, 2017.
  6. Accurate medium-range global weather forecasting with 3d neural networks. Nature, 619(7970):533–538, 2023.
  7. The evolution of the ecmwf hybrid data assimilation system. Quarterly Journal of the Royal Meteorological Society, 142(694):287–303, 2016.
  8. JAX: composable transformations of Python+NumPy programs, 2018. URL http://github.com/google/jax.
  9. Diffusion posterior sampling for general noisy inverse problems. In The Eleventh International Conference on Learning Representations, 2022.
  10. Data assimilation fundamentals: A unified formulation of the state and parameter estimation problem. Springer Nature, 2022.
  11. Representation learning with unconditional denoising diffusion models for dynamical systems. EGUsphere, 2023:1–39, 2023.
  12. The era5 global reanalysis. Quarterly Journal of the Royal Meteorological Society, 146(730):1999–2049, 2020.
  13. Denoising diffusion probabilistic models. Advances in neural information processing systems, 33:6840–6851, 2020.
  14. Learning skillful medium-range global weather forecasting. Science, page eadi2336, 2023.
  15. Data assimilation. Cham, Switzerland: Springer, 214:52, 2015.
  16. Seeds: Emulation of weather forecast ensembles with diffusion models. arXiv preprint arXiv:2306.14066, 2023.
  17. I. Loshchilov and F. Hutter. Fixing weight decay regularization in adam. 2018.
  18. Repaint: Inpainting using denoising diffusion probabilistic models. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 11461–11471, 2022.
  19. Generative residual diffusion modeling for km-scale atmospheric downscaling. arXiv preprint arXiv:2309.15214, 2023.
  20. Residual diffusion modeling for km-scale atmospheric downscaling. 2024.
  21. Fourcastnet: A global data-driven high-resolution weather model using adaptive fourier neural operators. arXiv preprint arXiv:2202.11214, 2022.
  22. Gencast: Diffusion-based ensemble forecasting for medium-range weather. arXiv preprint arXiv:2312.15796, 2023.
  23. Weatherbench: a benchmark data set for data-driven weather forecasting. Journal of Advances in Modeling Earth Systems, 12(11):e2020MS002203, 2020.
  24. Weatherbench 2: A benchmark for the next generation of data-driven global weather models. arXiv preprint arXiv:2308.15560, 2023.
  25. F. Rozet and G. Louppe. Score-based data assimilation. arXiv preprint arXiv:2306.10574, 2023a.
  26. F. Rozet and G. Louppe. Score-based data assimilation for a two-layer quasi-geostrophic model. arXiv preprint arXiv:2310.01853, 2023b.
  27. Photorealistic text-to-image diffusion models with deep language understanding. Advances in Neural Information Processing Systems, 35:36479–36494, 2022.
  28. Score-based generative modeling through stochastic differential equations. arXiv preprint arXiv:2011.13456, 2020.
  29. Scipy 1.0: fundamental algorithms for scientific computing in python. Nature methods, 17(3):261–272, 2020.
Citations (20)

Summary

We haven't generated a summary for this paper yet.