Kunyu: A High-Performing Global Weather Model Beyond Regression Losses (2312.08264v1)
Abstract: Over the past year, data-driven global weather forecasting has emerged as a new alternative to traditional numerical weather prediction. This innovative approach yields forecasts of comparable accuracy at a tiny fraction of the computational cost. Regrettably, as far as I know, existing models rely exclusively on regression losses, producing forecasts with substantial blurring. Such blurring, although it compromises practicality, enjoys an unfair advantage on evaluation metrics. In this paper, I present Kunyu, a global data-driven weather forecasting model which delivers accurate predictions across a comprehensive array of atmospheric variables at 0.35° resolution. With both regression and adversarial losses integrated in its training framework, Kunyu generates forecasts with enhanced clarity and realism. Its performance outpaces even ECMWF HRES in some aspects, such as the estimation of anomaly extremes, while remaining competitive with ECMWF HRES on evaluation metrics such as RMSE and ACC. Kunyu is an important step forward in closing the utility gap between numerical and data-driven weather prediction.
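The claim that blurring enjoys an advantage on metrics such as RMSE can be illustrated with a toy example. The sketch below is not taken from the paper: the one-dimensional "field", the two equally likely outcomes, and the helper names `bump` and `pooled_rmse` are all assumptions made for illustration. It compares a sharp forecast that commits to one outcome with a blurred forecast that averages both.

```python
import numpy as np

def bump(center, n=100, width=3.0):
    """Hypothetical 1-D 'weather feature': a Gaussian peak at `center`."""
    x = np.arange(n)
    return np.exp(-0.5 * ((x - center) / width) ** 2)

def pooled_rmse(forecast, truths):
    """RMSE pooled over grid points and over all possible truths."""
    mse_per_truth = [np.mean((forecast - t) ** 2) for t in truths]
    return float(np.sqrt(np.mean(mse_per_truth)))

# Assumption: the peak ends up at position 40 or 60 with equal probability.
outcome_a = bump(40)
outcome_b = bump(60)
truths = [outcome_a, outcome_b]

# A sharp forecast commits to one realistic outcome (right half the time).
sharp = outcome_a
# A blurred forecast averages the outcomes, which is what minimizing a
# squared-error regression loss favors.
blurred = 0.5 * (outcome_a + outcome_b)

sharp_rmse = pooled_rmse(sharp, truths)
blurred_rmse = pooled_rmse(blurred, truths)

print(f"sharp:   RMSE={sharp_rmse:.4f}, peak={sharp.max():.2f}")
print(f"blurred: RMSE={blurred_rmse:.4f}, peak={blurred.max():.2f}")
```

In this toy setting the blurred forecast scores a lower pooled RMSE even though its peak is roughly half as strong as any outcome that can actually occur, which is the kind of trade-off the abstract describes for extremes.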