
KARINA: An Efficient Deep Learning Model for Global Weather Forecast (2403.10555v1)

Published 13 Mar 2024 in cs.LG, cs.AI, cs.CV, and physics.ao-ph

Abstract: Deep learning-based, data-driven models are gaining prevalence in climate research, particularly for global weather prediction. However, training on global weather data at high resolution requires massive computational resources. We therefore present a new model, KARINA, designed to overcome the substantial computational demands typical of this field. The model achieves forecasting accuracy comparable to higher-resolution counterparts with significantly fewer computational resources, requiring only 4 NVIDIA A100 GPUs and less than 12 hours of training. KARINA combines ConvNeXt, SENet, and Geocyclic Padding to enhance weather forecasting at a 2.5° resolution, which could filter out high-frequency noise. Geocyclic Padding preserves pixels at the lateral boundary of the input image, thereby maintaining atmospheric flow continuity on the spherical Earth. SENet dynamically improves feature response, advancing atmospheric process modeling, particularly for vertical column processes, which are represented as numerous channels. KARINA sets new benchmarks in weather forecasting accuracy, surpassing existing models such as the ECMWF S2S reforecasts at lead times of up to 7 days. Remarkably, KARINA achieved competitive performance even when compared with recently developed models (Pangu-Weather, GraphCast, ClimaX, and FourCastNet) trained on high-resolution data with 100 times as many pixels. In conclusion, KARINA advances global weather forecasting by efficiently modeling Earth's atmosphere with improved accuracy and resource efficiency.
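The abstract describes Geocyclic Padding only at a high level, so the following is a minimal sketch of one plausible reading, not the authors' implementation. It assumes a latitude-by-longitude grid stored as a list of rows (north to south), wraps the longitude edges periodically, and fills rows beyond each pole with the nearest rows shifted by 180° of longitude. The function name `geocyclic_pad`, the helper `over_pole`, and the polar treatment are all assumptions made for illustration.

```python
def geocyclic_pad(grid, pad):
    """Pad a lat-lon grid so that convolutions see a continuous sphere.

    Hypothetical sketch: `grid` is a list of rows (north to south), each a
    list of values along longitude. Not taken from the KARINA code base.
    """
    n_lat = len(grid)
    n_lon = len(grid[0])
    if pad < 0 or pad > n_lat or pad > n_lon:
        raise ValueError("pad must be between 0 and the grid dimensions")
    if n_lon % 2 != 0:
        raise ValueError("an even number of longitudes is assumed")
    half = n_lon // 2

    def over_pole(row):
        # Crossing a pole lands on the opposite meridian: shift by 180 degrees.
        return row[half:] + row[:half]

    # Rows beyond the north pole: nearest real row ends up adjacent to row 0.
    north = [over_pole(grid[i]) for i in range(pad - 1, -1, -1)]
    # Rows beyond the south pole: nearest real row ends up adjacent to the last row.
    south = [over_pole(grid[n_lat - 1 - i]) for i in range(pad)]
    tall = north + [list(row) for row in grid] + south

    # Longitude is periodic, so wrap the east and west edges around.
    return [row[n_lon - pad:] + row + row[:pad] for row in tall]


if __name__ == "__main__":
    demo = [[0, 1, 2, 3],
            [4, 5, 6, 7]]
    for row in geocyclic_pad(demo, 1):
        print(row)  # the padded grid has 4 rows and 6 columns
```

In a convolutional model, padding of this kind would be applied before each convolution in place of zero padding, so that the kernel never sees an artificial edge at the date line or the poles.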
