Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

AIFS -- ECMWF's data-driven forecasting system (2406.01465v2)

Published 3 Jun 2024 in physics.ao-ph

Abstract: Machine learning-based weather forecasting models have quickly emerged as a promising methodology for accurate medium-range global weather forecasting. Here, we introduce the Artificial Intelligence Forecasting System (AIFS), a data driven forecast model developed by the European Centre for Medium-Range Weather Forecasts (ECMWF). AIFS is based on a graph neural network (GNN) encoder and decoder, and a sliding window transformer processor, and is trained on ECMWF's ERA5 re-analysis and ECMWF's operational numerical weather prediction (NWP) analyses. It has a flexible and modular design and supports several levels of parallelism to enable training on high-resolution input data. AIFS forecast skill is assessed by comparing its forecasts to NWP analyses and direct observational data. We show that AIFS produces highly skilled forecasts for upper-air variables, surface weather parameters and tropical cyclone tracks. AIFS is run four times daily alongside ECMWF's physics-based NWP model and forecasts are available to the public under ECMWF's open data policy.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (16)
  1. Simon Lang (17 papers)
  2. Mihai Alexe (7 papers)
  3. Matthew Chantry (27 papers)
  4. Jesper Dramsch (6 papers)
  5. Florian Pinault (5 papers)
  6. Baudouin Raoult (4 papers)
  7. Christian Lessig (27 papers)
  8. Michael Maier-Gerber (4 papers)
  9. Linus Magnusson (6 papers)
  10. Ana Prieto Nemesio (1 paper)
  11. Peter D. Dueben (7 papers)
  12. Andrew Brown (31 papers)
  13. Florian Pappenberger (10 papers)
  14. Florence Rabier (2 papers)
  15. Mariana C. A. Clare (5 papers)
  16. Zied Ben Bouallègue (5 papers)
Citations (26)

Summary

  • The paper introduces a novel forecasting system leveraging graph neural networks and transformers to enhance medium-range weather predictions.
  • It details a modular encoder-processor-decoder architecture optimized via advanced parallelism for efficient high-resolution data processing.
  • Empirical results show improved forecast skill over traditional NWP models, notably in tropical cyclone tracking and upper-air analysis.

Artificial Intelligence Forecasting System (AIFS) for Weather Prediction

The presented paper introduces the Artificial Intelligence Forecasting System (AIFS), a data-driven weather forecasting model spearheaded by the European Centre for Medium-Range Weather Forecasts (ECMWF). AIFS leverages modern machine learning techniques to provide accurate medium-range global weather forecasts. The system is constructed upon a Graph Neural Network (GNN) leveraging a graph-based encoder-decoder structure and a sliding window transformer as the processing core. AIFS is trained on ECMWF's ERA5 re-analysis and operational numerical weather prediction (NWP) analyses, showcasing flexibility and modularity in design for high-resolution data processing.

Model Architecture and Design

AIFS’s architecture is notably modular, composed of an encoder-processor-decoder framework. The encoder abstracts information from the input grid to a lower resolution processor grid using a cut-off radius to maintain necessary spatial relationships. The model’s processor is an innovative adaptation of a pre-norm transformer with shifted window attention mechanisms, enabling proficient handling of spatial data without dependencies on explicit edge information. Enhancements include the use of a reduced Gaussian grid for higher efficiency, minimizing the variability near the poles, thus reducing data points and edges. Additional learnable parameters on nodes and edges further enhance model adaptability.

To optimize computational efficiency and memory use during training, AIFS employs data parallelism and tensor parallelism across multiple GPUs, performing efficient load balancing and memory allocation which is crucial for high-resolution data handling. The sequence parallelism strategy specifically facilitates advanced training methodologies such as rollout, which iteratively refines forecasts over multiple time steps.

Training Methodology

Training AIFS involves a phased approach beginning with pre-training on historical ERA5 data, followed by a rollout phase for more extended future forecasts up to 72 hours, and subsequent fine-tuning on real-time operational NWP analyses. The training strategy ensures efficient backpropagation through temporal sequences, improving longer lead-time predictions. The use of advanced optimization techniques such as AdamW within this setting allows for a more responsive learning process tailored for high-dimensional atmospheric data.

Empirical Results and Forecast Skill

AIFS exhibits superior performance in producing forecasts, especially for upper-air parameters and tropical cyclone tracking, achieving a lead-time advantage over current state-of-the-art NWP models such as the IFS. Robust verification against both NWP analyses and independent observational data, including radiosondes and SYNOP observations, substantiated the forecast accuracy across multiple domains. For instance, verification against Northern Hemisphere forecasts demonstrated significant improvements in metrics such as the anomaly correlation coefficient (ACC) and root mean square error (RMSE).

The model however does exhibit some blurring effects at extended lead times, a characteristic observed in many models optimizing based on mean squared error (MSE). Future improvements focused on refining loss functions or training with probabilistic objectives could ameliorate this.

Future Implications and Research Directions

The modularity and efficiency of AIFS not only make it an invaluable tool for real-time weather forecasting but also open paths for extensive research in probabilistic forecasting, enhancing ensemble-based predictions, and integrating observational data more intricately during inference stages. Additionally, extending its utility into long-range forecasting and fine-scale regional modeling are key areas indicated for future exploration.

The release of AIFS under an open-source framework will potentially propel further research collaborations, encouraging enhancements to ensemble forecasting capabilities following promising parallels in advancements seen in diffusion models and generative approaches.

In conclusion, AIFS stands as a notable advancement in the domain of data-driven weather forecasting, providing a robust framework for future explorations both in atmospheric sciences and the broader field of artificial intelligence applications.

Youtube Logo Streamline Icon: https://streamlinehq.com