
GenCast: Diffusion-Based Weather Forecasting

Updated 2 March 2026
  • GenCast is a generative diffusion model that predicts global medium-range and seasonal weather by iteratively denoising noisy atmospheric fields.
  • It employs a three-phase workflow—encoding of reanalysis inputs, noise injection with neural denoising, and ensemble generation—to capture forecast uncertainty.
  • The model outperforms traditional operational systems like ECMWF ENS, offering efficient, calibrated predictions and enhanced reliability for extreme weather events.

GenCast refers to two distinct, technically unrelated frameworks in the academic literature. The dominant and most cited usage is as a state-of-the-art diffusion-based ensemble forecasting model for probabilistic medium-range and seasonal weather prediction. In a separate domain, GenCast also appears as a shorthand for the Generalized Cascade Model in click modeling, a unifying EM-based framework for user click sequences in information retrieval. This article focuses primarily on GenCast as it pertains to generative meteorological forecasting, with a brief section on its click-modeling context.

1. Diffusion-Based Weather Forecasting with GenCast

GenCast is a conditional diffusion model for global medium-range and seasonal weather prediction. Developed and evaluated by Google DeepMind and collaborators, GenCast operates by learning to map meteorological initial states (from ERA5 reanalysis) and boundary conditions to the conditional distribution of future atmospheric states, producing calibrated probabilistic forecasts directly through neural inference rather than numerical integration of PDEs (Price et al., 2023, Antonio et al., 8 Sep 2025).

The core GenCast workflow consists of three principal phases (a minimal code sketch follows the list):

  1. Initialization and Encoding: The model ingests initial atmospheric fields—including geopotential height, temperature, humidity, and wind at multiple pressure levels—on a regular latitude–longitude grid, concatenated with boundary forcings and the requested forecast lead time.
  2. Diffusion Sampling and Denoising: Gaussian noise, drawn according to a prescribed schedule of variances, is injected into a latent representation. Through a sequence of $T$ neural-network–parameterized denoising steps (typically 20–40 in operational settings), the noisy candidate is iteratively refined toward the learned atmospheric data manifold.
  3. Ensemble Generation: The denoising process is repeated multiple times from independent noise seeds, yielding an ensemble of distinct stochastic forecast fields whose collective spread represents the model's epistemic uncertainty.
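
Below is a minimal sketch of this three-phase loop, assuming a Karras-style probability-flow ODE sampler with plain Euler steps. The `denoiser` callable, array shapes, and noise schedule are hypothetical stand-ins, not GenCast's actual implementation.

```python
import numpy as np

# Phase 2: start from pure noise and walk down a decreasing noise schedule.
# This Euler update follows the probability-flow ODE in the Karras
# parameterization; GenCast itself uses a higher-order solver (see Section 2).
def sample_forecast(denoiser, conditioning, noise_levels, rng):
    z = noise_levels[0] * rng.normal(size=conditioning.shape)
    for sigma, sigma_next in zip(noise_levels[:-1], noise_levels[1:]):
        z_hat = denoiser(z, sigma, conditioning)        # predicted clean residual
        z = z_hat + (sigma_next / sigma) * (z - z_hat)  # step to the next noise level
    return z

# Phase 3: independent noise seeds yield an ensemble of stochastic forecasts.
def generate_ensemble(denoiser, conditioning, noise_levels, n_members, seed=0):
    rng = np.random.default_rng(seed)
    return np.stack([sample_forecast(denoiser, conditioning, noise_levels, rng)
                     for _ in range(n_members)])
```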

This architectural and procedural framework generalizes across both medium-range (1–15 days) and seasonal (30–120 days) timescales by varying the autoregressive horizon and prescribed boundary inputs (Price et al., 2023, Antonio et al., 8 Sep 2025).

2. Mathematical Foundations and Model Structure

GenCast models the one-step generative transition as the conditional distribution $p(X^{t+1} \mid X^t, X^{t-1})$ using a diffusion process over a residual field. The forward process is defined by corrupting this residual with isotropic Gaussian noise:

Z^t_\sigma = Z^t + \varepsilon, \qquad \varepsilon \sim \mathcal{N}(0, \sigma^2 I)

where $Z^t$ denotes the normalized field difference (forecast increment), and $\sigma$ traces a predefined noise schedule. A preconditioned neural denoiser $D_\theta$ operates on this noisy input along with contextual fields. The denoising diffusion process is solved using a high-order ODE integrator (e.g., DPM-Solver++ (2S)), with 20–40 noise levels spanning the scheduled interval (Price et al., 2023, Savary et al., 23 Sep 2025).
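
As a concrete illustration, the sketch below constructs a decreasing Karras-style schedule and applies the forward corruption from the equation above; the constants (`sigma_min`, `sigma_max`, `rho`) are illustrative assumptions, not GenCast's published settings.

```python
import numpy as np

def noise_schedule(n_levels=30, sigma_min=0.02, sigma_max=80.0, rho=7.0):
    # Decreasing noise levels of the kind consumed by high-order ODE samplers
    # such as DPM-Solver++; all constants here are illustrative.
    ramp = np.linspace(0.0, 1.0, n_levels)
    inv_rho = sigma_max ** (1 / rho) + ramp * (sigma_min ** (1 / rho) - sigma_max ** (1 / rho))
    return inv_rho ** rho

def corrupt(residual, sigma, rng):
    # Forward process: Z_sigma = Z + eps, eps ~ N(0, sigma^2 I).
    return residual + sigma * rng.normal(size=residual.shape)
```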

The network architecture employs (a structural skeleton follows the list):

  • A latitude–longitude to icosahedral-mesh encoder via graph convolutions.
  • A 16-layer graph-transformer with self-attention (32-hop neighborhood, 512 feature dimension).
  • A decoder mapping from the mesh back to the regular grid (Price et al., 2023).
  • Explicit channel- and level-wise loss weighting for vertical and physical-variable balancing.
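
A structural skeleton of this encode–process–decode stack might look as follows; every component here is a hypothetical placeholder for the graph-convolutional encoder/decoder and graph transformer described in Price et al. (2023).

```python
# Skeleton only: the three callables passed in stand for the real modules.
class GenCastSkeleton:
    def __init__(self, grid_to_mesh, mesh_transformer, mesh_to_grid):
        self.encoder = grid_to_mesh        # lat-lon grid -> icosahedral mesh
        self.processor = mesh_transformer  # graph transformer on the mesh
        self.decoder = mesh_to_grid        # mesh -> lat-lon grid

    def __call__(self, noisy_residual, sigma, conditioning):
        mesh_state = self.encoder(noisy_residual, conditioning)
        mesh_state = self.processor(mesh_state, sigma)
        return self.decoder(mesh_state)    # denoised residual on the grid
```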

Mathematically, the overall loss combines denoising error and, in production instances, physics-based regularizers:

L_{\text{total}} = \mathbb{E}_{x_0, t}\left[\lVert x_0 - \hat{x}_0 \rVert^2\right] + \lambda_{\text{phys}}\,\mathbb{E}_{\hat{x}}\left[\lVert \nabla \cdot v(\hat{x}) \rVert^2 + \lVert \min(0, q(\hat{x})) \rVert^2 + \cdots\right]

where constraint terms (mass continuity, humidity non-negativity, etc.) enforce meteorological fidelity (Rathkopf, 11 Apr 2025).
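
A hedged sketch of such a combined objective, using centered finite differences as a crude stand-in for the real differential operators; the penalty weight and grid spacings are illustrative assumptions.

```python
import numpy as np

def physics_regularized_loss(x0, x0_hat, u, v, q, lam_phys=0.1, dx=1.0, dy=1.0):
    # Denoising term: squared error between true and reconstructed fields.
    denoise = np.mean((x0 - x0_hat) ** 2)

    # ||div v||^2 via finite differences, a proxy for mass continuity.
    div = np.gradient(u, dx, axis=-1) + np.gradient(v, dy, axis=-2)
    mass = np.mean(div ** 2)

    # ||min(0, q)||^2 penalizes negative specific humidity.
    humidity = np.mean(np.minimum(0.0, q) ** 2)

    return denoise + lam_phys * (mass + humidity)
```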

3. Training Regimen and Physical Constraints

GenCast is trained exclusively on gridded fields from the ERA5 reanalysis, a product in which nonlinear data assimilation has already reconciled all available observations with governing conservation laws. During training:

  • The principal objective is weighted mean-squared denoising over all grid points and variables.
  • Physics-informed penalties promote strict adherence to fundamental conservation principles (mass, energy, momentum), exclusion of nonphysical states (e.g., negative humidity), and satisfaction of known climatological symmetries.
  • Stochastic data perturbations and noise-inflation are used to enhance generalizability.

A key distinction is that GenCast is never trained directly against raw observations but always against reanalyses that have internalized all available measurement and physical information (Price et al., 2023, Rathkopf, 11 Apr 2025).
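
A single training step consistent with these points might look like the sketch below; the log-normal noise-level draw and the per-variable weights are assumptions in the spirit of standard diffusion training, not GenCast's published recipe.

```python
import numpy as np

def training_step(denoiser, residual, conditioning, var_weights, rng):
    # Draw a random noise level (the log-normal draw is an assumed choice).
    sigma = np.exp(rng.normal(-1.2, 1.2))
    noisy = residual + sigma * rng.normal(size=residual.shape)
    pred = denoiser(noisy, sigma, conditioning)   # predicted clean residual
    # Weighted MSE: average over lat/lon, then weight channels/levels.
    per_var_mse = np.mean((pred - residual) ** 2, axis=(-2, -1))
    return np.sum(var_weights * per_var_mse)
```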

4. Ensemble Forecasting, Uncertainty Quantification, and Error Screening

GenCast's stochastic sampling mechanism is designed to emulate forecast spread and quantify uncertainty in a manner analogous to operational NWP ensemble systems. For each initialization, GenCast generates $N$ ensemble members via independent noise seeds. Diagnostics include (see the sketch after this list):

  • The ensemble mean $\bar{f}(x)$, treated as the best central estimate.
  • The ensemble spread $S = \sqrt{\frac{1}{N}\sum_{i}(f_i - \bar{f})^2}$, representing epistemic uncertainty.
  • Comparison with the root-mean-square error (RMSE) of the ensemble mean to compute the spread–skill ratio $\mathrm{SSR} = S / \mathrm{RMSE}$, where $\mathrm{SSR} \approx 1$ signals optimal calibration.
  • Outlier suppression by discarding trajectories or regions where local deviations exceed expected spread (Rathkopf, 11 Apr 2025, Price et al., 2023).
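
The sketch below computes these diagnostics for an ensemble array; the 3-sigma outlier threshold is an illustrative choice, not a published GenCast setting.

```python
import numpy as np

def ensemble_diagnostics(ensemble, truth):
    """ensemble: (N, lat, lon) forecast members; truth: (lat, lon) verification."""
    mean = ensemble.mean(axis=0)
    spread = np.sqrt(((ensemble - mean) ** 2).mean(axis=0))
    rmse = np.sqrt(((mean - truth) ** 2).mean())
    ssr = spread.mean() / rmse                         # ~1 indicates good calibration
    outliers = np.abs(ensemble - mean) > 3.0 * spread  # per-member, per-point flags
    return mean, spread, ssr, outliers
```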

Operationally, this allows high-dimensional forecast fields to be systematically screened for "corrosive hallucinations"—highly anomalous patterns that would disrupt downstream decision processes (Rathkopf, 11 Apr 2025).

5. Performance Benchmarks and Practical Impact

GenCast outperforms or matches the ECMWF ENS (the global operational standard) on multiple key metrics (a generic CRPS estimator is sketched after this list):

  • Skill: CRPS and ensemble-mean RMSE are superior on 97.4% and 96% of medium-range forecast targets, respectively.
  • Extreme events: Reliability and economic value are significantly better for heatwaves, tropical cyclones, and wind-power volatility, e.g., up to a 1.9× increase in relative economic value for heatwave forecasts at 1-day lead.
  • Computational efficiency: A 50-member, 15-day global ensemble is generated in ~8 minutes on a single Cloud TPU v5, versus hours per member for ENS on 1000 CPU cores (Price et al., 2023).
  • Seasonal forecasting: Despite being trained for 12-h targets, GenCast, with persisted or observed SST forcing, achieves ACCs and NAO correlations approaching or exceeding SEAS5 in many extratropical and orographic regions. Probabilistic calibration is strong with observed ocean forcing, though underdispersive for persisted modes (Antonio et al., 8 Sep 2025).
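
For reference, the CRPS figures above can be estimated for an ensemble at a single location with the standard kernel estimator $\mathrm{CRPS} = \mathbb{E}|X - y| - \tfrac{1}{2}\,\mathbb{E}|X - X'|$; this is the generic formula, not DeepMind's evaluation code.

```python
import numpy as np

def crps_ensemble(members, obs):
    """members: (N,) ensemble values at one grid point; obs: scalar observation."""
    members = np.asarray(members, dtype=float)
    term1 = np.mean(np.abs(members - obs))
    term2 = 0.5 * np.mean(np.abs(members[:, None] - members[None, :]))
    return term1 - term2
```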

GenCast has also been demonstrated as a component in particle filter–based data assimilation, where plugging its pre-trained score-based diffusion sampler into a fully-adapted auxiliary particle filter (FA-APF) yields stable, nonlinear posterior ensembles in high dimensions (Savary et al., 23 Sep 2025).
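
A deliberately simplified sketch of this idea, assuming a Gaussian observation likelihood and multinomial resampling; the fully-adapted auxiliary filter in Savary et al. (23 Sep 2025) is substantially more involved.

```python
import numpy as np

def particle_filter_step(sample_transition, particles, obs, obs_std, rng):
    # Propagate each particle through the generative one-step forecast sampler.
    proposed = np.stack([sample_transition(p, rng) for p in particles])

    # Weight by an assumed Gaussian observation likelihood, then normalize.
    sq_err = np.sum((proposed - obs) ** 2, axis=tuple(range(1, proposed.ndim)))
    log_w = -0.5 * sq_err / obs_std ** 2
    w = np.exp(log_w - log_w.max())
    w /= w.sum()

    # Multinomial resampling back to an equally weighted ensemble.
    idx = rng.choice(len(proposed), size=len(proposed), p=w)
    return proposed[idx]
```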

6. Adversarial Robustness and Limitations

Recent analysis has highlighted novel adversarial vulnerabilities in autoregressive diffusion models such as GenCast. Tailored perturbations to assimilated observational data—statistically indistinguishable from natural noise but introduced via, e.g., a single satellite—can manipulate subsequent weather predictions, including fabrication of extreme meteorological phenomena. The threat model exploits GenCast's dependence on gridded analysis fields, which themselves derive from a global, multi-source data assimilation process. The diffusion architecture per se does not include internal robustness mechanisms against poisoned data; instead, all error handling is delegated to preprocessing (variational data assimilation) and inference-time outlier detection (Imgrund et al., 22 Apr 2025).
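
Schematically, such an attack can be framed as a norm-bounded perturbation search of the kind sketched below; `forecast_grad` is a hypothetical gradient oracle, and the published attack operates indirectly through assimilated observations rather than through this direct loop.

```python
import numpy as np

def perturb_analysis(analysis, forecast_grad, eps=0.01, step=0.002, iters=10):
    # Sign-gradient ascent on an attacker-chosen forecast objective,
    # keeping the perturbation within a noise-sized infinity-norm ball.
    delta = np.zeros_like(analysis)
    for _ in range(iters):
        delta += step * np.sign(forecast_grad(analysis + delta))
        delta = np.clip(delta, -eps, eps)
    return analysis + delta
```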

Limitations of GenCast and its current deployment include:

  • Lack of explicit mechanisms for missingness or quality-weighted assimilation of incomplete inputs.
  • Overconfidence in underdispersive ensembles for certain seasonal configurations unless boundary forcing is supplied at each step (Antonio et al., 8 Sep 2025).
  • Absence of dynamic ocean, land, or cryosphere coupling in its vanilla configuration; improved coupling is an ongoing area of research.

7. GenCast in Click Modeling: The Generalized Cascade Model

The name GenCast also appears in click-modeling literature as an abbreviation for the "Generalized Cascade Model" (GCM), an Input-Output Hidden Markov Model–based framework for modeling user click sequences. The GCM unifies a family of Dynamic Bayesian Network click models, including the classic "cascade" and Dependent Click Model, into a common EM-estimation formalism (Ruijt et al., 2021).

Key properties:

  • Latent binary decisions per position are aggregated as a hidden state $h_t$.
  • The model chain propagates via transition probabilities dependent on covariates (document features, previous clicks) and emits observables (clicks) via state-conditional probabilities.
  • Full EM (expectation–maximization) estimation is carried out efficiently with classic forward–backward recursions due to the IO-HMM structure (a toy sketch follows this list).
  • The approach generalizes and subsumes prior ad hoc click-model E-step derivations.
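
A toy forward recursion for such a binary-state chain, with illustrative parameters; real GCM estimation also runs the matching backward pass for the E-step.

```python
import numpy as np

def forward_loglik(clicks, p_attract, p_continue):
    """clicks: binary array; p_attract[t]: P(click | examining at t);
    p_continue[t]: P(still examining at t+1 | examining at t)."""
    alpha = np.array([0.0, 1.0])  # cascade assumption: the user starts examining
    loglik = 0.0
    for t, c in enumerate(clicks):
        # Emission: a click can only occur while examining.
        emit = np.array([1.0 - c,
                         p_attract[t] if c else 1.0 - p_attract[t]])
        alpha = alpha * emit
        loglik += np.log(alpha.sum())
        alpha = alpha / alpha.sum()
        # Transition: non-examining is absorbing; examining may continue.
        trans = np.array([[1.0, 0.0],
                          [1.0 - p_continue[t], p_continue[t]]])
        alpha = alpha @ trans
    return loglik
```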

No direct technical connection exists between this GenCast and the meteorological forecasting framework; the overlap is purely nominal.

