Papers
Topics
Authors
Recent
Search
2000 character limit reached

Improving Ensemble Forecasts of Abnormally Deflecting Tropical Cyclones with Fused Atmosphere-Ocean-Terrain Data

Published 31 Mar 2026 in cs.LG and cs.AI | (2603.29200v2)

Abstract: Deep learning-based tropical cyclone (TC) forecasting methods have demonstrated significant potential and application advantages, as they feature much lower computational cost and faster operation speed than numerical weather prediction models. However, existing deep learning methods still have key limitations: they can only process a single type of sequential trajectory data or homogeneous meteorological variables, and fail to achieve accurate forecasting of abnormal deflected TCs. To address these challenges, we present two groundbreaking contributions. First, we have constructed a multimodal and multi-source dataset named AOT-TCs for TC forecasting in the Northwest Pacific basin. As the first dataset of its kind, it innovatively integrates heterogeneous variables from the atmosphere, ocean, and land, thus obtaining a comprehensive and information-rich meteorological dataset. Second, based on the AOT-TCs dataset, we propose a forecasting model that can handle both normal and abnormally deflected TCs. This is the first TC forecasting model to adopt an explicit atmosphere-ocean-terrain coupling architecture, enabling it to effectively capture complex interactions across physical domains. Extensive experiments on all TC cases in the Northwest Pacific from 2017 to 2024 show that our model achieves state-of-the-art performance in TC forecasting: it not only significantly improves the forecasting accuracy of normal TCs but also breaks through the technical bottleneck in forecasting abnormally deflected TCs.

Summary

  • The paper presents AOT-TCNet, a multimodal deep learning model that fuses atmospheric, oceanic, and terrain data to improve tropical cyclone track and intensity forecasts, notably for abnormal deflections.
  • It employs a mode-adaptive mixture-of-experts architecture to decompose complex TC trajectories, reducing prediction error by 10–13 km in 6–18 hour forecasts.
  • Experimental results on the comprehensive AOT-TCs dataset show 53–71.8% improvement in intensity predictions, highlighting its potential for operational rapid-response deployment.

Improving Ensemble Forecasts of Abnormally Deflecting Tropical Cyclones with Fused Atmosphere-Ocean-Terrain Data

Motivation and Challenges

The increasing frequency and unpredictability of extreme tropical cyclone (TC) events in the context of global climate variability emphasize the urgent need for enhanced forecasting approaches. Traditional numerical weather prediction (NWP) models, despite their comprehensive physical basis, suffer from high computational costs and limited short-term accuracy, especially when rapid atmospheric changes and anomalous track behaviors are involved. Deep learning (DL)-based TC forecasting models have emerged with substantial computational efficiency and competitive predictive performance—yet, they typically process only homogenous meteorological or trajectory data and exhibit significant deficiencies in forecasting abnormally deflected TC tracks. These challenging cases occur under complex physical couplings across ocean, atmosphere, and terrain, leading to abrupt trajectory changes driven by heterogeneous environmental factors (Figure 1). Figure 1

Figure 1: Abnormal deflection of TC GEIMI during Taiwan landfall, illustrating terrain-induced venturi effect and eye dissolution.

Multimodal Dataset Construction

Central to this work is the creation of the AOT-TCs dataset, the most extensive and information-rich TC dataset for the Western North Pacific basin. This dataset integrates atmospheric variables (ERA5: wind fields, geopotential height at multiple pressure levels), oceanic observations (CODCv1: SST and SSS), and high-resolution terrain information (GEBCO: elevation data). It further includes essential non-spatial variables such as translational speed, historic movement direction, intensity variability, month, and the NiÑo3.4 index to capture ENSO-induced track variability.

AOT-TCs enables explicit modeling of physical mechanisms relevant to TCs, including terrain-lifted wind acceleration, anomalous deflection, and oceanic thermodynamic regulation, thereby supporting an unprecedented level of multidimensional meteorological analysis. The dataset spans from 1950 to 2024, offering complete coverage for holistic TC lifecycle studies.

AOT-TCNet Model Architecture

AOT-TCNet introduces a multimodal deep learning architecture explicitly coupling atmospheric, oceanic, and terrain variables for TC track and intensity forecasting. The model employs hypothesis-reflecting encoders based on LSTM, MLP, and 3D-UNet for feature extraction from sequential TC data, categorical/environmental variables, and spatiotemporal atmospheric fields, respectively. The cross-modal fusion mechanism aligns and concatenates these heterogeneous features, yielding a unified representation suitable for physically informed prediction.

The core innovation lies in the mode-adaptive Mixture-of-Experts (TMA-MoE) subarchitecture, which decomposes complex TC trajectory distributions into distinct sub-modes (e.g., straight, recurving, abruptly deflecting). A router network, implemented as a probabilistic 3-layer MLP, selects the most relevant expert for each input based on matching scores between predicted and ground truth trajectories. The experts, constructed as independent generative networks, specialize in distinct TC behavior modes. This prevents mode collapse, improves prediction robustness, and enhances the model's capacity to capture the multimodal nature of TC tracks. Figure 2

Figure 2: End-to-end architecture of AOT-TCNet with multimodal encoders and TMA-MoE for trajectory and intensity prediction.

Training Loss Design

The optimization process harmonizes reconstruction (Huber and MSE losses for trajectory and meteorological variables), distribution contrastive loss (to align generative output with ground truth distribution manifold), and a mode classification loss (to enforce expert specialization and semantic mode differentiation). The final joint objective leverages Jensen-Shannon Divergence (JSD) to both approximate data and maximize inter-expert diversity, crucial for accurate modeling of anomalous behaviors.

Experimental Evaluation and Comparative Results

Comprehensive evaluations conducted on the AOT-TCs dataset (2017–2024) demonstrate that AOT-TCNet delivers state-of-the-art accuracy in track and intensity forecasting, notably outperforming both traditional NWP models and deep learning baselines in ultra-short-term scenarios, especially for challenging abnormally deflected TCs.

  • In 6–18 hour forecasts, AOT-TCNet achieves trajectory errors 10–13 km lower than operational CMO baselines and maintains consistent advantages in intensity forecasting (53–71.8% improvement across all horizons).
  • At longer (24h) horizons, NWP models regain parity or slight superiority, a known limitation due to the inherent uncertainty and data-driven forecast constraints.
  • Advanced deep learning models developed originally for pedestrian/vehicle trajectory prediction (e.g., SGAN, MMSTN) fail to achieve comparable performance on TC forecasting, highlighting the unique environmental complexity of meteorology.

Ablation studies validate the synergistic effect of multimodal variables and TMA-MoE; the introduction of terrain and ocean variables leads to substantial accuracy gains in both track and intensity prediction, while multimodal expert routing further alleviates mode mixing and instability. Figure 3

Figure 3: Forecast results for selected TCs in 2024 with historical (red), ground truth (blue), and model predictions (green).

Figure 4

Figure 4: Ensemble forecasting for abnormally deflected TCs in 2024, capturing abrupt path shifts and projected trends.

Abnormally Deflecting TC Case Study

AOT-TCNet is specifically evaluated on abnormally deflected TC cases—defined as exhibiting >45° rightward or >30° leftward turns within 12 hours. Previous models tend to over-smooth predictions and fail to capture sharp turning dynamics. Leveraging multi-generator architectures and terrain information, AOT-TCNet achieves the best precision across all competitive methods, predicting both abrupt trajectory changes and ensemble uncertainty regions.

Detailed Case: Super TC YAGI (2024)

AOT-TCNet's performance is showcased on TC YAGI, which underwent multiple landfalls and abrupt track changes, causing severe impacts. Unlike operational models (e.g., NCEP-GFS, ECMWF-IFS), which exhibited systematic northward bias due to subtropical high initialization error, AOT-TCNet captured the westward movement accurately, aligning predictions with observed ground truth both in spatial track and intensity evolution. Figure 5

Figure 5

Figure 5

Figure 5

Figure 5

Figure 5

Figure 5

Figure 5

Figure 5

Figure 5

Figure 5

Figure 5

Figure 5

Figure 5

Figure 5

Figure 5

Figure 5: Time-sequential trajectory predictions for TC YAGI using AOT-TCNet—input (red), prediction (green), ground truth (blue).

Implications and Future Directions

This study provides a robust multimodal DL framework for coupled atmosphere-ocean-terrain TC forecasting, with explicit physical interpretability and competitive accuracy in challenging anomalous-track scenarios. The integration of TMA-MoE and the AOT-TCs dataset advance both methodological and operational capacity for rapid-response disaster mitigation.

Practically, the demonstrated robustness and efficiency suggest deployment in resource-constrained forecasting centers, enabling improved real-time prediction under climate change-induced variability and extremes. Theoretically, the architecture offers a template for further multi-physics fusion in high-impact weather events, and generative multi-expert design may extend to other domains characterized by strong multimodality and abrupt transitions.

Future research may converge on:

  • Extending multimodal fusion to include additional geophysical variables (e.g., hydrological, biospheric).
  • Upgrading ensemble prediction strategies with uncertainty quantification and probabilistic causality modeling.
  • Integrating online learning for real-time adaptation as environmental data streams evolve under warming climate conditions.

Conclusion

AOT-TCNet, leveraging the comprehensive AOT-TCs dataset and a physically coupled mode-adaptive mixture-of-experts architecture, sets a new benchmark in TC forecasting, especially for abnormally deflecting scenarios. The approach significantly advances both short-term predictive accuracy and physical consistency, facilitating reliable operational deployment and laying a foundation for further innovation in multimodal geophysical forecasting (2603.29200).

Paper to Video (Beta)

No one has generated a video about this paper yet.

Whiteboard

No one has generated a whiteboard explanation for this paper yet.

Open Problems

We found no open problems mentioned in this paper.

Collections

Sign up for free to add this paper to one or more collections.