DUALFloodGNN: Physically-Informed Flood Modeling
- The paper demonstrates that DUALFloodGNN achieves improved predictive accuracy and efficiency by integrating explicit hydrodynamic variables and physics-based conservation laws.
- DUALFloodGNN employs a unified message-passing mechanism and dynamic curriculum learning to enforce both global and local mass conservation for stable multi-step forecasts.
- Experimental results show significant reductions in node-volume, edge-flow, and water-depth RMSE alongside orders-of-magnitude faster inference compared to traditional numerical solvers.
DUALFloodGNN is a physics-informed graph neural network architecture designed to enable rapid and physically consistent operational flood modeling on unstructured spatial domains. It extends standard Encode–Process–Decode GNN frameworks with explicit representations of hydrodynamic quantities and conservation laws, achieving improved predictive accuracy and computational efficiency compared to previous flood modeling approaches that leverage deep learning on spatial graphs (Acosta et al., 30 Dec 2025).
1. Model Architecture
DUALFloodGNN represents the computational domain as a directed graph , where each node corresponds to a cell centroid in a 2D unstructured mesh, and each directed edge represents adjacency between cells. Node features combine static attributes (cell area, elevation) and dynamic variables (water volume , rainfall , upstream inflow , downstream outflow ) concatenated over previous time steps. Edge features encode static properties (channel width, slope) and dynamic flow .
The model does not directly forecast future states and but predicts their residual increments and , which are added to the latest state, improving stability in multi-step inference.
Input Construction:
Encoder Mapping:
Message Passing: For each layer , node and edge embeddings are jointly updated using messages:
A ReLU activation is applied at intermediate layers (except final), and biases are omitted in all MLPs to prevent spurious isolated-node volumes.
Decoding:
After process layers,
2. Physics-Informed Loss Formulations
Training is governed by a composite loss function incorporating both standard prediction errors and physics-informed penalties that enforce mass conservation.
Prediction Loss (Node and Edge MSE):
Global Mass Conservation:
Local Mass Conservation (Nodewise):
Construct weighted adjacency ,
Inflow and outflow per node : Local physical relation: Corresponding loss:
Total Physics-Informed Loss:
Single-Step Loss:
3. Joint Prediction of Volume and Flow
DUALFloodGNN employs a unified message-passing mechanism whereby the shared messages update both node and edge latent states, thus enforcing intrinsic coupling between water volume (scalar-like) and flow (vector-like) variables. Overwriting of edge embeddings with the same message that updates node states enables information about transport and accumulation to coexist within a single latent space. Distinct decoders for and yield the final predictions for nodes and edges, respectively. This methodology obviates the computational expense of line-graph constructions and produces physically consistent multi-output inference in a single forward pass.
4. Multi-Step Training and Dynamic Curriculum
The model is optimized not only for single-step predictions but also for accurate multi-step autoregressive rollout. During training, for batch time-steps , the model is unrolled into -step sequences, using its own predicted increments at each step for input construction. The rollout loss is the mean over the temporal sequence:
Dynamic curriculum learning is employed: training starts with , and only after convergence at each rollout horizon is the sequence length incremented by (typically ), at which point the learning rate decays by a factor . This staged progression stabilizes learning and mitigates error amplification in long-horizon autoregressive simulations.
5. Empirical Performance and Evaluation
On a Wollombi River test set comprising 1129 nodes, 2743 edges, 56 events, and using 14-fold cross-validation, DUALFloodGNN exhibits substantial quantitative improvements over alternative GNN architectures:
| Metric | DUALFloodGNN | Best Comparator | Relative Improvement |
|---|---|---|---|
| Node-volume RMSE | m³ | GINE m³ | |
| Edge-flow RMSE | $25.9$ m³/s | GraphSAGE edge: $44.0$ m³/s | |
| Water-depth RMSE | $0.21$ m | HydroGraphNet: $0.76$ m | |
| Flood CSI (0.05 m depth) | $0.90$ | HydroGraphNet: $0.69$; SWE-GNN: $0.47$ | |
| Inference time, 576 steps | s | HydroGraphNet: s | Orders faster than full numerical solvers |
Temporal RMSE plots confirm the persistence of low error across rollout, whereas competitors demonstrate more rapid error growth. Visual inundation maps show that DUALFloodGNN more closely tracks the ground truth, especially in the resolution of microscale lateral flows.
6. Implementation Details and Computational Scaling
The model is implemented in PyTorch and PyTorch-Geometric, with computation on NVIDIA V100 GPUs. Typical training uses GNN layers, MLPs with width 128, and dropout rate 0.1. Loss weights are optimized by grid search; for example . The curriculum step and learning rate decay are standard.
Training time is approximately 2 hours per cross-validation fold, with memory usage around 4 GB per V100 GPU. The method scales linearly with mesh size and is demonstrated to handle up to cells in multi-GPU environments.
7. Limitations and Prospective Extensions
Although validated on flow-dominated events in a single catchment, DUALFloodGNN's generalization to rainfall-dominated floods, urban drainage systems, or coastal flooding scenarios—including significant wave effects—remains to be established. The use of absolute-value physics losses produces sub-gradients only at zero; investigation of alternative (e.g., Huber-type) residual penalties is warranted. Representation of complex, time-dependent boundary conditions such as moving stage hydrographs is not yet incorporated, and would require further architectural developments. Integration with uncertainty quantification frameworks, for example Bayesian GNNs or model ensembles, is necessary for operational deployment with reliable confidence intervals.
The principal contribution of DUALFloodGNN lies in its joint encoding of hydrodynamic volume and flow variables, its explicit enforcement of global and local conservation laws through loss function design, and its robust training strategy for multi-step inference. Open-source code and data-processing resources are provided to support reproduction and adaptation (Acosta et al., 30 Dec 2025).