Non-intrusive Learning of Physics-Informed Spatio-temporal Surrogate for Accelerating Design

Published 15 Apr 2026 in cs.LG and physics.flu-dyn | (2604.14424v1)

Abstract: Most practical engineering design problems involve nonlinear spatio-temporal dynamical systems. Multi-physics simulations are often performed to capture the fine spatio-temporal scales which govern the evolution of these systems. However, these simulations are often high-fidelity in nature, and can be computationally very expensive. Hence, generating data from these expensive simulations becomes a bottleneck in an end-to-end engineering design process. Spatio-temporal surrogate modeling of these dynamical systems has been a popular data-driven solution to tackle this computational bottleneck. This is because accurate machine learning models emulating the dynamical systems can be orders of magnitude faster than the actual simulations. However, one key limitation of purely data-driven approaches is their lack of generalizability to inputs outside the training distribution. In this paper, we propose a physics-informed spatio-temporal surrogate modeling (PISTM) framework constrained by the physics of the underlying dynamical system. The framework leverages state-of-the-art advancements in the field of Koopman autoencoders to learn the underlying spatio-temporal dynamics in a non-intrusive manner, coupled with a spatio-temporal surrogate model which predicts the behavior of the Koopman operator in a specified time window for unknown operating conditions. We evaluate our framework on a prototypical fluid flow problem of interest: two-dimensional incompressible flow around a cylinder.

Abstract PDF Upgrade to Chat

Authors (2)

Summary

The paper introduces a framework using Koopman autoencoders and GP regression to emulate nonlinear dynamical systems with high computational speedup.
The paper demonstrates stable, physics-consistent predictions with relative errors below 10% for most test cases, even for unseen operating conditions.
The paper enables real-time surrogate modeling, reducing simulation times from 170 minutes to 3 seconds for complex fluid flow scenarios.

Physics-Informed Spatio-Temporal Surrogates for Fast Emulation of Dynamical Systems

Overview of the PISTM Framework

The paper introduces a non-intrusive Physics-Informed Spatio-Temporal Modeling (PISTM) framework to accelerate the emulation of nonlinear dynamical systems, with a specific focus on engineering design scenarios where high-fidelity multi-physics simulations are computationally expensive. The framework leverages Koopman autoencoders for encoding the underlying spatio-temporal dynamics and Gaussian process (GP) regression for rapid emulation at unseen operating conditions, enabling efficient surrogate modeling without requiring explicit knowledge of governing physical equations.

Figure 1: Schematic overview of the proposed physics-informed spatio-temporal modeling (PISTM) framework of dynamical systems.

Motivation and Positioning

Traditional deep learning architectures such as CNNs and LSTMs have been used for surrogate modeling of high-dimensional spatio-temporal fields but suffer from limited generalizability outside their training distributions due to the absence of explicit physical constraints. On the other hand, physics-informed neural networks (PINNs) enforce governing equations during training, but their dependence on explicit PDEs limits applicability when such equations are only partially or not at all known. The paper addresses this limitation by embedding physics-informed constraints through Koopman operator theory, which linearizes the evolution of nonlinear systems in a transformed latent space, thus extending generalizability to previously unseen operating and boundary conditions.

Methodological Advances

The proposed methodology consists of several distinct stages:

Training Data Generation: Employs Latin Hypercube Sampling to select operating conditions (e.g., Reynolds numbers $Re$ ) for which high-fidelity simulations are performed to generate training datasets.
Koopman Autoencoder Training: A distinct Koopman autoencoder is trained for each operating condition, encoding the forward and backward system dynamics and enforcing a physics-based stability constraint. Each autoencoder provides a consistent latent-space representation of the temporal evolution.
Reduced Order Modeling (ROM): A convolutional autoencoder is trained on Koopman-generated predictions from all training conditions to yield an efficient low-dimensional latent space representation, paired with a decoder that reconstructs the full field from this code.
Latent Space Emulation with GPs: For unseen operating conditions, a Gaussian process regression model predicts the latent coefficients of the ROM as a function of operating condition and time. GP-based interpolation is selected due to its robustness in regimes of limited training data and its capability to provide uncertainty quantification.
Real-time Prediction: For an arbitrary new operating condition, the predicted latent codes are decoded to reconstruct the surrogate field dynamics over the desired temporal horizon, eliminating the need for expensive simulations or test-phase Koopman learning.

This approach is illustrated concisely in Figure 1.

Quantitative and Qualitative Results

The PISTM framework is validated on the canonical problem of two-dimensional incompressible viscous flow around a circular cylinder, solved with the Lattice Boltzmann Method. The task involves forecasting the velocity field for previously unseen $Re$ values, given historical data for a different subset of $Re$ . The evaluation metrics include three relative prediction errors: between (i) true and emulated data ( $\varepsilon_E$ ), (ii) Koopman prediction and emulated data ( $\varepsilon_{KE}$ ), and (iii) true data and Koopman prediction ( $\varepsilon_K$ ), over prediction horizons up to 10 timesteps.

Figure 2: Comparison of $\varepsilon_E$ , $\varepsilon_{KE}$ , and $\varepsilon_K$ for 5 test conditions and 45 training conditions as a function of $Re$ .

Empirical results demonstrate that the proposed framework maintains $Re$ 0 for all test cases except one, which has sparse training coverage in its $Re$ 1 range. Importantly, the prediction errors do not diverge with time due to the physics-constrained structure of the Koopman layer, contrasting with the well-known drift encountered in physics-agnostic models.

A qualitative comparison (see Figure 3) further illustrates that the spatial distribution and magnitude of the surrogate's prediction errors are nearly indistinguishable from the errors made by Koopman predictors that were trained on the ground-truth historical data for the test condition (a scenario not available in practice for truly unseen test cases). Crucially, the entire PISTM pipeline delivers predictions $Re$ 2 times faster than full high-fidelity CFD, with a single test case evaluated in 3 seconds versus 170 minutes for a full simulation.

Figure 3: Qualitative comparison among true data, Koopman predictions, emulated data, and their error maps for test case $Re$ 3 at $Re$ 4.

Implications and Discussion

The non-intrusive nature of the PISTM framework represents a key practical advantage for high-value engineering contexts (e.g., design optimization, uncertainty quantification, rapid virtual prototyping), where the cost of new high-fidelity simulations at arbitrary conditions is prohibitive. By leveraging Koopman-based latent dynamics and GP surrogate modeling, the method ensures physics-consistent, time-stable extrapolation to new operational regimes. The architecture is sufficiently generic to be applicable beyond fluid flow systems, potentially extending to any nonlinear dynamical system for which only partial or implicit physical knowledge is available.

Theoretically, the fusion of Koopman operator theory and probabilistic ROM interpolation through GPs enables balancing physical consistency, data efficiency, and computational tractability. In contrast to PINNs, there is no requirement for explicit PDEs or handcrafted physical residuals; all that is necessary is access to historical field snapshots for a set of operating conditions.

The strong numerical results highlight the robust generalization capabilities, given sufficiently informative training coverage in the operating condition space. However, performance degradation is observed where the training data are sparse, indicating a need for adaptive sampling or active learning strategies to optimize training set selection in future work.

Future Directions

Extension of the framework to multi-physics and multi-scale environments, integration with adaptive DoE/active learning, or hierarchical surrogate modeling (e.g., multi-fidelity fusions) are immediate next steps. Additional prospects include uncertainty quantification for risk-averse optimization and extension to control or inverse design tasks. Incorporating richer structure in the GP emulator, such as deep kernel learning, or augmenting the Koopman autoencoders with attention mechanisms or domain decomposition strategies, could further enhance generalizability and performance across even more complex problem classes.

Conclusion

The PISTM framework utilizes Koopman autoencoders and Gaussian process regression to deliver fast, stable, and physically-consistent spatio-temporal surrogate modeling for nonlinear dynamical systems without requiring direct access to governing equations. The obtained surrogates demonstrate low generalization error on previously unseen conditions and provide significant computational speedup, offering a scalable pathway for accelerating the solution of expensive simulation-driven engineering design and analysis tasks (2604.14424).

Markdown Report Issue