
Multi-step EDMD for Nonlinear Systems

Updated 24 January 2026
  • Multi-step EDMD is a data-driven framework that leverages Koopman operator theory to approximate nonlinear dynamics via linear representations in a lifted function space.
  • It minimizes long-horizon errors by directly learning multi-step state outputs, avoiding error compounding seen in recursive single-step methods.
  • The framework employs advanced dictionary learning and sparsity techniques, yielding robust surrogates applicable to density forecasting, system identification, and model predictive control.

The multi-step Extended Dynamic Mode Decomposition (EDMD) framework is a family of data-driven numerical methods for forecasting, modeling, and controlling nonlinear dynamical systems via linear approximations in function space. By leveraging the action of the Koopman operator on a finite set of observables, multi-step EDMD enables high-fidelity multi-horizon prediction and robust surrogate model construction for stochastic, deterministic, and controlled systems. Key advantages over classical single-step EDMD include direct minimization of long-horizon errors, avoidance of error compounding under recursive prediction, and improved tractability for real-time and control applications.

1. Mathematical Foundations and Koopman Lifting

The multi-step EDMD framework generalizes the spectral approach of Koopman operator theory to nonlinear systems, including both deterministic maps and stochastic differential equations (SDEs) with diffusion. Consider a continuous-time SDE

$$dX_t = b(X_t)\,dt + \sigma(X_t)\,dW_t$$

with initial distribution $X_0 \sim p_0(x)$, drift $b:\mathbb{R}^d\to\mathbb{R}^d$, and noise $\sigma:\mathbb{R}^d\to\mathbb{R}^{d\times s}$, or a discrete-time system $x_{t+1}=f(x_t, u_t)$. The associated stochastic Koopman semigroup $\{K^t\}$ acts on observables $\psi$ by $K^t\psi(x) = \mathbb{E}[\psi(X_t^x)]$, defining a linear semigroup generated by the infinitesimal generator

$$\mathcal{L}v(x) = b(x)\cdot\nabla v(x) + \frac{1}{2}\Sigma(x):\nabla^2 v(x),\quad \Sigma(x)=\sigma(x)\sigma(x)^\top,$$

with Fokker–Planck adjoint $\mathcal{L}^*p = -\nabla\cdot[b\,p] + \frac{1}{2}\nabla\cdot\nabla\cdot[\Sigma p]$ (Zhao et al., 2022). Under ergodicity, there exists a unique invariant density $p_*(x)$, and the inner product of the Hilbert space $L^2(\mathbb{R}^d; p_*)$ governs the spectral structure of $K^t$ and $\mathcal{L}$.
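As a concrete check of these formulas, consider the 1-D Ornstein–Uhlenbeck process (a standard textbook example, not drawn from the cited papers):

```latex
\text{SDE: } dX_t = -\alpha X_t\,dt + \sqrt{2\beta}\,dW_t
  \quad\Rightarrow\quad b(x) = -\alpha x,\quad \Sigma(x) = 2\beta. \\
\text{Generator: } \mathcal{L}v = -\alpha x\,v'(x) + \beta\,v''(x), \qquad
\text{Fokker--Planck: } \mathcal{L}^* p = \partial_x\!\left(\alpha x\,p\right) + \beta\,\partial_x^2 p. \\
\text{Setting } \mathcal{L}^* p_* = 0 \text{ gives } \alpha x\,p_* + \beta\,p_*' = 0,
\text{ i.e. } p_*(x) \propto e^{-\alpha x^2/(2\beta)}.
```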

Deterministic systems use analogous constructions, with observables $\{\psi_j\}_{j=1}^N$ spanning a finite-dimensional subspace, and evolution in this lifted space governed by approximations of the infinite-dimensional Koopman operator (Schurig et al., 10 Nov 2025, Meda et al., 4 Apr 2025).

2. Finite-Dimensional EDMD Approximations and Multi-Step Forecasting

The core idea of EDMD is to approximate $K^{\Delta t}$ by a finite-dimensional matrix acting on the lifted state, using empirical data:

  • Snapshot pairs $\{(x_k, y_k)\}_{k=1}^M$ are generated via the dynamics and possible control inputs.
  • The observables define the lift $\Psi(x)=[\psi_1(x),\dots,\psi_m(x)]^\top$; the data matrices are $\Psi(X), \Psi(Y)\in\mathbb{R}^{m\times M}$.
  • The Gram matrices are $G = \frac{1}{M}\sum_{k=1}^M \Psi(x_k)\Psi(x_k)^\top$ and $A = \frac{1}{M}\sum_{k=1}^M \Psi(y_k)\Psi(x_k)^\top$.
  • The Koopman approximation is $K = A\,\mathrm{pinv}(G)$, with eigenpairs $(\mu_j, \xi_j)$ giving approximate eigenvalues and eigenfunctions (Zhao et al., 2022, Meda et al., 4 Apr 2025).
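The assembly of $G$, $A$, and $K$ above can be sketched directly in NumPy. The 2-D map and monomial dictionary below are illustrative choices, not taken from the cited papers; the dictionary $[x_1, x_2, x_1^2]$ spans an exactly Koopman-invariant subspace for this map, so the recovered eigenvalues $\{\lambda, \mu, \lambda^2\}$ can be verified in closed form:

```python
import numpy as np

# Toy map with an exactly Koopman-invariant polynomial dictionary:
#   x1+ = lam*x1,   x2+ = mu*x2 + (lam^2 - mu)*x1^2
lam, mu = 0.9, 0.5

def f(x):
    return np.array([lam * x[0], mu * x[1] + (lam**2 - mu) * x[0] ** 2])

def lift(x):
    # Dictionary Psi(x) = [x1, x2, x1^2]
    return np.array([x[0], x[1], x[0] ** 2])

rng = np.random.default_rng(0)
X = rng.uniform(-1, 1, size=(200, 2))        # snapshot states x_k
Y = np.array([f(x) for x in X])              # successors y_k = f(x_k)

PsiX = np.array([lift(x) for x in X]).T      # m x M lifted data matrices
PsiY = np.array([lift(y) for y in Y]).T
M = X.shape[0]

G = PsiX @ PsiX.T / M                        # Gram matrix
A = PsiY @ PsiX.T / M                        # cross-correlation matrix
K = A @ np.linalg.pinv(G)                    # finite-dimensional Koopman matrix

eigvals = np.sort(np.linalg.eigvals(K).real)
```

Because the lifted subspace is invariant under this particular map, `eigvals` reproduces $\{0.5, 0.81, 0.9\}$ up to round-off; for generic systems the computed eigenvalues are only approximations of the Koopman spectrum.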

Multi-step prediction proceeds by recursive iteration:

$$z_{k+1} = Kz_k + Bu_k,\quad \hat{x}_{k+1} = C z_{k+1},\quad \hat{y}_{k+1} = E z_{k+1},$$

where the lifted state is $z_k = \Psi(x_k)$, $C$ selects physical states from the lifted coordinates, and $E$ recovers outputs or measurements (Meda et al., 4 Apr 2025). In stochastic density forecasting, spectral decompositions using $K$ generate multi-step probability evolution via $K^n$ or eigenmode propagation (Zhao et al., 2022).
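With $K$ and $C$ in hand, the recursion is a loop of matrix–vector products. The sketch below uses the autonomous case ($B = 0$); the dictionary and the hard-coded $K$ and $C$ correspond to a toy 2-D polynomial map and are illustrative assumptions, not values from the cited works:

```python
import numpy as np

def lift(x):
    # Example dictionary Psi(x) = [x1, x2, x1^2]
    return np.array([x[0], x[1], x[0] ** 2])

# Lifted dynamics z+ = K z for x1+ = 0.9 x1, x2+ = 0.5 x2 + 0.31 x1^2
K = np.array([[0.9, 0.0, 0.00],
              [0.0, 0.5, 0.31],
              [0.0, 0.0, 0.81]])   # (x1^2)+ = (0.9 x1)^2 = 0.81 x1^2
C = np.array([[1.0, 0.0, 0.0],    # C selects the physical states from z
              [0.0, 1.0, 0.0]])

def predict(x0, n_steps):
    """Recursive multi-step forecast: lift once, iterate K, project with C."""
    z = lift(x0)
    traj = []
    for _ in range(n_steps):
        z = K @ z
        traj.append(C @ z)
    return np.array(traj)

trajectory = predict(np.array([0.8, -0.3]), n_steps=3)
```

Because this toy dictionary is exactly invariant, the recursion is exact here; in general, recursive iteration compounds the one-step projection error, which is precisely what the direct multi-step identification of Section 4 avoids.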

3. Dictionary Design, Manifold Optimization, and Sparsity

The expressiveness and generalization of EDMD surrogates depend critically on the choice of observables. Standard bases include monomials, radial basis functions (RBFs), and neural networks (Meda et al., 4 Apr 2025). Dictionary learning is further refined by geometric optimization on the Grassmann manifold: selecting a subspace $V=\mathrm{span}\{\psi_1,\dots,\psi_M\}$ that minimizes multi-step forecast error via Riemannian optimization yields approximately invariant, low-dimensional subspaces that are robust to out-of-domain states (Schurig et al., 10 Nov 2025). The error metric integrates projection error over finite horizons and test initial conditions, and the descent is carried out on the Grassmannian using Riemannian gradients and QR-based retraction.
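A minimal Gaussian-RBF dictionary of the kind mentioned above can be sketched as follows; the centers and width $\gamma$ are arbitrary illustrative choices. Including the constant and the state itself in the lift makes the reconstruction map $C$ a simple selection matrix:

```python
import numpy as np

def make_rbf_dictionary(centers, gamma):
    """Build Psi: x -> [1, x, exp(-gamma*||x - c_i||^2) for each center c_i]."""
    centers = np.asarray(centers, dtype=float)

    def lift(x):
        x = np.asarray(x, dtype=float)
        rbf = np.exp(-gamma * np.sum((centers - x) ** 2, axis=1))
        return np.concatenate(([1.0], x, rbf))

    return lift

centers = [[-1.0, 0.0], [0.0, 0.0], [1.0, 0.0]]
lift = make_rbf_dictionary(centers, gamma=2.0)
z = lift([0.0, 0.0])   # dictionary dimension: 1 + 2 states + 3 RBFs = 6
```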

Explicit dictionary pruning and structure discovery are enabled by including $\ell_1$-type regularization in least-squares identification, which removes irrelevant observables and yields parsimonious, efficient surrogates (Wu et al., 17 Jan 2026). Parallel decomposition across states, steps, and dictionary rows supports scalable computation.
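One generic way to realize such an $\ell_1$-regularized fit is proximal gradient descent (ISTA) with soft-thresholding. The sketch below uses synthetic data in which the target depends on only two of five observables; it is a textbook solver, not the parallel decomposition scheme of (Wu et al., 17 Jan 2026):

```python
import numpy as np

def soft_threshold(z, t):
    """Proximal operator of t*||.||_1 (entrywise soft-thresholding)."""
    return np.sign(z) * np.maximum(np.abs(z) - t, 0.0)

def ista(Phi, Y, tau, n_iter=500):
    """Minimize 0.5*||Y - T @ Phi||_F^2 + tau*||T||_1 by proximal gradient."""
    L = np.linalg.norm(Phi @ Phi.T, 2)      # Lipschitz constant of the gradient
    T = np.zeros((Y.shape[0], Phi.shape[0]))
    for _ in range(n_iter):
        grad = (T @ Phi - Y) @ Phi.T
        T = soft_threshold(T - grad / L, tau / L)
    return T

rng = np.random.default_rng(0)
Phi = rng.normal(size=(5, 300))             # lifted data: 5 observables, 300 samples
T_true = np.array([[1.0, -2.0, 0.0, 0.0, 0.0]])  # only two observables matter
Y = T_true @ Phi
T_hat = ista(Phi, Y, tau=5.0)
support = np.abs(T_hat[0]) > 1e-3           # surviving dictionary entries
```

In this toy problem the three irrelevant coefficients are thresholded to zero, identifying the dictionary rows that can be pruned.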

4. Multi-Step Least-Squares Identification and Error Control

A key advance of the multi-step EDMD framework is direct learning of the condensed multi-step state-output map. Rather than identifying an operator $A$ for single-step propagation and composing $A^k$ (which leads to error compounding and potential instability for $\Vert A\Vert_2 > 1$), the framework fits, for each prediction step $k$, a linear map from dictionary lifts and past inputs to the $k$-step-ahead state:

$$x_{j,k} = E_k \psi(x_{j,0}) + F_k [u_{j,0};\dots;u_{j,k-1}] + \mathrm{residual}.$$

The identification is performed via convex least-squares:

$$\min_{E_k,F_k} \sum_{j=1}^M \|x_{j,k} - E_k\psi(x_{j,0}) - F_k[u_{j,0};\dots;u_{j,k-1}]\|_2^2 + \beta\|[E_{k};F_{k}]\|_2^2 + \tau\|E_{k}\|_1,$$

with row-wise $\ell_1$ sparsity for dictionary pruning (Wu et al., 17 Jan 2026). This produces stable, direct multi-step surrogates whose error does not grow with prediction horizon $k$. Theorem 3 in (Wu et al., 17 Jan 2026) provides, under Sobolev regularity and bounded-dictionary assumptions, high-probability error bounds independent of $k$:

$$\|f_i^{(k)} - E_{k,i}^\top\psi\|_H \leq C_{ms}\left(p^{-s}\|f_i^{(k)}\|_{H^s} + \sqrt{N^2\log M/M}\right),$$

where $N$ is the dictionary size, $M$ the number of data points, and $p$ the degree of the polynomial lift. In contrast, one-step EDMD bounds exhibit error blow-up exponential in $k$ when $\|K\|_H > 1$.
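For a fixed horizon $k$, the identification above reduces to an ordinary least-squares problem; the sketch below uses only the ridge term (i.e. $\tau = 0$, no $\ell_1$ pruning). The scalar system, dictionary, and constants are illustrative assumptions, not taken from (Wu et al., 17 Jan 2026):

```python
import numpy as np

def f(x, u):
    # Illustrative controlled scalar dynamics (an assumption for this sketch)
    return 0.9 * x - 0.1 * x ** 3 + 0.5 * u

def psi(x):
    return np.array([x, x ** 2, x ** 3])   # monomial dictionary, N = 3

k, M, beta = 5, 500, 1e-6                  # horizon, sample count, ridge weight
rng = np.random.default_rng(1)

Phi, Xk = [], []                           # regressors and k-step-ahead targets
for _ in range(M):
    x0 = rng.uniform(-1, 1)
    u = rng.uniform(-0.5, 0.5, size=k)
    x = x0
    for ui in u:
        x = f(x, ui)
    Phi.append(np.concatenate([psi(x0), u]))   # [psi(x_{j,0}); u_{j,0..k-1}]
    Xk.append(x)                               # x_{j,k}

Phi = np.array(Phi).T                      # (N + k) x M regressor matrix
Xk = np.array(Xk)

# Ridge solution for theta = [E_k, F_k]
theta = Xk @ Phi.T @ np.linalg.inv(Phi @ Phi.T + beta * np.eye(Phi.shape[0]))
Ek, Fk = theta[:3], theta[3:]

# One-shot k-step prediction: no recursion through intermediate steps
x_hat = Ek @ psi(0.7) + Fk @ np.full(k, 0.1)
```

Each horizon $k$ yields its own pair $(E_k, F_k)$, and the fits for different $k$ are independent of one another, which is what makes the identification trivially parallelizable across steps.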

5. Applications, Numerical Performance, and Practical Considerations

Multi-step EDMD has been demonstrated for prediction, nonlinear system identification, density forecasting, and model predictive control (MPC) across a wide range of systems:

  • Density forecast for SDEs: Approximates the semigroup generated by the Fokker–Planck operator, achieving weak convergence of the density representation as data and dictionary sizes increase (Zhao et al., 2022).
  • Nonlinear system surrogates: For electric vehicle (EV) cabin climate, multi-step EDMD with RBF dictionaries achieves state RMSE $<5\%$ and power RMSE $<3\%$ with $N=35$ basis functions, outperforming polynomial and neural-network dictionaries at small $N$ (Meda et al., 4 Apr 2025).
  • Koopman MPC: Direct multi-step identification solves the full condensed map for state and control over the control horizon $T$, enabling stable closed-loop control where one-step EDMD leads to diverging errors and controller instability. In benchmarks, multi-step EDMD achieves stable MSE in both open and closed loop for $k\gg 10$, while one-step MSE diverges for $k>10$ due to a spectral radius $>1$ (Wu et al., 17 Jan 2026).
  • Dictionary learning: Grassmannian-based shaping yields low-dimensional surrogates with out-of-domain generalization and large speedups in predictor evaluation (Schurig et al., 10 Nov 2025).

Per-step computational costs are governed by $O(N^2)$ matrix–vector products and $O(N\phi)$ dictionary evaluation (with $\phi$ the cost per basis function). For $N\sim 30$–$40$, real-time execution on embedded platforms is practical (Meda et al., 4 Apr 2025).

6. Algorithmic and Implementation Summary

A generic multi-step EDMD workflow proceeds as:

  1. Dictionary selection: Choose functions $\{\psi_j\}$—polynomial, RBF, or learned—appropriate to system structure and data availability.
  2. Data acquisition: Collect snapshot (state, next-state) or (trajectory) data under representative state and control distributions.
  3. Lifting and matrix assembly: Formulate $\Psi(X)$, $\Psi(Y)$ and associated Gram matrices; assemble multi-step input-output structures for supervised learning.
  4. Identification: Solve (regularized) least-squares for operator matrices, possibly under parallelizable decomposition across step $k$ and state $i$; perform dictionary pruning as necessary.
  5. Prediction: For a given initial condition and input sequence, propagate via

$$z_{k+1} = K z_k + B u_k,\quad\text{or}\quad x_{k} = E_k \psi(x_0) + F_k [u_0;\ldots;u_{k-1}],$$

up to the desired horizon, and reconstruct the state and output via the $C$ and $E$ matrices.

  6. Validation: Assess with trajectory RMSE, Consistency Index (CI), or finite-horizon forecast errors; compare performance with alternative dictionaries or identification schemes (Zhao et al., 2022, Meda et al., 4 Apr 2025, Schurig et al., 10 Nov 2025, Wu et al., 17 Jan 2026).

When relevant (e.g., Grassmannian dictionary learning), implement Riemannian optimization to shape the active observables for improved multi-step invariance and forecast accuracy (Schurig et al., 10 Nov 2025). Closed-loop extensions with linear MPC/LQR, or online adaptation via recursive least squares, follow directly from the lifted linear structure of the surrogate.

7. Convergence Guarantees and Limitations

Convergence properties are established in terms of increasing data ($M\to\infty$), basis-set expressiveness ($m, N\to\infty$), and regularity of the lifting functions. For stochastic systems, EDMD approximations converge to the Galerkin projection of the Fokker–Planck semigroup in $L^2(p_\ast)$, and the truncated density forecast $p_m(x,t)$ converges weakly to the true solution (Zhao et al., 2022). In deterministic and control settings, multi-step EDMD avoids the error blow-up inherent to one-step recursion, with finite-sample error scaling optimally in dictionary size and sample count (Wu et al., 17 Jan 2026). Overfitting and lack of generalization can arise with excessive dictionary dimension, motivating Grassmannian subspace shaping and sparsity regularization (Schurig et al., 10 Nov 2025, Wu et al., 17 Jan 2026).

A plausible implication is that careful system-specific dictionary design, appropriate regularization, and direct multi-horizon identification are essential for the robust practical application of EDMD in high-dimensional settings and real-time control scenarios.
