Koopman Autoencoder & AIKAE
- Koopman autoencoder is a neural latent-variable model that uses Koopman operator theory to map nonlinear dynamics into a linearly evolving latent space.
- The AIKAE variant augments an invertible encoder with additional non-invertible components to boost expressiveness while maintaining exact state reconstruction.
- These models facilitate interpretable modal analysis, efficient surrogate modeling, and robust long-term predictions in applications such as weather forecasting and satellite data assimilation.
A Koopman autoencoder is a class of neural latent-variable models that enforces linear evolution in the learned latent space, in accordance with Koopman operator theory, for modeling nonlinear dynamical systems. These models aim to learn nonlinear state embeddings (“liftings”) in which time evolution is linear, enabling direct multi-step prediction, interpretable mode analysis, and efficient surrogate modeling for a wide range of physical, spatiotemporal, and data-driven systems. A recent major development within this class is the Augmented Invertible Koopman AutoEncoder (AIKAE), which resolves representational limitations of previous invertible Koopman autoencoders by augmenting their dimension with a non-invertible subspace, while maintaining exact state reconstruction and globally linear latent dynamics (Frion et al., 17 Mar 2025).
1. Theoretical Foundations
Koopman operator theory provides a framework to linearize nonlinear dynamical systems via observables. For a system of the form
there exists an infinite-dimensional linear operator, the Koopman operator , such that for any observable : In practice, the goal is to find a finite set of observables (or, in modern terms, a neural encoder ) that spans an approximately invariant subspace for , such that the finite-dimensional approximation
holds. A Koopman autoencoder further includes a learned decoder to reconstruct the state, jointly optimizing , 0, and 1 to minimize reconstruction and prediction losses: 2
Invertible Koopman autoencoders (IKAEs) adopt a bijective normalizing flow for the encoder, with an analytically invertible decoder, achieving exact reconstruction and tractable latent density modeling (Frion et al., 17 Mar 2025, Tayal et al., 2023).
2. AIKAE Architecture
The AIKAE introduces an augmented latent space to improve expressiveness beyond the input dimension, circumventing the bottleneck induced by invertibility constraints in prior IKAEs. The architecture is defined as follows:
- Invertible encoder 3: 4 is composed of 5 coupling layers (RealNVP/affine transformations), each operating as
6
or
7
where 8 are MLPs. Invertibility is exact, with tractable computation of the Jacobian determinant.
- Non-invertible augmentation encoder 9: 0 is an arbitrary feed-forward network, producing additional latent coordinates 1.
- Combined latent state: Form 2, with 3. Latent dynamics proceed via
4
Only the invertible coordinates are decoded:
5
3. Objective Functions and Optimization
AIKAE is optimized by joint minimization of loss terms:
- Prediction loss (long-term forecast):
6
- Linearity loss (Koopman invariance):
7
- Orthogonality/norm-preservation loss:
8
or regularization directly on 9:
0
- (Optionally) Flow log-likelihood (if modeling with stochastic flows):
1
Total loss combines these terms, and end-to-end training is performed with Adam. Multi-step prediction horizons and curriculum strategies are used to ensure both short- and long-horizon forecast quality (Frion et al., 17 Mar 2025).
4. Comparison to Other Koopman Autoencoder Variants
The following table summarizes main Koopman AE variants and key distinguishing features:
| Model | Encoder Type | Latent Dim | Decoder | Key Losses | Constraints |
|---|---|---|---|---|---|
| KAE | General NN (MLP) | 2 | MLP | Rec, Lin, Pred | None |
| IKAE | Invertible Flow (RealNVP, NICE) | 3 | 4 | Lin, Pred | Bijectivity, 5 |
| AIKAE | Invertible + Non-invertible | 6 | 7 | Pred, Lin, Orth | Partial bijectivity |
| CKAE | MLP / Linear | 8 | MLP | Fwd/Bwd, Consistency | 9 |
| KIA | Invertible NN | 0 | Invertible NN | Fwd/Bwd, Rec | Invertibility |
AIKAE is unique in disentangling invertible (for exact state recovery) and augmentation (for increased expressivity), allowing latent inflation and improved learning of nearly Koopman-invariant subspaces (Frion et al., 17 Mar 2025).
5. Applications and Empirical Evaluation
AIKAE and its predecessors have been applied to a diverse set of time-series and spatiotemporal prediction problems:
- Informer long-term forecasting benchmark: On ETTh1, ETTh2, ETTm1/2, ECL, Traffic, Weather, AIKAE outperforms previous IKAEs and competitive models (PatchTST, TimesNet, iTransformer) by 1–3% MSE/MAE, achieving monotonic performance gains with increased lookback window size. Unlike transformers, AIKAE does not saturate with longer input histories and is robust to the curse of horizon (Frion et al., 17 Mar 2025).
- Satellite data assimilation: On Sentinel-2 spectral image sequences, AIKAE achieves minimum MSE/MAE relative to KAEs, non-augmented IKAEs, or zero-padded IKAEs, particularly for forecasting and cross-region transfer. Exact invertibility renders it more stable to overfitting and allows variational assimilation via reverse-time latent optimization (Frion et al., 17 Mar 2025).
6. Interpretability and Operator Structure
AIKAE, by virtue of invertible latent flows and explicit linear Koopman evolution, admits interpretability in both reconstruction and spectral analysis. The eigen-spectrum of the Koopman matrix 1 encodes dominant modes, frequencies, and decay rates. Disentanglement between invertible and augmentation coordinates allows separation of static/reversible and dynamic/non-reversible features, which is beneficial for both data assimilation and transfer tasks (Frion et al., 17 Mar 2025).
7. Extensions and Future Directions
Research opportunities include:
- Stochastic Koopman models: Leveraging the flow's tractable log-likelihood for uncertainty quantification and probabilistic forecasting.
- Trend/seasonality/frequency augmentation: Integrating Fourier blocks or trend decomposition in the encoder, analogous to architectures such as Fedformer, to enhance performance for highly structured signals.
- Hierarchical and multi-scale representations: Constructing layered or multi-resolution latent spaces to scale AIKAE for very high-dimensional or spatiotemporal systems (Frion et al., 17 Mar 2025).
AIKAE represents the current state-of-the-art for invertible Koopman autoencoding in time series forecasting and spatiotemporal assimilation, combining exact reconstruction, latent space inflation, and stable, interpretable linear prediction.