Data-Driven Forecasting of High-Dimensional Chaotic Systems with Long Short-Term Memory Networks (1802.07486v5)

Published 21 Feb 2018 in physics.comp-ph, cs.LG, and nlin.CD

Abstract: We introduce a data-driven forecasting method for high-dimensional chaotic systems using long short-term memory (LSTM) recurrent neural networks. The proposed LSTM neural networks perform inference of high-dimensional dynamical systems in their reduced order space and are shown to be an effective set of nonlinear approximators of their attractor. We demonstrate the forecasting performance of the LSTM and compare it with Gaussian processes (GPs) in time series obtained from the Lorenz 96 system, the Kuramoto-Sivashinsky equation and a prototype climate model. The LSTM networks outperform the GPs in short-term forecasting accuracy in all applications considered. A hybrid architecture, extending the LSTM with a mean stochastic model (MSM-LSTM), is proposed to ensure convergence to the invariant measure. This novel hybrid method is fully data-driven and extends the forecasting capabilities of LSTM networks.

Authors (5)

Pantelis R. Vlachas (11 papers)
Wonmin Byeon (27 papers)
Zhong Y. Wan (1 paper)
Themistoklis P. Sapsis (40 papers)
Petros Koumoutsakos (68 papers)

Citations (439)

View on Semantic Scholar

Summary

Data-Driven Forecasting of High-Dimensional Chaotic Systems with Long Short-Term Memory Networks

The paper explores the use of Long Short-Term Memory (LSTM) networks for effective forecasting of high-dimensional chaotic systems. Such systems are characterized by intricate dynamics across multiple spatiotemporal scales, making them challenging to predict using traditional methods. The authors present a data-driven approach leveraging LSTM networks to address this challenge, indicating an advancement in the application of neural networks to model complex dynamical phenomena.

Methodology

The paper introduces LSTM networks, known for their ability to handle sequences with temporal dependencies, for forecasting chaotic systems in the reduced order space. LSTM networks, as nonlinear approximators, capitalize on recent historical information to predict the evolution of chaotic systems. In this context, the system state is projected onto a lower-dimensional representation using techniques such as Discrete Fourier Transform (DFT) and Singular Value Decomposition (SVD), which effectively reduce the dimensionality of the problem while retaining critical dynamics.

The paper contrasts LSTM performance against Gaussian Processes (GPs), another popular method for time-series prediction, revealing that LSTMs generally outperform GPs in short-term forecasting tasks. This is attributed to LSTMs' capacity to capture complex nonlinear interactions within the data through their recurrent architecture.

Key Results

The LSTM models consistently demonstrated superior short-term prediction accuracy across various systems, including the Lorenz 96 system, the Kuramoto-Sivashinsky equation, and a barotropic climate model. Particularly in scenarios modeled by the Lorenz 96 and Kuramoto-Sivashinsky equations, the LSTM networks effectively captured the underlying system dynamics, extending the predictability horizon beyond that achieved by GPs.

However, both LSTM and GP models exhibited a tendency to diverge from the invariant measure over the long term due to error accumulation inherent in iteratively forecasting chaotic dynamics. To mitigate this divergence, a hybrid model combining LSTM with a Mean Stochastic Model (MSM) is proposed. This hybrid MSM-LSTM ensures convergence to the invariant measure by utilizing MSM in regions not well-represented by the training data, leveraging the strengths of MSM in capturing long-term statistical behavior.

Implications

This paper's implications are significant for advancing predictive modeling in systems exhibiting nonlinear and chaotic dynamics. The effective use of LSTM networks can enable more reliable short-term predictions, which are crucial where immediate dynamical behaviors are of interest—such as weather forecasting and climate modeling.

Moreover, the hybrid LSTM-MSM approach offers a pathway for integrating machine learning with traditional stochastic modeling techniques to provide robust long-term quantitative forecasts. It underscores the potential of combining advanced data-driven paradigms with existing theoretical models to enhance overall predictive performance.

Future Directions

For future work, the authors suggest further improving the LSTM framework by including mechanisms to model unmodeled lower-energy modes stochastically and employing a mixture of models to capture diverse dynamical behaviors across different attractor regions. These enhancements could further bolster the ability of neural networks to adaptively and accurately reflect the dynamic intricacies of real-world chaotic systems. Additionally, considering deployment in broader contexts where system equations remain unknown or are only partially understood could reveal further applications of this forecasting methodology.

In summary, the paper provides a solid foundation for using LSTM networks in forecasting high-dimensional chaotic systems, presenting a methodological advancement with both practical applications and theoretical significance in the domain of dynamical systems.

PDF Markdown

Related Papers

Find Related Papers