Forecasting Economics and Financial Time Series: ARIMA vs. LSTM (1803.06386v1)

Published 16 Mar 2018 in cs.LG, q-fin.ST, and stat.ML

Abstract: Forecasting time series data is an important subject in economics, business, and finance. Traditionally, there are several techniques to effectively forecast the next lag of time series data such as univariate Autoregressive (AR), univariate Moving Average (MA), Simple Exponential Smoothing (SES), and more notably Autoregressive Integrated Moving Average (ARIMA) with its many variations. In particular, ARIMA model has demonstrated its outperformance in precision and accuracy of predicting the next lags of time series. With the recent advancement in computational power of computers and more importantly developing more advanced machine learning algorithms and approaches such as deep learning, new algorithms are developed to forecast time series data. The research question investigated in this article is that whether and how the newly developed deep learning-based algorithms for forecasting time series data, such as "Long Short-Term Memory (LSTM)", are superior to the traditional algorithms. The empirical studies conducted and reported in this article show that deep learning-based algorithms such as LSTM outperform traditional-based algorithms such as ARIMA model. More specifically, the average reduction in error rates obtained by LSTM is between 84 - 87 percent when compared to ARIMA indicating the superiority of LSTM to ARIMA. Furthermore, it was noticed that the number of training times, known as "epoch" in deep learning, has no effect on the performance of the trained forecast model and it exhibits a truly random behavior.

Authors (2)

Sima Siami-Namini (8 papers)
Akbar Siami Namin (29 papers)

Citations (208)

View on Semantic Scholar

Summary

A Comparative Analysis of Forecasting Economic and Financial Time Series: ARIMA vs. LSTM

The paper under consideration conducts a methodical comparison between traditional statistical models and modern deep learning approaches for forecasting economic and financial time series data. Specifically, it assesses the performance of the Autoregressive Integrated Moving Average (ARIMA) model against Long Short-Term Memory (LSTM) networks—a subset of Recurrent Neural Networks (RNNs) renowned for capturing long-term dependencies in sequential data.

Key Findings and Methodological Overview

Through empirical analysis, the paper outlines definitive advantages of LSTM models over ARIMA models in terms of prediction accuracy for financial and economic time series. The paper employs Root Mean Square Error (RMSE) as the primary metric for evaluating model performance, with LSTM models demonstrating a significant reduction in error rates compared to ARIMA. Specifically, LSTM models show an average error reduction of 84% to 87% across the examined datasets.

The research methodology involves a comprehensive experimental setup where historical monthly financial data and various economic indicators serve as the basis for modeling. The datasets cover a diverse range of indices and commodities, enhancing the robustness of the comparative analysis. Both models are trained on 70% of the data and tested on the remaining 30% to ensure a stringent evaluation framework. The "Rolling Forecasting Origin" method is employed, indicating a detailed consideration of out-of-sample forecasting capabilities.

Theoretical Implications and Technical Considerations

The theoretical foundation of the paper underscores the contrasting approaches of ARIMA and LSTM in handling temporal dependencies. ARIMA's reliance on linear assumptions and stationarity contrasts sharply with LSTM's ability to model complex, nonlinear relationships without restrictive assumptions regarding data stationarity. The account of ARIMA's integration step for non-stationary data and LSTM's gated architecture for maintaining long-term dependencies provides insights into their fundamental operational mechanics.

It is worth noting that the paper observes the number of iterations, termed "epochs" in neural network parlance, have no significant impact on LSTM's predictive accuracy. This finding is intriguing as it diverges from typical expectations in machine learning, where epochs are often crucial for model convergence. The randomness exhibited by model performance across different epochs may suggest overfitting concerns or the need for further optimization techniques.

Practical Implications and Prospective Research Directions

From a practical standpoint, the superiority of LSTM in reducing prediction errors emphasizes its potential utility in economic and financial forecasting applications, where accuracy is critical. The results propel the discourse on integrating deep learning methodologies within traditional econometric toolsets, especially considering the volatility and complex dynamism inherent in financial markets.

Future research could explore hybrid models, combining ARIMA's strengths in linear trend capturing with LSTM's prowess in nonlinear pattern recognition. Additionally, extending the paper to include other architectures within the deep learning spectrum, such as CNN-LSTMs or attention-based models, could provide further enhancements in forecasting efficacy.

In conclusion, this paper substantiates LSTM's efficacy over ARIMA in forecasting economic and financial time series through well-founded experimental analyses. The implications for both machine learning practitioners and econometricians are profound, offering a paradigm shift towards adopting neural network-based approaches for improved forecasting performance.

PDF Markdown

Related Papers

Find Related Papers