Overview of Stacked Bidirectional and Unidirectional LSTM Networks for Traffic Speed Prediction
The research paper "Stacked Bidirectional and Unidirectional LSTM Recurrent Neural Network for Network-wide Traffic Speed Prediction" by Zhiyong Cui, Ruimin Ke, Ziyuan Pu, and Yinhai Wang proposes a deep learning model for short-term traffic forecasting built on long short-term memory (LSTM) neural networks. The paper introduces the Stacked Bidirectional and Unidirectional LSTM (SBU-LSTM) architecture, designed to improve predictive power by capturing both forward and backward dependencies in time-series data.
Methodology and Novel Contributions
Traffic forecasting has traditionally relied on classical statistical methods and a range of neural network (NN)-based computational intelligence approaches. The paper observes, however, that existing methods generally use shallow architectures or address only limited spatial-temporal aspects, without fully exploiting the capabilities of LSTMs. To address these gaps, the authors propose a deep architecture that stacks bidirectional and unidirectional LSTM layers to capture spatial-temporal features: the bidirectional LSTM (BDLSTM) layer learns both forward and backward temporal dependencies, while a subsequent unidirectional LSTM layer processes the resulting features to produce the prediction.
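To make the layer stacking concrete, the following is a minimal sketch of such a two-layer design in TensorFlow/Keras. The layer widths, input dimensions, and optimizer settings are illustrative assumptions, not the authors' exact configuration.

```python
from tensorflow import keras
from tensorflow.keras import layers

TIME_STEPS = 10     # length of the input speed history (assumed)
NUM_SENSORS = 100   # number of monitored locations (assumed)

model = keras.Sequential([
    keras.Input(shape=(TIME_STEPS, NUM_SENSORS)),
    # Bidirectional LSTM layer: reads the speed history forward and backward.
    layers.Bidirectional(layers.LSTM(64, return_sequences=True)),
    # Unidirectional LSTM layer: condenses the captured features for prediction.
    layers.LSTM(64),
    # One predicted speed per location at the next time step.
    layers.Dense(NUM_SENSORS),
])
model.compile(optimizer="adam", loss="mae")
model.summary()
```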
The architecture also includes a masking mechanism for handling input data with missing values, which makes it robust in real-world settings where sensor failures or data dropouts occur. Together, these components allow the model to predict traffic speeds across both freeway and complex urban networks with high accuracy and robustness.
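One common way to realize such a masking step, sketched below using Keras' built-in Masking layer, is to encode missing readings with a sentinel value so that fully missing time steps are skipped by the recurrent layers. The sentinel value and layer sizes are assumptions for illustration and may differ from the paper's exact mechanism.

```python
import numpy as np
from tensorflow import keras
from tensorflow.keras import layers

MISSING = -1.0  # sentinel marking missing speed readings (assumed)

masked_model = keras.Sequential([
    keras.Input(shape=(10, 5)),   # (time steps, locations), illustrative
    # Time steps whose features all equal the sentinel are skipped downstream.
    layers.Masking(mask_value=MISSING),
    layers.Bidirectional(layers.LSTM(32, return_sequences=True)),
    layers.LSTM(32),
    layers.Dense(5),
])
masked_model.compile(optimizer="adam", loss="mae")

# Toy batch in which one time step per sequence is entirely missing.
x = np.random.rand(4, 10, 5).astype("float32")
x[:, 3, :] = MISSING              # this time step is ignored by the LSTM layers
y = np.random.rand(4, 5).astype("float32")
masked_model.fit(x, y, epochs=1, verbose=0)
```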
Numerical Results and Performance
Extensive experiments use two kinds of traffic data: high-resolution freeway sensor data and wide-coverage urban INRIX data. The proposed SBU-LSTM outperforms classical machine learning models such as support vector machines (SVM) and random forests, achieving lower mean absolute error (MAE) and mean absolute percentage error (MAPE). For single-location prediction, the SBU-LSTM yields an MAE of 2.42 mph, surpassing other recurrent neural network architectures, including GRU networks.
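For reference, the two reported error metrics can be computed as in the short NumPy sketch below; the sample speed values are made up purely for illustration.

```python
import numpy as np

def mae(y_true: np.ndarray, y_pred: np.ndarray) -> float:
    """Mean absolute error, in the same units as the inputs (mph here)."""
    return float(np.mean(np.abs(y_true - y_pred)))

def mape(y_true: np.ndarray, y_pred: np.ndarray) -> float:
    """Mean absolute percentage error, in percent."""
    return float(np.mean(np.abs((y_true - y_pred) / y_true)) * 100.0)

observed = np.array([62.0, 55.0, 48.0, 60.0])   # illustrative speeds (mph)
predicted = np.array([60.5, 57.0, 45.5, 61.0])
print(f"MAE:  {mae(observed, predicted):.2f} mph")
print(f"MAPE: {mape(observed, predicted):.2f} %")
```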
Further experiments on network-wide prediction show that the model maintains its accuracy as the size of the monitored network grows, without significant degradation, indicating that the SBU-LSTM scales effectively to different network conditions.
Implications and Future Work
This research has both theoretical and practical implications. Theoretically, it demonstrates the value of integrating bidirectional recurrent structures within deep predictive frameworks for time-series analysis in traffic speed forecasting. Practically, the model's applicability to large-scale networks and varied geographies makes it suitable for real-world intelligent transportation system (ITS) applications, promising improvements in traffic management and planning.
Future work could integrate additional data dimensions, such as environmental conditions and incident reports, to distinguish recurring from non-recurring congestion. Extending the model to other modes of transportation, or to predict additional network states (e.g., congestion levels), would further broaden its utility, and incorporating graph-based networks for spatial feature learning could expand its application potential. Implemented within a traffic-management platform, the model could help catalyze AI-driven solutions in transportation ecosystems, leading to smarter, more responsive urban environments.