Multi-output Bus Travel Time Prediction with Convolutional LSTM Neural Network (1903.02791v1)

Published 7 Mar 2019 in stat.ML and cs.LG

Abstract: Accurate and reliable travel time predictions in public transport networks are essential for delivering an attractive service that is able to compete with other modes of transport in urban areas. The traditional application of this information, where arrival and departure predictions are displayed on digital boards, is highly visible in the city landscape of most modern metropolises. More recently, the same information has become critical as input for smart-phone trip planners in order to alert passengers about unreachable connections, alternative route choices and prolonged travel times. More sophisticated Intelligent Transport Systems (ITS) include the predictions of connection assurance, i.e. to hold back services in case a connecting service is delayed. In order to operate such systems, and to ensure the confidence of passengers in the systems, the information provided must be accurate and reliable. Traditional methods have trouble with this as congestion, and thus travel time variability, increases in cities, consequently making travel time predictions in urban areas a non-trivial task. This paper presents a system for bus travel time prediction that leverages the non-static spatio-temporal correlations present in urban bus networks, allowing the discovery of complex patterns not captured by traditional methods. The underlying model is a multi-output, multi-time-step, deep neural network that uses a combination of convolutional and long short-term memory (LSTM) layers. The method is empirically evaluated and compared to other popular approaches for link travel time prediction and currently available services, including the currently deployed model in Copenhagen, Denmark. We find that the proposed model significantly outperforms all the other methods we compare with, and is able to detect small irregular peaks in bus travel times very quickly.

Citations (211)

View on Semantic Scholar

Summary

The paper introduces a ConvLSTM-based model that significantly reduces prediction errors (MAE, RMSE, MAPE) compared to traditional methods.
The methodology integrates convolutional filters within LSTM layers to capture spatio-temporal dependencies and enhance multi-horizon forecasting.
Empirical validation with Copenhagen transit data shows improved peak-hour adaptability and operational reliability in urban transport.

Insights on Multi-output Bus Travel Time Prediction Using Convolutional LSTM Networks

The utilization of intelligent transport systems (ITS) for real-time information dissemination is a significant advancement within public transportation domains. However, accurate bus travel time prediction remains challenging in dense urban areas due to erratic congestion and varying conditions. This paper introduces a novel approach combining Convolutional and Long Short-Term Memory (ConvLSTM) networks for multi-output bus travel time prediction, demonstrating superior performance over several state-of-the-art methodologies.

Overview of the Methodology

This research proposes leveraging convLSTM layers to model the non-static spatio-temporal correlations inherent in urban bus networks. The model aggregates a sequence encoder/decoder framework, enhancing ITS applications with improved predictive capabilities. The key architectural aspect involves decoding temporal patterns while retaining spatial dependencies across network links. The ConvLSTM structure achieves this by integrating convolutional filters in both input-to-state and state-to-state transitions, efficient for learning spatial dependencies over time, thus improving forecast precision across multiple time horizons.

Compared against a pure LSTM network, the inclusion of convolutional filters in this work significantly reduces parameters requiring optimization, allowing for deeper network layers and more comprehensive pattern discovery. Experimental results show the ConvLSTM model outperforms existing bus travel time prediction models, including the operational model used by Denmark's Movia and Google Traffic estimations, especially during peak congestion times when precision is critical.

Empirical Findings and Comparative Analysis

The empirical evaluation using Greater Copenhagen's transport authority data reflects the merits of integrating ConvLSTM architectures in complex urban transit systems. The proposed model delivers prominent reductions in mean absolute error (MAE), root mean square error (RMSE), and mean absolute percentage error (MAPE) in predicting link travel times over traditional historical averages, standard LSTM, and the existing Movia system.

The ConvLSTM method demonstrates impressive adaptability to unanticipated variances in travel time during peak hours, an aspect where more conventional models fall short. There is a marked improvement observed in the detailed instances analyzed, with the model quickly capturing and responding to abrupt traffic patterns, a crucial trait for system reliability in metropolitan landscapes.

Implications and Future Directions

The implications for transport authorities are manifold. Implementing such predictive systems can enhance travel experience by reliably informing passenger choices and advancing route efficiency through dynamic adjustments during service disruptions. Moreover, the potential for integrating additional contextual data—such as weather and special events—could further elevate predictive precision.

The model's deployment within the Greater Copenhagen region appears within reach, relying minimally on additional infrastructural requirements given its foundation in standard AVL outputs. However, further developments might explore integration with live control systems for on-the-fly adjustments, database enrichment using geological and temporal markers, and scaling through multi-route modeling to encapsulate entire networks.

Future research could extend these principles into cross-domain applications, such as ensemble methods for holistic traffic management solutions. Graph-based convolutional approaches also present promising avenues to grasp interlink dependencies beyond linear sequences, potentially unfolding new paradigms of multi-modal transport reliability.

In conclusion, this ConvLSTM-based model represents a pragmatic step towards intelligent public transport solutions, combining rigorous methodological advancements with proven operational enhancements within transit agencies. The prospect of deploying a highly granular, adaptable, and computationally scalable system promises substantive gains in service quality and operational resilience.

PDF Markdown