Adaptive Graph Convolutional Recurrent Network for Traffic Forecasting (2007.02842v2)

Published 6 Jul 2020 in cs.LG and stat.ML

Abstract: Modeling complex spatial and temporal correlations in the correlated time series data is indispensable for understanding the traffic dynamics and predicting the future status of an evolving traffic system. Recent works focus on designing complicated graph neural network architectures to capture shared patterns with the help of pre-defined graphs. In this paper, we argue that learning node-specific patterns is essential for traffic forecasting while the pre-defined graph is avoidable. To this end, we propose two adaptive modules for enhancing Graph Convolutional Network (GCN) with new capabilities: 1) a Node Adaptive Parameter Learning (NAPL) module to capture node-specific patterns; 2) a Data Adaptive Graph Generation (DAGG) module to infer the inter-dependencies among different traffic series automatically. We further propose an Adaptive Graph Convolutional Recurrent Network (AGCRN) to capture fine-grained spatial and temporal correlations in traffic series automatically based on the two modules and recurrent networks. Our experiments on two real-world traffic datasets show AGCRN outperforms state-of-the-art by a significant margin without pre-defined graphs about spatial connections.

Citations (956)

View on Semantic Scholar

Summary

The paper introduces AGCRN, which integrates Node Adaptive Parameter Learning and Data Adaptive Graph Generation to capture detailed spatial-temporal traffic patterns.
It outperforms state-of-the-art models with over 5% improvement in MAE and MAPE on real-world datasets, demonstrating robust forecasting performance.
The adaptive design eliminates the need for pre-defined graphs, paving the way for scalable and efficient intelligent transportation systems.

Adaptive Graph Convolutional Recurrent Network for Traffic Forecasting

The paper delineates a pragmatic approach to addressing the problem of traffic forecasting, which is inherently complex due to its spatial and temporal correlations. Traditional methods such as ARIMA and VAR fall short of capturing these nonlinear patterns and intricate dependencies. Moreover, recent deep learning models have also shown limitations due to their reliance on pre-defined graphs and inability to learn node-specific patterns effectively.

Central to the proposed method are two components: Node Adaptive Parameter Learning (NAPL) and Data Adaptive Graph Generation (DAGG). These components collectively build an Adaptive Graph Convolutional Recurrent Network (AGCRN) that autonomously captures fine-grained spatial and temporal correlations without the need for pre-defined graphs.

Node Adaptive Parameter Learning (NAPL)

The NAPL module introduces an adaptive mechanism to learn node-specific patterns via parameter factorization. Typically, GCNs share parameters across all nodes, leading to a suboptimal representation of unique node attributes. NAPL addresses this by factorizing the node-specific parameters into a shared weight pool and a node embedding matrix. It effectively draws node-specific parameters from this pool based on learned node embeddings, achieving a reduced parameter space while still accounting for node-specific patterns. This approach enhances the ability to model diverse traffic patterns associated with different nodes, making the model robust to a variety of traffic dynamics.

Data Adaptive Graph Generation (DAGG)

Another salient feature of AGCRN is the DAGG module, developed to infer the inter-dependencies among traffic series directly from data. Traditional GCN models require pre-defining a spatial adjacency matrix, which can be biased and incomplete. DAGG circumvents this by learning an adaptive adjacency matrix from node embeddings, adjusting dynamically during training. This ensures the derived spatial correlations are task-specific and not limited by pre-defined heuristics.

Adaptive Graph Convolutional Recurrent Network (AGCRN)

Combining NAPL and DAGG within a recurrent framework led to the creation of AGCRN. By integrating these two modules into a GRU-based architecture, AGCRN effectively captures both node-specific spatial and temporal correlations. Through shared embedding matrices, the model ensures consistency and interpretability across different NAPL-GCN layers and the DAGG module. The end-to-end trainability of AGCRN allows it to autonomously learn meaningful node representations and spatial dependencies, which can further be applied to other tasks.

Experimental Results

The empirical assessment of AGCRN was conducted on two real-world traffic datasets, PeMSD4 and PeMSD8. The results were compelling, demonstrating that AGCRN outperformed state-of-the-art models across multiple evaluation metrics: MAE, RMSE, and MAPE. Specifically, AGCRN showed over 5% relative improvement in MAE and MAPE for both datasets. This signifies its superior capability in accurately predicting short-term and long-term traffic patterns. Moreover, the model’s performance deterioration rate was slower, indicating robustness in long-term forecasting.

Implications and Future Directions

The implications of this research are substantial both theoretically and practically. Theoretically, AGCRN's ability to learn and adaptively model node-specific patterns and spatial dependencies adds a potent tool to the domain of time-series analysis. Practically, such a model can enhance Intelligent Transportation Systems (ITS), aiding in traffic scheduling, management, and ultimately alleviating urban mobility challenges.

Future developments could venture towards expanding the applicability of NAPL and DAGG beyond traffic forecasting to other correlated time series prediction tasks, such as epidemic transmission, weather forecasting, and financial markets. Further research could also explore scaling AGCRN to even larger datasets and more complex domains, validating its generalizability and robustness.

In summary, while traffic forecasting presents intricate challenges, the Adaptive Graph Convolutional Recurrent Network meticulously designed with NAPL and DAGG modules offers a significant advancement. It inherently learns and adapts to the spatial-temporal intricacies of traffic data, ensuring superior forecasting performance. This research paves the way for more adaptive and intelligent systems capable of handling the dynamic, evolving nature of modern urban transport networks.

PDF Markdown