Spatio-Temporal Joint Graph Convolutional Networks for Traffic Forecasting (2111.13684v3)
Abstract: Recent studies have shifted their focus towards formulating traffic forecasting as a spatio-temporal graph modeling problem. Typically, they constructed a static spatial graph at each time step and then connected each node with itself between adjacent time steps to create a spatio-temporal graph. However, this approach failed to explicitly reflect the correlations between different nodes at different time steps, thus limiting the learning capability of graph neural networks. Additionally, those models overlooked the dynamic spatio-temporal correlations among nodes by using the same adjacency matrix across different time steps. To address these limitations, we propose a novel approach called Spatio-Temporal Joint Graph Convolutional Networks (STJGCN) for accurate traffic forecasting on road networks over multiple future time steps. Specifically, our method encompasses the construction of both pre-defined and adaptive spatio-temporal joint graphs (STJGs) between any two time steps, which represent comprehensive and dynamic spatio-temporal correlations. We further introduce dilated causal spatio-temporal joint graph convolution layers on the STJG to capture spatio-temporal dependencies from distinct perspectives with multiple ranges. To aggregate information from different ranges, we propose a multi-range attention mechanism. Finally, we evaluate our approach on five public traffic datasets and experimental results demonstrate that STJGCN is not only computationally efficient but also outperforms 11 state-of-the-art baseline methods.
- S. Wang, J. Cao, and P. S. Yu, “Deep learning for spatio-temporal data mining: A survey,” IEEE Transactions on Knowledge and Data Engineering, 2020.
- D. A. Tedjopurnomo, Z. Bao, B. Zheng, F. M. Choudhury, and K. Qin, “A survey on modern deep neural network for traffic prediction: Trends, methods and challenges,” IEEE Transactions on Knowledge and Data Engineering, 2020.
- C. Zheng, C. Wang, X. Fan, J. Qi, and X. Yan, “Stpc-net: Learn massive geo-sensory data as spatio-temporal point clouds,” IEEE Transactions on Intelligent Transportation Systems, 2021.
- Y. Li, R. Yu, C. Shahabi, and Y. Liu, “Diffusion convolutional recurrent neural network: Data-driven traffic forecasting,” in ICLR, 2018.
- W. Cheng, Y. Shen, Y. Zhu, and L. Huang, “A neural attention model for urban air quality inference: learning the weights of monitoring stations,” in AAAI, 2018, pp. 2151–2158.
- B. Yu, H. Yin, and Z. Zhu, “Spatio-temporal graph convolutional networks: A deep learning framework for traffic forecasting,” in IJCAI, 2018, pp. 3634–3640.
- Z. Wu, S. Pan, G. Long, J. Jiang, and C. Zhang, “Graph wavenet for deep spatial-temporal graph modeling,” in IJCAI, 2019.
- S. Guo, Y. Lin, N. Feng, C. Song, and HuaiyuWan, “Attention based spatial-temporal graph convolutional networks for traffic flow forecasting,” in AAAI, 2019, pp. 922–929.
- C. Song, Y. Lin, S. Guo, and HuaiyuWan, “Spatial-temporal synchronous graph convolutional networks: A new framework for spatial-temporal network data forecasting,” in AAAI, 2020.
- L. Bai, L. Yao, C. Li, X. Wang, and C. Wang, “Adaptive graph convolutional recurrent network for traffic forecasting,” in NeurIPS, 2020.
- Y. Chen, I. Segovia-Dominguez, and Y. R. Gel, “Z-gcnets: Time zigzags at graph convolutional networks for time series forecasting,” in ICML, 2021.
- Z. Wu, S. Pan, F. Chen, G. Long, C. Zhang, and P. S. Yu, “A comprehensive survey on graph neural networks,” IEEE Transactions on Neural Networks and Learning Systems, vol. 32, no. 1, pp. 4–24, 2021.
- T. N. Kipf and M. Welling, “Semi-supervised classification with graph convolutional networks,” in ICLR, 2017.
- M. Zhang and Y. Chen, “Link prediction based on graph neural networks,” in NeurIPS, 2018, pp. 5165–5175.
- M. M. Bronstein, J. Bruna, Y. LeCun, A. Szlam, and P. Vandergheynst, “Geometric deep learning: going beyond euclidean data,” IEEE Signal Processing Magazine, vol. 34, no. 4, pp. 18–42, 2017.
- M. Niepert, M. Ahmed, and K. Kutzkov, “Learning convolutional neural networks for graphs,” in ICML, 2016, pp. 2014–2023.
- W. L. Hamilton, R. Ying, and J. Leskovec, “Inductive representation learning on large graphs,” in NeurIPS, 2017, pp. 1024–1034.
- P. Veličković, G. Cucurull, A. Casanova, A. Romero, P. Liò, and Y. Bengio, “Graph attention networks,” in ICLR, 2018.
- J. Jia and A. R. Benson, “Residual correlation in graph neural network regression,” in Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2020, pp. 588–598.
- Q. Li, Z. Han, and X.-M. Wu, “Deeper insights into graph convolutional networks for semi-supervised learning,” in AAAI, 2020.
- J. Bruna, W. Zaremba, A. Szlam, and Y. LeCun, “Spectral networks and deep locally connected networks on graphs,” in ICLR, 2014.
- M. Defferrard, X. Bresson, and P. Vandergheynst, “Convolutional neural networks on graphs with fast localized spectral filtering,” in NeurIPS, 2016, pp. 3844–3852.
- S. Zhang, Y. Guo, P. Zhao, C. Zheng, and X. Chen, “A graph-based temporal attention framework for multi-sensor traffic flow forecasting,” IEEE Transactions on Intelligent Transportation Systems, 2021.
- S. Yan, Y. Xiong, and D. Lin, “Spatial temporal graph convolutional networks for skeleton-based action recognition,” in AAAI, 2018, p. 3482–3489.
- L. Shi, Y. Zhang, J. Cheng, and H. Lu, “Two-stream adaptive graph convolutional networks for skeleton-based action recognition,” in CVPR, 2019, p. 12026–12035.
- X. Yin, G. Wu, J. Wei, Y. Shen, H. Qi, and B. Yin, “A comprehensive survey on traffic prediction,” arXiv preprint arXiv:2004.08555, 2020.
- J. Sun, J. Zhang, Q. Li, X. Yi, Y. Liang, and Y. Zheng, “Predicting citywide crowd flows in irregular regions using multi-view graph convolutional networks,” IEEE Transactions on Knowledge and Data Engineering, 2020.
- J. Gu, Q. Zhou, J. Yang, Y. Liu, F. Zhuang, Y. Zhao, and H. Xiong, “Exploiting interpretable patterns for flow prediction in dockless bike sharing systems,” IEEE Transactions on Knowledge and Data Engineering, 2020.
- R. Jiang, Z. Cai, Z. Wang, C. Yang, Z. Fan, Q. Chen, K. Tsubouchi, X. Song, and R. Shibasaki, “Deepcrowd: A deep model for large-scale citywide crowd density and flow prediction,” IEEE Transactions on Knowledge and Data Engineering, 2021.
- S. Guo, Y. Lin, H. Wan, X. Li, and G. Cong, “Learning dynamics and heterogeneity of spatial-temporal graph data for traffic forecasting,” IEEE Transactions on Knowledge and Data Engineering, 2021.
- X. Ma, Z. Tao, Y. Wang, H. Yu, and Y. Wang, “Long short-term memory neural network for traffic speed prediction using remote microwave sensor data,” Transportation Research Part C: Emerging Technologies, vol. 54, pp. 187–197, 2015.
- J. Zhang, Y. Zheng, and D. Qi, “Deep spatio-temporal residual networks for citywide crowd flows prediction,” in AAAI, 2017, pp. 1655–1661.
- H. Yao, F. Wu, J. Ke, X. Tang, Y. Jia, S. Lu, P. Gong, J. Ye, and Z. Li, “Deep multi-view spatial-temporal network for taxi demand prediction,” in AAAI, 2018, pp. 2588–2595.
- H. Yao, X. Tang, H. Wei, G. Zheng, and Z. Li, “Revisiting spatial-temporal similarity: A deep learning framework for traffic prediction,” in AAAI, 2019.
- C. Zheng, X. Fan, C. Wen, L. Chen, C. Wang, and J. Li, “Deepstd: Mining spatio-temporal disturbances of multiple context factors for citywide traffic flow prediction,” IEEE Transactions on Intelligent Transportation Systems, vol. 21, no. 9, pp. 3744–3755, 2020.
- J. Zhang, Y. Zheng, J. Sun, and D. Qi, “Flow prediction in spatio-temporal networks based on multitask deep learning,” IEEE Transactions on Knowledge and Data Engineering, vol. 32, no. 3, 2020.
- J. Ye, J. Zhao, K. Ye, and C. Xu, “How to build a graph-based deep learning architecture in traffic domain: A survey,” arXiv preprint arXiv:2005.11691, 2020.
- Y. H. Lau and R. C.-W. Wong, “Spatio-temporal graph convolutional networks for traffic forecasting: Spatial layers first or temporal layers first?” in Proceedings of the 29th International Conference on Advances in Geographic Information Systems, 2021, pp. 427–430.
- C. Zheng, X. Fan, C. Wang, and J. Qi, “Gman: A graph multi-attention network for traffic prediction,” in AAAI, 2020, pp. 1234–1241.
- X. Wang, Y. Ma, Y. Wang, W. Jin, X. Wang, J. Tang, C. Jia, and J. Yu, “Traffic flow prediction via spatial temporal graph neural network,” in WWW, 2020, pp. 1082–1092.
- A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, Łukasz Kaiser, and I. Polosukhin, “Attention is all you need,” in NeurIPS, 2017, pp. 5998–6008.
- Y.-J. Lu and C.-T. Li, “Agstn: Learning attention-adjusted graph spatio-temporal networks for short-term urban sensor value forecasting,” in 2020 IEEE International Conference on Data Mining (ICDM). IEEE, 2020, pp. 1148–1153.
- Y. Wang, H. Yin, H. Chen, T. Wo, J. Xu, and K. Zheng, “Origin-destination matrix prediction via graph convolution: a new perspective of passenger demand modeling,” in KDD, 2019, pp. 1227–1235.
- W. Chen, L. Chen, Y. Xie, W. Cao, Y. Gao, and X. Feng, “Multi-range attentive bicomponent graph convolutional network for traffic forecasting,” in AAAI, 2020.
- D. Deng, C. Shahabi, U. Demiryurek, L. Zhu, R. Yu, and Y. Liu, “Latent space model for road networks to predict time-varying traffic,” in KDD, 2016.
- V. Nair and G. E. Hinton, “Rectified linear units improve restricted boltzmann machines,” in ICML, 2010, pp. 807–814.
- M. Schuster and K. K. Paliwal, “Bidirectional recurrent neural networks,” IEEE Transactions on Signal Processing, vol. 45, no. 11, pp. 2673–2681, 1997.
- A. van den Oord, S. Dieleman, H. Zen, K. Simonyan, O. Vinyals, A. Graves, N. Kalchbrenner, A. Senior, and K. Kavukcuoglu, “Wavenet: A generative model for raw audio,” arXiv preprint arXiv:1609.03499, 2016.
- S. Bai, J. Z. Kolter, and V. Koltun, “An empirical evaluation of generic convolutional and recurrent networks for sequence modeling,” arXiv preprint arXiv:1803.01271, 2018.
- K. He, X. Zhang, S. Ren, and J. Sun, “Deep residual learning for image recognition,” in CVPR, 2016, pp. 770–778.
- Z. Cui, R. Ke, and Y. Wang, “Deep bidirectional and unidirectional lstm recurrent neural network for network-wide traffic speed prediction,” arXiv preprint arXiv:1801.02143, 2018.
- Z. Cui, K. Henrickson, R. Ke, and Y. Wang, “Traffic graph convolutional recurrent neural network: A deep learning framework for network-scale traffic learning and forecasting,” IEEE Transactions on Intelligent Transportation Systems, 2019.
- Y. Shin and Y. Yoon, “Pgcn: Progressive graph convolutional networks for spatial-temporal traffic forecasting,” arXiv preprint arXiv:2202.08982, 2022.
- S. Ioffe and C. Szegedy, “Batch normalization: Accelerating deep network training by reducing internal covariate shift,” in ICML, 2015, pp. 448–456.
- D. P. Kingma and J. L. Ba, “Adam: a method for stochastic optimization,” in ICLR, 2015.
- I. Sutskever, O. Vinyals, and Q. V. Le, “Sequence to sequence learning with neural networks,” in NeurIPS, 2014, pp. 3104–3112.
- H. Drucker, C. J. Burges, L. Kaufman, A. Smola, and V. Vapoik, “Support vector regression machines,” in NeurIPS, 1997, pp. 155–161.
- G. Carlsson and V. de Silva, “Zigzag persistence,” Foundations of computational mathematics, vol. 10, pp. 367–405, 2010.
- Chuanpan Zheng (4 papers)
- Xiaoliang Fan (17 papers)
- Shirui Pan (198 papers)
- Haibing Jin (2 papers)
- Zhaopeng Peng (7 papers)
- Zonghan Wu (11 papers)
- Cheng Wang (386 papers)
- Philip S. Yu (592 papers)