Spatial-Temporal Large Language Model for Traffic Prediction (2401.10134v4)
Abstract: Traffic prediction, an essential component for intelligent transportation systems, endeavours to use historical data to foresee future traffic features at specific locations. Although existing traffic prediction models often emphasize developing complex neural network structures, their accuracy has not improved. Recently, LLMs have shown outstanding capabilities in time series analysis. Differing from existing models, LLMs progress mainly through parameter expansion and extensive pretraining while maintaining their fundamental structures. Motivated by these developments, we propose a Spatial-Temporal LLM (ST-LLM) for traffic prediction. In the ST-LLM, we define timesteps at each location as tokens and design a spatial-temporal embedding to learn the spatial location and global temporal patterns of these tokens. Additionally, we integrate these embeddings by a fusion convolution to each token for a unified spatial-temporal representation. Furthermore, we innovate a partially frozen attention strategy to adapt the LLM to capture global spatial-temporal dependencies for traffic prediction. Comprehensive experiments on real traffic datasets offer evidence that ST-LLM is a powerful spatial-temporal learner that outperforms state-of-the-art models. Notably, the ST-LLM also exhibits robust performance in both few-shot and zero-shot prediction scenarios. The code is publicly available at https://github.com/ChenxiLiu-HNU/ST-LLM.
- Adaptive graph convolutional recurrent network for traffic forecasting. Advances in neural information processing systems, 33:17804–17815, 2020.
- LightTS: Lightweight time series classification with adaptive ensemble distillation. Proceedings of the ACM on Management of Data, 1(2):171:1–171:27, 2023.
- Tempo: Prompt-based generative pre-trained transformer for time series forecasting. arXiv preprint arXiv:2310.04948, 2023.
- Tensor extended kalman filter and its application to traffic prediction. IEEE Transactions on Intelligent Transportation Systems, 24(12):13813–13829, 2023.
- Gatgpt: A pre-trained large language model with graph attention network for spatiotemporal imputation. arXiv preprint arXiv:2311.14332, 2023.
- Graph neural controlled differential equations for traffic forecasting. In Thirty-Sixth Conference on Artificial Intelligence, Virtual Event, February 22 - March 1, 2022.
- Empowering spatial knowledge graph for mobile traffic prediction. In Proceedings of the 31st ACM International Conference on Advances in Geographic Information Systems, pages 1–11, 2023.
- Attention based spatial-temporal graph convolutional networks for traffic flow forecasting. In The Thirty-Third Conference on Artificial Intelligence, Honolulu, Hawaii, USA, January 27 - February 1, pages 922–929, 2019.
- Learning dynamics and heterogeneity of spatial-temporal graph data for traffic forecasting. IEEE Transactions on Knowledge and Data Engineering, 34(11):5415–5428, 2022.
- Spatio-temporal self-supervised learning for traffic flow prediction. In Brian Williams, Yiling Chen, and Jennifer Neville, editors, Thirty-Seventh Conference on Artificial Intelligence, Washington, DC, USA, February 7-14, pages 4356–4364, 2023.
- Pdformer: Propagation delay-aware dynamic long-range transformer for traffic flow prediction. In Brian Williams, Yiling Chen, and Jennifer Neville, editors, Thirty-Seventh Conference on Artificial Intelligence, Washington, DC, USA, February 7-14, pages 4365–4373, 2023.
- Spatio-temporal graph neural networks for predictive learning in urban computing: A survey. IEEE Transactions on Knowledge and Data Engineering, pages 1–20, 2023.
- Time-llm: Time series forecasting by reprogramming large language models. International Conference on Learning Representations, 2023.
- Large models for time series and spatio-temporal data: A survey and outlook. arXiv preprint arXiv:2310.10196, 2023.
- MELTR: meta loss transformer for learning to fine-tune video foundation models. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, Vancouver, BC, Canada, June 17-24, pages 20105–20115, 2023.
- Short-term traffic flow prediction using seasonal arima model with limited input data. European Transport Research Review, 7(3):1–9, 2015.
- Spatio-temporal graph mixformer for traffic forecasting. Expert Systems with Applications, 228:120281, 2023.
- Diffusion convolutional recurrent neural network: Data-driven traffic forecasting. In International Conference on Learning Representations, pages 1–16, 2018.
- Dynamic graph convolutional recurrent network for traffic prediction: Benchmark and solution. ACM Transactions on Knowledge Discovery from Data, 17(1):9:1–9:21, 2023.
- Self-attention convlstm for spatiotemporal prediction. In Thirty-Fourth Conference on Artificial Intelligence, New York, NY, USA, February 7-12, pages 11531–11538, 2020.
- Foreseeing private car transfer between urban regions with multiple graph-based generative adversarial networks. World Wide Web, 25(6):2515–2534, 2022.
- Exploiting spatiotemporal correlations of arrive-stay-leave behaviors for private car flow prediction. IEEE Transactions on Network Science and Engineering, 9(2):834–847, 2022.
- Spatio-temporal adaptive embedding makes vanilla transformer sota for traffic forecasting. In Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, pages 4125–4129, 2023.
- itransformer: Inverted transformers are effective for time series forecasting. arXiv preprint arXiv:2310.06625, 2023.
- Frozen pretrained transformers as universal computation engines. In Thirty-Sixth Conference on Artificial Intelligence, Virtual Event, February 22 - March 1, pages 7628–7636, 2022.
- Benchmarking large language model capabilities for conditional generation. In Anna Rogers, Jordan L. Boyd-Graber, and Naoaki Okazaki, editors, Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics, Toronto, Canada, July 9-14, pages 9194–9213, 2023.
- Mba-stnet: Bayes-enhanced discriminative multi-task learning for flow prediction. IEEE Trans. Knowl. Data Eng., 35(7):7164–7177, 2023.
- A unified replay-based continuous learning framework for spatio-temporal prediction on streaming data. IEEE International Conference on Data Engineering, 2024.
- Shikai Qiu Nate Gruver, Marc Finzi and Andrew Gordon Wilson. Large language models are zero shot time series forecasters. In Advances in Neural Information Processing Systems, pages 1–29, 2023.
- Knowledge of cultural moral norms in large language models. In Anna Rogers, Jordan L. Boyd-Graber, and Naoaki Okazaki, editors, Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics, Toronto, Canada, July 9-14, pages 428–446, 2023.
- Lag-llama: Towards foundation models for time series forecasting. arXiv preprint arXiv:2310.08278, 2023.
- Decoupled dynamic spatial-temporal graph neural network for traffic forecasting. Proc. VLDB Endow., 15(11):2733–2746, jul 2022.
- Stepdeep: A novel spatial-temporal mobility event prediction framework based on deep neural network. In Proceedings of the 24th ACM SIGKDD international conference on knowledge discovery & data mining, pages 724–733, 2018.
- Spatial-temporal synchronous graph convolutional networks: A new framework for spatial-temporal network data forecasting. In Proceedings of the AAAI conference on artificial intelligence, volume 34, pages 914–921, 2020.
- Test: Text prototype aligned embedding to activate llm’s ability for time series. arXiv preprint arXiv:2308.08241, 2023.
- Multi-task adversarial spatial-temporal networks for crowd flow prediction. In Proceedings of the 29th ACM International Conference on Information & Knowledge Management, page 1555–1564, 2020.
- Diffstg: Probabilistic spatio-temporal graph forecasting with denoising diffusion models. In Proceedings of the 31st ACM International Conference on Advances in Geographic Information Systems, pages 60:1–60:12, 2023.
- Graph wavenet for deep spatial-temporal graph modeling. In Sarit Kraus, editor, Proceedings of the Twenty-Eight International Joint Conference on Artificial Intelligence, Macao, China, August 10-16, pages 1907–1913, 2019.
- AutoCTS+: Joint neural architecture and hyperparameter search for correlated time series forecasting. Proceedings of the ACM on Management of Data, 1(1):97:1–97:26, 2023.
- Leveraging language foundation models for human mobility forecasting. In Proceedings of the 30th International Conference on Advances in Geographic Information Systems, Seattle, Washington, November 1-4, pages 90:1–90:9, 2022.
- Coupled layer-wise graph convolution for transportation demand prediction. In Thirty-Fifth Conference on Artificial Intelligence, Virtual Event, February 2-9, pages 4617–4625, 2021.
- Deep learning on traffic prediction: Methods, analysis, and future directions. IEEE Transactions on Intelligent Transportation Systems, 23(6):4927–4943, 2021.
- Spatio-temporal graph convolutional networks: A deep learning framework for traffic forecasting. In Proceedings of the Twenty-Seven International Joint Conference on Artificial Intelligence, page 3634–3640, 2018.
- Hint-aug: Drawing hints from foundation vision transformers towards boosted few-shot parameter-efficient tuning. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, Vancouver, BC, Canada, June 17-24, pages 11102–11112, 2023.
- Hetero-convlstm: A deep learning approach to traffic accident prediction on heterogeneous spatio-temporal data. In Proceedings of the 24th ACM SIGKDD international conference on knowledge discovery & data mining, pages 984–992, 2018.
- Gman: A graph multi-attention network for traffic prediction. In Proceedings of the AAAI conference on artificial intelligence, volume 34, pages 1234–1241, 2020.
- One Fits All: Power general time series analysis by pretrained lm. In Advances in Neural Information Processing Systems, pages 1–34, 2023.
- Crest: A credible spatiotemporal learning framework for uncertainty-aware traffic forecasting. In The 17th ACM International Conference on Web Search and Data Mining, pages 1–10, 2024.
- Chenxi Liu (84 papers)
- Sun Yang (7 papers)
- Qianxiong Xu (14 papers)
- Zhishuai Li (16 papers)
- Cheng Long (65 papers)
- Ziyue Li (68 papers)
- Rui Zhao (241 papers)