Spatial-Temporal-Decoupled Masked Pre-training for Spatiotemporal Forecasting (2312.00516v3)
Abstract: Spatiotemporal forecasting techniques are important in domains such as transportation, energy, and weather. Accurate prediction of spatiotemporal series remains challenging because of complex spatiotemporal heterogeneity. In particular, current end-to-end models are limited by input length and thus often fall into a spatiotemporal mirage, i.e., similar input time series are followed by dissimilar future values, and vice versa. To address these problems, we propose a novel self-supervised pre-training framework, Spatial-Temporal-Decoupled Masked Pre-training (STD-MAE), that employs two decoupled masked autoencoders to reconstruct spatiotemporal series along the spatial and temporal dimensions. The rich-context representations learned through this reconstruction can be seamlessly integrated by downstream predictors with arbitrary architectures to improve their performance. A series of quantitative and qualitative evaluations on six widely used benchmarks (PEMS03, PEMS04, PEMS07, PEMS08, METR-LA, and PEMS-BAY) is conducted to validate the state-of-the-art performance of STD-MAE. Code is available at https://github.com/Jimmy-7664/STD-MAE.
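The idea of masking along two decoupled dimensions can be illustrated with a small NumPy sketch. It is not taken from the STD-MAE repository: the function names `decoupled_masks` and `masked_mae`, the 25% mask ratio, the `(T, N)` layout (time steps by sensors), and the use of mean absolute error on the hidden entries are all assumptions made for illustration. One mask hides whole time steps across every sensor, the other hides whole sensors across every time step, and each autoencoder would be trained to reconstruct only the entries its own mask hides.

```python
import numpy as np


def decoupled_masks(num_steps, num_nodes, mask_ratio=0.25, seed=0):
    """Build two boolean masks of shape (T, N); True marks a hidden entry.

    Hypothetical helper: the temporal mask hides entire time steps,
    the spatial mask hides entire nodes (sensors).
    """
    rng = np.random.default_rng(seed)
    n_steps_hidden = int(round(mask_ratio * num_steps))
    n_nodes_hidden = int(round(mask_ratio * num_nodes))

    hidden_steps = rng.choice(num_steps, size=n_steps_hidden, replace=False)
    hidden_nodes = rng.choice(num_nodes, size=n_nodes_hidden, replace=False)

    temporal_mask = np.zeros((num_steps, num_nodes), dtype=bool)
    temporal_mask[hidden_steps, :] = True  # full rows: all sensors at those steps

    spatial_mask = np.zeros((num_steps, num_nodes), dtype=bool)
    spatial_mask[:, hidden_nodes] = True  # full columns: all steps of those sensors

    return temporal_mask, spatial_mask


def masked_mae(pred, target, mask):
    """Mean absolute error over hidden entries only (assumed loss choice)."""
    return float(np.abs(pred - target)[mask].mean())


# Toy series: 12 time steps, 4 sensors.
series = np.arange(48, dtype=float).reshape(12, 4)
t_mask, s_mask = decoupled_masks(num_steps=12, num_nodes=4)

# A stand-in "reconstruction" that predicts the series mean everywhere.
baseline = np.full_like(series, series.mean())
t_loss = masked_mae(baseline, series, t_mask)
s_loss = masked_mae(baseline, series, s_mask)
```

In a real pipeline the `baseline` array would be replaced by the output of each autoencoder, and the two encoders would be trained separately so that one specializes in temporal context and the other in spatial context.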
- Haotian Gao
- Renhe Jiang
- Zheng Dong
- Jinliang Deng
- Xuan Song
- Yuxin Ma