Holistic Graph-based Motion Prediction (2301.13545v2)
Abstract: Motion prediction for automated vehicles in complex environments is a difficult task that is to be mastered when automated vehicles are to be used in arbitrary situations. Many factors influence the future motion of traffic participants starting with traffic rules and reaching from the interaction between each other to personal habits of human drivers. Therefore we present a novel approach for a graph-based prediction based on a heterogeneous holistic graph representation that combines temporal information, properties and relations between traffic participants as well as relations with static elements like the road network. The information are encoded through different types of nodes and edges that both are enriched with arbitrary features. We evaluated the approach on the INTERACTION and the Argoverse dataset and conducted an informative ablation study to demonstrate the benefit of different types of information for the motion prediction quality.
- “HDGT: Heterogeneous Driving Graph Transformer for Multi-Agent Trajectory Prediction via Scene Encoding” arXiv, 2022 DOI: 10.48550/ARXIV.2205.09753
- Xiaoyu Mo, Yang Xing and Chen Lv “Heterogeneous Edge-Enhanced Graph Attention Network For Multi-Agent Trajectory Prediction” arXiv, 2021 DOI: 10.48550/ARXIV.2106.07161
- “AgentFormer: Agent-Aware Transformers for Socio-Temporal Multi-Agent Forecasting” In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2021, pp. 9813–9823
- “Learning Lane Graph Representations for Motion Forecasting” In Computer Vision – ECCV 2020 Cham: Springer International Publishing, 2020, pp. 541–556
- Thomas N. Kipf and Max Welling “Semi-Supervised Classification with Graph Convolutional Networks” arXiv, 2016 DOI: 10.48550/ARXIV.1609.02907
- William L. Hamilton, Rex Ying and Jure Leskovec “Inductive Representation Learning on Large Graphs” arXiv, 2017 DOI: 10.48550/ARXIV.1706.02216
- “Graph Attention Networks” arXiv, 2017 DOI: 10.48550/ARXIV.1710.10903
- Shaked Brody, Uri Alon and Eran Yahav “How Attentive are Graph Attention Networks?” arXiv, 2021 DOI: 10.48550/ARXIV.2105.14491
- Matthias Fey and Jan Eric Lenssen “Fast Graph Representation Learning with PyTorch Geometric” arXiv, 2019 DOI: 10.48550/ARXIV.1903.02428
- “A Survey on Heterogeneous Graph Embedding: Methods, Techniques, Applications and Sources” In IEEE Transactions on Big Data, 2022, pp. 1–1 DOI: 10.1109/TBDATA.2022.3177455
- “Heterogeneous Graph Attention Network” In The World Wide Web Conference, WWW ’19 San Francisco, CA, USA: Association for Computing Machinery, 2019, pp. 2022–2032 DOI: 10.1145/3308558.3313562
- “Heterogeneous Graph Transformer” In Proceedings of The Web Conference 2020, WWW ’20 Taipei, Taiwan: Association for Computing Machinery, 2020, pp. 2704–2710 DOI: 10.1145/3366423.3380027
- “Attention is All you Need” In Advances in Neural Information Processing Systems 30 Curran Associates, Inc., 2017 URL: https://proceedings.neurips.cc/paper/2017/file/3f5ee243547dee91fbd053c1c4a845aa-Paper.pdf
- “Social LSTM: Human Trajectory Prediction in Crowded Spaces” In 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016, pp. 961–971 DOI: 10.1109/CVPR.2016.110
- “PRECOG: PREdiction Conditioned on Goals in Visual Multi-Agent Settings” In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2019
- Nicholas Rhinehart, Kris M. Kitani and Paul Vernaza “R2P2: A ReparameteRized Pushforward Policy for Diverse, Precise Generative Path Forecasting” In Proceedings of the European Conference on Computer Vision (ECCV), 2018
- “Long Short-Term Memory” In Neural Computation 9.8, 1997, pp. 1735–1780 DOI: 10.1162/neco.1997.9.8.1735
- “On the Properties of Neural Machine Translation: Encoder-Decoder Approaches” arXiv, 2014 DOI: 10.48550/ARXIV.1409.1259
- Alex Krizhevsky, Ilya Sutskever and Geoffrey E Hinton “ImageNet Classification with Deep Convolutional Neural Networks” In Advances in Neural Information Processing Systems 25 Curran Associates, Inc., 2012 URL: https://proceedings.neurips.cc/paper/2012/file/c399862d3b9d6b76c8436e924a68c45b-Paper.pdf
- “Very Deep Convolutional Networks for Large-Scale Image Recognition” arXiv, 2014 DOI: 10.48550/ARXIV.1409.1556
- Joey Hong, Benjamin Sapp and James Philbin “Rules of the Road: Predicting Driving Behavior With a Convolutional Model of Semantic Interactions” In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2019
- “CoverNet: Multimodal Behavior Prediction Using Trajectory Sets” In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2020
- “Uncertainty-aware Short-term Motion Prediction of Traffic Actors for Autonomous Driving” In 2020 IEEE Winter Conference on Applications of Computer Vision (WACV), 2020, pp. 2084–2093 DOI: 10.1109/WACV45572.2020.9093332
- “Scene Transformer: A unified architecture for predicting multiple agent trajectories” arXiv, 2021 DOI: 10.48550/ARXIV.2106.08417
- “Multi-Head Attention for Multi-Modal Joint Vehicle Motion Forecasting” arXiv, 2019 DOI: 10.48550/ARXIV.1910.03650
- “VectorNet: Encoding HD Maps and Agent Dynamics From Vectorized Representation” In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2020
- Junru Gu, Chen Sun and Hang Zhao “DenseTNT: End-to-End Trajectory Prediction From Dense Goal Sets” In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2021, pp. 15303–15312
- “PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation” arXiv, 2016 DOI: 10.48550/ARXIV.1612.00593
- “Graph-Based Spatial-Temporal Convolutional Network for Vehicle Trajectory Prediction in Autonomous Driving” arXiv, 2021 DOI: 10.48550/ARXIV.2109.12764
- “Spectral Temporal Graph Neural Network for Trajectory Prediction” In 2021 IEEE International Conference on Robotics and Automation (ICRA), 2021, pp. 1839–1845 DOI: 10.1109/ICRA48506.2021.9561461
- “Support vector machines for multi-class pattern recognition.” In Esann 99, 1999, pp. 219–224
- “INTERACTION Dataset: An INTERnational, Adversarial and Cooperative moTION Dataset in Interactive Driving Scenarios with Semantic Maps” arXiv, 2019 DOI: 10.48550/ARXIV.1910.03088
- “Argoverse: 3D Tracking and Forecasting with Rich Maps” In Conference on Computer Vision and Pattern Recognition (CVPR), 2019
- “Lanelet2: A high-definition map framework for the future of automated driving” In 2018 21st International Conference on Intelligent Transportation Systems (ITSC), 2018, pp. 1672–1679 DOI: 10.1109/ITSC.2018.8569929
- Diederik P. Kingma and Jimmy Ba “Adam: A Method for Stochastic Optimization” arXiv, 2014 DOI: 10.48550/ARXIV.1412.6980
- Xiaoyu Mo, Yang Xing and Chen Lv “ReCoG: A Deep Learning Framework with Heterogeneous Graph for Interaction-Aware Trajectory Prediction” arXiv, 2020 DOI: 10.48550/ARXIV.2012.05032
- “Masked Label Prediction: Unified Message Passing Model for Semi-Supervised Classification” arXiv, 2020 DOI: 10.48550/ARXIV.2009.03509