EqDrive: Efficient Equivariant Motion Forecasting with Multi-Modality for Autonomous Driving (2310.17540v2)
Abstract: Forecasting vehicular motion in autonomous driving requires a deep understanding of agent interactions and the preservation of motion equivariance under Euclidean geometric transformations. Traditional models often lack the sophistication needed to handle the intricate dynamics of autonomous vehicles and the interaction relationships among agents in the scene. As a result, they have lower model capacity, which leads to higher prediction errors and lower training efficiency. In our research, we employ EqMotion, a leading equivariant particle and human prediction model that also accounts for invariant agent interactions, for the task of multi-agent vehicle motion forecasting. In addition, we use a multi-modal prediction mechanism to account for multiple possible future paths in a probabilistic manner. By leveraging EqMotion, our model achieves state-of-the-art (SOTA) performance with fewer parameters (1.2 million) and a significantly reduced training time (less than 2 hours).
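To make the equivariance property concrete: a predictor f is equivariant under a Euclidean transformation g (a rotation plus a translation) if f(g(x)) = g(f(x)), so transforming the observed history and then forecasting gives the same result as forecasting and then transforming. The sketch below checks this numerically for a toy constant-velocity predictor in 2D. It is only an illustration of the property and is not the EqMotion architecture; the function names and sample trajectory are assumptions made for this example.

```python
import math


def rotate_translate(points, angle, shift):
    """Apply a 2D rotation by `angle` (radians), then a translation by `shift`."""
    c, s = math.cos(angle), math.sin(angle)
    return [(c * x - s * y + shift[0], s * x + c * y + shift[1]) for x, y in points]


def constant_velocity_forecast(history, horizon):
    """Toy predictor (hypothetical stand-in): extrapolate the last displacement."""
    (x0, y0), (x1, y1) = history[-2], history[-1]
    vx, vy = x1 - x0, y1 - y0
    return [(x1 + vx * k, y1 + vy * k) for k in range(1, horizon + 1)]


def max_equivariance_gap(history, horizon, angle, shift):
    """Largest pointwise distance between f(g(x)) and g(f(x))."""
    transformed_then_predicted = constant_velocity_forecast(
        rotate_translate(history, angle, shift), horizon
    )
    predicted_then_transformed = rotate_translate(
        constant_velocity_forecast(history, horizon), angle, shift
    )
    return max(
        math.hypot(ax - bx, ay - by)
        for (ax, ay), (bx, by) in zip(
            transformed_then_predicted, predicted_then_transformed
        )
    )


# Made-up observed positions of a single agent.
history = [(0.0, 0.0), (1.0, 0.5), (2.0, 1.5)]
gap = max_equivariance_gap(history, horizon=3, angle=math.pi / 3, shift=(4.0, -2.0))
print(gap < 1e-9)
```

The gap is zero up to floating-point error because the toy predictor is linear in the positions and uses only displacements. A learned model does not have this property by default, which is why an equivariant design such as EqMotion builds it into the network.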