CMP: Cooperative Motion Prediction with Multi-Agent Communication (2403.17916v3)
Abstract: The confluence of the advancement of Autonomous Vehicles (AVs) and the maturity of Vehicle-to-Everything (V2X) communication has enabled the capability of cooperative connected and automated vehicles (CAVs). Building on top of cooperative perception, this paper explores the feasibility and effectiveness of cooperative motion prediction. Our method, CMP, takes LiDAR signals as model input to enhance tracking and prediction capabilities. Unlike previous work that focuses separately on either cooperative perception or motion prediction, our framework, to the best of our knowledge, is the first to address the unified problem where CAVs share information in both perception and prediction modules. Incorporated into our design is the unique capability to tolerate realistic V2X transmission delays, while dealing with bulky perception representations. We also propose a prediction aggregation module, which unifies the predictions obtained by different CAVs and generates the final prediction. Through extensive experiments and ablation studies on the OPV2V and V2V4Real datasets, we demonstrate the effectiveness of our method in cooperative perception, tracking, and motion prediction. In particular, CMP reduces the average prediction error by 12.3% compared with the strongest baseline. Our work marks a significant step forward in the cooperative capabilities of CAVs, showcasing enhanced performance in complex scenarios. More details can be found on the project website: https://cmp-cooperative-prediction.github.io.
- “AVR: Augmented Vehicular Reality” In Proceedings of the 16th Annual International Conference on Mobile Systems, Applications, and Services (Mobisys), MobiSys ’18 Munich, Germany: ACM, 2018, pp. 81–95
- “AutoCast: Scalable Infrastructure-less Cooperative Perception for Distributed Collaborative Driving” In Proceedings of the 20th Annual International Conference on Mobile Systems, Applications, and Services, 2022
- “V2VNet: Vehicle-to-Vehicle Communication for Joint Perception and Prediction” In European Conference on Computer Vision Springer, 2020, pp. 605–621
- “OPV2V: An Open Benchmark Dataset and Fusion Pipeline for Perception with Vehicle-to-Vehicle Communication” In International Conference on Robotics and Automation (ICRA), 2022
- “CoBEVT: Cooperative Bird’s Eye View Semantic Segmentation with Sparse Transformers” In Conference on Robot Learning, 2022, pp. 989–1000
- “V2X-ViT: Vehicle-to-Everything Cooperative Perception with Vision Transformer” In European Conference on Computer Vision Springer, 2022, pp. 107–124
- “Collaborative Motion Prediction via Neural Motion Message Passing” In IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020, pp. 6318–6327
- “Machine Learning-Based Vehicle Trajectory Prediction Using V2V Communications and On-Board Sensors” In Electronics 10, 2021, pp. 420
- Hui Guo, Lan-lan Rui and Zhi-peng Gao “V2V Task Offloading Algorithm with LSTM-based Spatiotemporal Trajectory Prediction Model in SVCNs” In IEEE Transactions on Vehicular Technology 71.10, 2022, pp. 11017–11032
- “Motion Transformer with Global Intention Localization and Local Movement Refinement” In Advances in Neural Information Processing Systems, 2022
- “MTR++: Multi-agent motion prediction with symmetric scene modeling and guided intention querying” In IEEE Transactions on Pattern Analysis and Machine Intelligence IEEE, 2024
- “EqDrive: Efficient Equivariant Motion Forecasting with Multi-Modality for Autonomous Driving” In arXiv preprint arXiv:2310.17540, 2023
- Zaydoun Yahya Rawashdeh and Zheng Wang “Collaborative Automated Driving: A Machine Learning-based Method to Enhance the Accuracy of Shared Information” In International Conference on Intelligent Transportation Systems (ITSC), 2018, pp. 3961–3966
- “Coopernaut: End-to-End Driving with Cooperative Perception for Networked Vehicles” In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
- Donghao Qiao and Farhana H. Zulkernine “Adaptive Feature Fusion for Cooperative Perception using LiDAR Point Clouds” In IEEE/CVF Winter Conference on Applications of Computer Vision IEEE, 2023, pp. 1186–1195
- Hao Xiang, Runsheng Xu and Jiaqi Ma “HM-ViT: Hetero-modal Vehicle-to-Vehicle Cooperative Perception with Vision Transformer” In IEEE/CVF International Conference on Computer Vision IEEE, 2023, pp. 284–295
- “EvolveGraph: Multi-Agent Trajectory Prediction with Dynamic Relational Reasoning” In Advances in Neural Information Processing Systems, 2020
- “VectorNet: Encoding HD Maps and Agent Dynamics From Vectorized Representation” In IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020, pp. 11522–11530
- “Dynamics-aware spatiotemporal occupancy prediction in urban environments” In 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2022, pp. 10836–10841 IEEE
- “Spatio-temporal graph dual-attention network for multi-agent prediction and tracking” In IEEE Transactions on Intelligent Transportation Systems 23.8 IEEE, 2021, pp. 10556–10569
- “MultiPath++: Efficient Information Fusion and Trajectory Aggregation for Behavior Prediction” In International Conference on Robotics and Automation (ICRA), 2022
- “Loki: Long term and key intentions for trajectory prediction” In Proceedings of the IEEE/CVF International Conference on Computer Vision, 2021, pp. 9803–9812
- “Shared cross-modal trajectory prediction for autonomous driving” In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2021, pp. 244–253
- “M2I: From Factored Marginal Trajectory Prediction to Interactive Prediction” In IEEE/CVF Conference on Computer Vision and Pattern Recognition IEEE, 2022, pp. 6533–6542
- Bernard Lange, Jiachen Li and Mykel J Kochenderfer “Scene Informer: Anchor-based Occlusion Inference and Trajectory Prediction in Partially Observable Environments” In International Conference on Robotics and Automation (ICRA), 2024
- “Disentangled Neural Relational Inference for Interpretable Motion Prediction” In IEEE Robotics and Automation Letters IEEE, 2023
- “Learning cooperative trajectory representations for motion forecasting” In arXiv preprint arXiv:2311.00371, 2023
- “Game Theory-Based Simultaneous Prediction and Planning for Autonomous Vehicle Navigation in Crowded Environments” In 2023 IEEE 26th International Conference on Intelligent Transportation Systems (ITSC), 2023, pp. 2977–2984 IEEE
- “Equivariant Map and Agent Geometry for Autonomous Driving Motion Prediction” In 2023 International Conference on Electrical, Computer and Energy Technologies (ICECET), 2023, pp. 1–6 IEEE
- Junru Gu, Chen Sun and Hang Zhao “DenseTNT: End-to-end Trajectory Prediction from Dense Goal Sets” In IEEE/CVF International Conference on Computer Vision IEEE, 2021, pp. 15283–15292
- “Scene Transformer: A unified architecture for predicting future trajectories of multiple agents” In International Conference on Learning Representations, 2022
- “3D Multi-Object Tracking: A Baseline and New Evaluation Metrics” In IEEE/RSJ International Conference on Intelligent Robots and Systems IEEE, 2020, pp. 10359–10366
- “PointPillars: Fast Encoders for Object Detection from Point Clouds” In IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019
- “PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation” In 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017
- “An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale” In International Conference on Learning Representations, 2020
- “Focal Loss for Dense Object Detection” In IEEE International Conference on Computer Vision (ICCV), 2017
- “Model-Agnostic Multi-Agent Perception Framework” In IEEE International Conference on Robotics and Automation IEEE, 2023, pp. 1471–1478
- “Decoupled Weight Decay Regularization” In International Conference on Learning Representations, 2018