Producing and Leveraging Online Map Uncertainty in Trajectory Prediction (2403.16439v1)
Abstract: High-definition (HD) maps have played an integral role in the development of modern autonomous vehicle (AV) stacks, albeit with high associated labeling and maintenance costs. As a result, many recent works have proposed methods for estimating HD maps online from sensor data, enabling AVs to operate outside of previously-mapped regions. However, current online map estimation approaches are developed in isolation of their downstream tasks, complicating their integration in AV stacks. In particular, they do not produce uncertainty or confidence estimates. In this work, we extend multiple state-of-the-art online map estimation methods to additionally estimate uncertainty and show how this enables more tightly integrating online mapping with trajectory forecasting. In doing so, we find that incorporating uncertainty yields up to 50% faster training convergence and up to 15% better prediction performance on the real-world nuScenes driving dataset.
- nuScenes: A multimodal dataset for autonomous driving. In IEEE Conf. on Computer Vision and Pattern Recognition, 2020.
- Structured bird’s-eye-view traffic scene understanding from onboard images. In IEEE Int. Conf. on Computer Vision, 2021.
- Argoverse: 3d tracking and forecasting with rich maps. In IEEE Conf. on Computer Vision and Pattern Recognition, 2019.
- Parting with misconceptions about learning-based vehicle motion planning. In Conf. on Robot Learning, 2023.
- Multimodal trajectory prediction conditioned on lane-graph traversals. In Conf. on Robot Learning, 2021.
- PivotNet: Vectorized pivot learning for end-to-end HD map construction. In IEEE Int. Conf. on Computer Vision, 2023.
- SuperFusion: Multilevel LiDAR-camera fusion for long-range HD map generation. arXiv preprint arXiv:2211.15656, 2022.
- Large scale interactive motion forecasting for autonomous driving: The waymo open motion dataset. In IEEE Int. Conf. on Computer Vision, 2021.
- VectorNet: Encoding HD maps and agent dynamics from vectorized representation. In IEEE Conf. on Computer Vision and Pattern Recognition, 2020.
- HOME: Heatmap output for future motion estimation. In Proc. IEEE Int. Conf. on Intelligent Transportation Systems, 2021.
- GOHOME: Graph-oriented heatmap output for future motion estimation. In Proc. IEEE Conf. on Robotics and Automation, 2022a.
- THOMAS: Trajectory heatmap output with learned multi-agent sampling. In Int. Conf. on Learning Representations, 2022b.
- DenseTNT: End-to-end trajectory prediction from dense goal sets. In IEEE Int. Conf. on Computer Vision, 2021.
- Planning-oriented autonomous driving. In IEEE Conf. on Computer Vision and Pattern Recognition, 2023.
- BEVPoolv2: A cutting-edge implementation of BEVDet toward deployment. arXiv preprint arXiv:2211.17111, 2022.
- Expanding the deployment envelope of behavior prediction via adaptive meta-learning. In Proc. IEEE Conf. on Robotics and Automation, 2023a.
- trajdata: A unified interface to multiple human trajectory datasets. In Conf. on Neural Information Processing Systems Datasets and Benchmarks Track, New Orleans, USA, 2023b.
- VAD: Vectorized scene representation for efficient autonomous driving. In IEEE Int. Conf. on Computer Vision, 2023.
- HDMapNet: An online HD map construction and evaluation framework. In Proc. IEEE Conf. on Robotics and Automation, 2022a.
- BEVFormer: Learning bird’s-eye-view representation from multi-camera images via spatiotemporal transformers. In European Conf. on Computer Vision, 2022b.
- Learning lane graph representations for motion forecasting. In European Conf. on Computer Vision, 2020.
- MapTR: Structured modeling and learning for online vectorized HD map construction. In Int. Conf. on Learning Representations, 2023a.
- MapTRv2: An end-to-end framework for online vectorized HD map construction. arXiv preprint arXiv:2308.05736, 2023b.
- Multimodal motion prediction with stacked transformers. In IEEE Conf. on Computer Vision and Pattern Recognition, 2021.
- VectorMapNet: End-to-end vectorized HD map learning. In Int. Conf. on Machine Learning. PMLR, 2023a.
- BEVFusion: Multi-task multi-sensor fusion with unified bird’s-eye view representation. In Proc. IEEE Conf. on Robotics and Automation, 2023b.
- CoverNet: Multimodal behavior prediction using trajectory sets. In IEEE Conf. on Computer Vision and Pattern Recognition, 2020.
- Lift, Splat, Shoot: Encoding images from arbitrary camera rigs by implicitly unprojecting to 3D. In European Conf. on Computer Vision, 2020.
- End-to-end vectorized HD-map construction with piecewise bezier curve. In IEEE Conf. on Computer Vision and Pattern Recognition, 2023.
- Human motion trajectory prediction: A survey. Int. Journal of Robotics Research, 39(8):895–935, 2020.
- Trajectron++: Dynamically-feasible trajectory forecasting with heterogeneous data. In European Conf. on Computer Vision, 2020.
- InstaGraM: Instance-level graph modeling for vectorized HD map learning. arXiv preprint arXiv:2301.04470, 2023.
- Scene as occupancy. In IEEE Int. Conf. on Computer Vision, 2023.
- Attention is all you need. In Conf. on Neural Information Processing Systems, 2017.
- Waymo. Safety report, 2021. Available at https://waymo.com/safety/safety-report.
- Argoverse 2: Next generation datasets for self-driving perception and forecasting. In Conf. on Neural Information Processing Systems Datasets and Benchmarks Track, 2021.
- InsightMapper: A closer look at inner-instance information for vectorized high-definition mapping. arXiv preprint arXiv:2308.08543, 2023.
- StreamMapNet: Streaming mapping network for vectorized online HD map construction. In IEEE Winter Conf. on Applications of Computer Vision, 2024.
- AgentFormer: Agent-aware transformers for socio-temporal multi-agent forecasting. In IEEE Int. Conf. on Computer Vision, pages 9813–9823, 2021.
- TNT: Target-driveN Trajectory Prediction. In Conf. on Robot Learning, 2020.
- HiVT: Hierarchical vector transformer for multi-agent motion prediction. In IEEE Conf. on Computer Vision and Pattern Recognition, 2022.