
Online Vectorized HD Map Construction using Geometry (2312.03341v2)

Published 6 Dec 2023 in cs.CV and cs.AI

Abstract: The construction of online vectorized High-Definition (HD) maps is critical for downstream prediction and planning. Recent efforts have built strong baselines for this task, however, shapes and relations of instances in urban road systems are still under-explored, such as parallelism, perpendicular, or rectangle-shape. In our work, we propose GeMap ($\textbf{Ge}$ometry $\textbf{Map}$), which end-to-end learns Euclidean shapes and relations of map instances beyond basic perception. Specifically, we design a geometric loss based on angle and distance clues, which is robust to rigid transformations. We also decouple self-attention to independently handle Euclidean shapes and relations. Our method achieves new state-of-the-art performance on the NuScenes and Argoverse 2 datasets. Remarkably, it reaches a 71.8% mAP on the large-scale Argoverse 2 dataset, outperforming MapTR V2 by +4.4% and surpassing the 70% mAP threshold for the first time. Code is available at https://github.com/cnzzx/GeMap.


Summary

  • The paper introduces GeMap, a novel framework utilizing geometric loss functions and geometry-decoupled attention to enhance online HD map construction.
  • The approach employs Euclidean shape and relation clues, BEV feature extraction, and rotation-invariant representations to achieve superior mapping accuracy.
  • Experimental results show GeMap achieving 71.8% mAP on Argoverse 2, a +4.4% improvement over the previous best method, MapTR V2, and the first result to cross the 70% mAP threshold.

An In-depth Review of GeMap: A Framework for Online Vectorized HD Map Construction

The paper "Online Vectorized HD Map Construction using Geometry" proposes GeMap, a novel approach to efficiently constructing vectorized high-definition maps, a task critical for autonomous driving systems. The research focuses on modeling and exploiting the geometric regularities of urban road systems, such as parallelism, perpendicularity, and rectangular shapes, which the perception-centric techniques of existing methods leave largely unexplored.

GeMap Framework

GeMap introduces an innovative framework that learns Euclidean shapes and spatial relations of map instances, thereby allowing for more effective map construction. The framework comprises several key components:

  1. Geometric Loss: Central to this work is a geometric loss built on angle and distance clues, which makes it robust to rigid transformations (translations and rotations). This loss function, termed the Euclidean Loss, incorporates two main elements:
    • Euclidean Shape Clues, which focus on the shape of each map instance.
    • Euclidean Relation Clues, which capture the relational properties between multiple map instances.
  2. Geometry-Decoupled Attention (GDA): A second key innovation is GDA, an adapted attention mechanism that decouples self-attention so that the Euclidean shape of each instance and the relations between instances are handled independently. This separation simplifies the learning of complex geometries.
  3. BEV Representation and G-Representation: The use of a Bird's-Eye-View (BEV) feature extractor over multi-view images lays the groundwork for the model. Complementing this, a translation- and rotation-invariant representation, termed the G-Representation, enables the model to exploit instance geometry effectively.
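The invariance idea behind these geometric clues can be illustrated with a minimal sketch (function names here are illustrative, not the paper's API): per-edge lengths act as distance clues, cosines of the angles between consecutive edges act as angle clues, and an L1 penalty between predicted and ground-truth clues is unchanged under any rotation or translation of the instance.

```python
import numpy as np

def shape_clues(points: np.ndarray):
    """Rigid-transform-invariant shape clues for one map instance,
    given as an (N, 2) array of polyline vertices.

    Returns per-edge lengths (distance clues) and cosines of angles
    between consecutive edge vectors (angle clues); both are unchanged
    by any translation or rotation of the input."""
    edges = np.diff(points, axis=0)            # (N-1, 2) edge vectors
    lengths = np.linalg.norm(edges, axis=1)    # distance clues
    # Cosine of the angle between consecutive edges (angle clues).
    dots = np.sum(edges[:-1] * edges[1:], axis=1)
    cosines = dots / (lengths[:-1] * lengths[1:] + 1e-8)
    return lengths, cosines

def geometric_loss(pred: np.ndarray, gt: np.ndarray) -> float:
    """L1 discrepancy between predicted and ground-truth shape clues."""
    pl, pc = shape_clues(pred)
    gl, gc = shape_clues(gt)
    return float(np.abs(pl - gl).mean() + np.abs(pc - gc).mean())
```

Because the clues depend only on relative point positions, rigidly transforming a polyline leaves the loss at (numerically) zero, whereas an actual shape change, such as scaling, does not.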

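A simplified single-head sketch of the decoupling idea behind GDA (illustrative only; the actual mechanism uses learned projections, multiple heads, and the G-Representation): attention first mixes points within each instance (shape), then mixes instances at each point index (relations), instead of attending jointly over all points of all instances.

```python
import numpy as np

def attention(q, k, v):
    """Plain scaled dot-product attention, single head, no projections."""
    scores = q @ k.swapaxes(-1, -2) / np.sqrt(q.shape[-1])
    w = np.exp(scores - scores.max(axis=-1, keepdims=True))
    w /= w.sum(axis=-1, keepdims=True)
    return w @ v

def decoupled_attention(x: np.ndarray) -> np.ndarray:
    """Decoupled self-attention over an (I, P, D) tensor of I map
    instances, each with P point embeddings of dimension D.

    Step 1 attends within each instance (shape); step 2 transposes and
    attends across instances (relations). Compared with joint attention
    over all I*P tokens, this sketch scales as O(I*P^2 + P*I^2) rather
    than O((I*P)^2)."""
    intra = attention(x, x, x)                    # (I, P, D): points within an instance
    xt = intra.swapaxes(0, 1)                     # (P, I, D): instances per point index
    inter = attention(xt, xt, xt).swapaxes(0, 1)  # back to (I, P, D)
    return inter
```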
Implementation and Results

GeMap is evaluated on the NuScenes and Argoverse 2 datasets, where it achieves new state-of-the-art performance. Most notably, it reaches 71.8% mAP on the large-scale Argoverse 2 dataset, outperforming MapTR V2 by +4.4% and crossing the 70% mAP threshold for the first time. The approach is also computationally efficient, maintaining competitive FPS at inference.

Implications and Future Directions

The findings underscore the potential of geometric properties to enhance the robustness and accuracy of HD map construction. GeMap's explicit separation of shapes and relations gives networks a structured way to interpret complex mapping environments, supporting more reliable autonomous driving systems.

Theoretically, this research probes how deeply geometric priors can be integrated into neural network-based systems. Practically, approaches like GeMap may become integral components of real-time applications where accuracy and robustness to coordinate transformations are paramount.

Future developments could explore richer geometric representations or more intricate geometric patterns, potentially extending the benefits observed here to broader navigation and spatial-awareness tasks in AI-driven systems. Exploring how geometric features cope with partial occlusions or adverse conditions in autonomous settings is another promising avenue for subsequent research.

Overall, GeMap provides a robust framework addressing key challenges in online vectorized HD map construction, enhancing both the theoretical and practical paradigms within this evolving field of autonomous technology. The work establishes a firm basis for subsequent investigations aimed at refining and extending the described methodologies.
