Lane Graph as Path: Continuity-preserving Path-wise Modeling for Online Lane Graph Construction (2303.08815v3)
Abstract: Online lane graph construction is a promising but challenging task in autonomous driving. Previous methods usually model the lane graph at the pixel or piece level and recover it by pixel-wise or piece-wise connection, which breaks the continuity of the lane and results in suboptimal performance. Human drivers focus on and drive along continuous, complete paths rather than considering lane pieces. Autonomous vehicles also require path-specific guidance from the lane graph for trajectory planning. We argue that the path, which indicates the traffic flow, is the primitive of the lane graph. Motivated by this, we propose to model the lane graph in a novel path-wise manner, which preserves the continuity of the lane and encodes traffic information for planning. We present a path-based online lane graph construction method, termed LaneGAP, which learns paths end-to-end and recovers the lane graph via a Path2Graph algorithm. We qualitatively and quantitatively demonstrate the superior accuracy and efficiency of LaneGAP over conventional pixel-based and piece-based methods on the challenging nuScenes and Argoverse2 datasets under controllable and fair conditions. Compared to the recent state-of-the-art piece-wise method TopoNet on the OpenLane-V2 dataset, LaneGAP still outperforms it by 1.6 mIoU, further validating the effectiveness of path-wise modeling. Abundant visualizations in the supplementary material show that LaneGAP can cope with diverse traffic conditions. Code is released at https://github.com/hustvl/LaneGAP.