Collaborative Dynamic 3D Scene Graphs for Automated Driving (2309.06635v3)

Published 12 Sep 2023 in cs.RO

Abstract: Maps have played an indispensable role in enabling safe and automated driving. Although there have been many advances on different fronts ranging from SLAM to semantics, building an actionable hierarchical semantic representation of urban dynamic scenes and processing information from multiple agents are still challenging problems. In this work, we present Collaborative URBan Scene Graphs (CURB-SG) that enable higher-order reasoning and efficient querying for many functions of automated driving. CURB-SG leverages panoptic LiDAR data from multiple agents to build large-scale maps using an effective graph-based collaborative SLAM approach that detects inter-agent loop closures. To semantically decompose the obtained 3D map, we build a lane graph from the paths of ego agents and their panoptic observations of other vehicles. Based on the connectivity of the lane graph, we segregate the environment into intersecting and non-intersecting road areas. Subsequently, we construct a multi-layered scene graph that includes lane information, the position of static landmarks and their assignment to certain map sections, other vehicles observed by the ego agents, and the pose graph from SLAM including 3D panoptic point clouds. We extensively evaluate CURB-SG in urban scenarios using a photorealistic simulator. We release our code at http://curb.cs.uni-freiburg.de.


Summary

  • The paper proposes a centralized collaborative SLAM framework that aggregates multi-agent LiDAR data and optimizes global pose graphs.
  • It constructs a hierarchical, semantic-rich 3D scene graph that segments and categorizes both static and dynamic urban elements.
  • Empirical results in CARLA simulations demonstrate reduced localization errors and enhanced mapping precision through multi-agent cooperation.

An Analytical Overview of Collaborative Dynamic 3D Scene Graphs for Automated Driving

The paper presents an advanced methodology for constructing Collaborative Dynamic 3D Scene Graphs (CURB-SG) specifically tailored for automated driving applications. This work situates itself at the intersection of simultaneous localization and mapping (SLAM), high-definition (HD) semantic mapping, and automated driving (AD), addressing the inherent challenges in representing dynamic urban environments. The authors propose a centralized collaborative SLAM approach that leverages multi-agent LiDAR data, enriched with semantic segmentation, to construct and maintain an evolving hierarchical scene graph.

Technical Contributions and Methodological Innovations

  1. Collaborative SLAM Framework: At the core of this research is a centralized SLAM approach that efficiently aggregates observations from multiple agents. Agents independently process LiDAR data to estimate odometry and detect static and dynamic elements, transmitting keyframe packages to a central server. The server, in turn, detects intra- and inter-agent loop closures to optimize a global pose graph. This method enables fast and frequent map updates over large-scale environments by dynamically restructuring the pose graph through edge contraction (see the first sketch after this list).
  2. Semantic Scene Graph Construction: CURB-SG introduces a multi-layered scene graph structure. Using the trajectories of the ego agents and their observations of other vehicles, a lane graph is constructed, which facilitates segmenting the environment into intersecting and non-intersecting road areas. The hierarchical scene graph further decomposes these areas according to the static and dynamic entities they contain, significantly improving spatial and semantic querying (see the second sketch after this list).
  3. Panoptic Data Integration: The approach is predicated on integrating panoptic segmentation data to enrich the semantic representation within the SLAM framework. Allocating distinct voxel resolutions to different semantic classes provides higher granularity where necessary, improving the precision of map localization and the robustness to sensor noise (see the third sketch after this list).
  4. Interconnection of SLAM and Graph-based Representations: The tightly coupled integration of scene graphs with SLAM-derived pose graphs is a distinctive feature of this work. It combines the strengths of metric and semantic mapping with topological abstractions, unifying spatial data for downstream automated driving functions such as perception, planning, and control (see the fourth sketch after this list).
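
To make the first contribution concrete, the following minimal Python sketch shows how a central server might maintain a single pose graph over keyframes streamed by several agents, link them with intra- and inter-agent loop-closure edges, and compact the graph by contracting the endpoints of redundant edges. The class, method names, and use of networkx are illustrative assumptions, not the authors' released implementation, which additionally performs global pose-graph optimization.

```python
# Minimal sketch of a centralized collaborative pose graph (illustrative only).
# Each agent streams keyframes; the server links consecutive keyframes with
# odometry edges, adds loop-closure edges within and across agents, and
# occasionally contracts edges to keep the global graph compact.
import networkx as nx

class CentralPoseGraph:
    def __init__(self):
        self.graph = nx.Graph()

    def add_keyframe(self, agent_id, kf_id, pose):
        node = (agent_id, kf_id)
        self.graph.add_node(node, pose=pose)
        if kf_id > 0:  # odometry edge to the agent's previous keyframe
            self.graph.add_edge((agent_id, kf_id - 1), node, kind="odometry")
        return node

    def add_loop_closure(self, node_a, node_b, relative_pose):
        # Intra- or inter-agent loop closure found by place recognition.
        self.graph.add_edge(node_a, node_b, kind="loop", rel=relative_pose)

    def contract(self, keep, drop):
        # Edge contraction: merge a redundant keyframe into a neighbor so the
        # global graph stays small as the map grows.
        self.graph = nx.contracted_nodes(self.graph, keep, drop, self_loops=False)

# Usage: two agents, one inter-agent loop closure, then compaction.
pg = CentralPoseGraph()
a0 = pg.add_keyframe("agent_a", 0, pose=(0.0, 0.0, 0.0))
a1 = pg.add_keyframe("agent_a", 1, pose=(5.0, 0.0, 0.0))
b0 = pg.add_keyframe("agent_b", 0, pose=(5.2, 0.1, 0.0))
pg.add_loop_closure(a1, b0, relative_pose=(0.2, 0.1, 0.0))
pg.contract(keep=a1, drop=b0)
print(pg.graph.number_of_nodes(), pg.graph.number_of_edges())
```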
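
The lane-graph construction and road segregation of the second contribution can be approximated by snapping observed centerline waypoints into a graph and flagging nodes where more than two lane segments meet as intersection candidates. This degree-based rule and the helper functions below are simplified assumptions, not the paper's actual connectivity analysis.

```python
# Minimal sketch of connectivity-based road segregation (illustrative only):
# lane centerlines derived from ego trajectories and tracked vehicles are
# merged into one graph, and nodes where more than two segments meet are
# treated as intersection seeds.
import networkx as nx

def build_lane_graph(polylines, resolution=1.0):
    """Snap waypoints of each polyline onto a grid and connect successive ones."""
    g = nx.Graph()
    for line in polylines:
        prev = None
        for x, y in line:
            node = (round(x / resolution), round(y / resolution))
            g.add_node(node)
            if prev is not None and prev != node:
                g.add_edge(prev, node)
            prev = node
    return g

def split_intersections(g):
    """Nodes with degree > 2 indicate merging/branching lanes -> intersections."""
    intersection = {n for n in g.nodes if g.degree(n) > 2}
    return intersection, set(g.nodes) - intersection

# Usage: two straight lanes crossing at the origin.
lane_a = [(-10.0, 0.0), (-5.0, 0.0), (0.0, 0.0), (5.0, 0.0), (10.0, 0.0)]
lane_b = [(0.0, -10.0), (0.0, -5.0), (0.0, 0.0), (0.0, 5.0), (0.0, 10.0)]
graph = build_lane_graph([lane_a, lane_b], resolution=5.0)
inter, roads = split_intersections(graph)
print(sorted(inter))  # expected: [(0, 0)]
print(len(roads))     # remaining non-intersection waypoints
```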
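
For the panoptic integration of the third contribution, class-dependent voxelization can be sketched with plain NumPy as follows. The per-class voxel sizes and class names are assumptions chosen for illustration rather than the parameters used in the paper.

```python
# Minimal sketch of class-dependent voxel downsampling (illustrative only).
# Points carrying panoptic labels are voxelized more finely for thin classes
# (e.g. poles, traffic signs) and more coarsely for large surfaces (e.g. road).
# The voxel sizes below are assumptions, not the values used in the paper.
import numpy as np

VOXEL_SIZE = {"pole": 0.05, "traffic_sign": 0.05, "building": 0.2, "road": 0.5}

def voxel_downsample(points, voxel_size):
    """Keep one point per occupied voxel (first point encountered)."""
    keys = np.floor(points / voxel_size).astype(np.int64)
    _, idx = np.unique(keys, axis=0, return_index=True)
    return points[np.sort(idx)]

def downsample_panoptic(points, labels):
    """Apply a per-class voxel size to a labeled point cloud."""
    out = []
    for cls, size in VOXEL_SIZE.items():
        mask = labels == cls
        if np.any(mask):
            out.append(voxel_downsample(points[mask], size))
    return np.vstack(out) if out else np.empty((0, 3))

# Usage: a toy labeled cloud with two classes.
pts = np.random.default_rng(0).uniform(0.0, 2.0, size=(1000, 3))
lbl = np.array(["road"] * 500 + ["pole"] * 500)
print(downsample_panoptic(pts, lbl).shape)
```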
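
Finally, the layered organization highlighted in the fourth contribution (lane graph, static landmarks assigned to map sections, observed vehicles, and the SLAM pose graph with panoptic point clouds) can be pictured as a container with one field per layer that supports hierarchical queries. The dataclasses and field names below are hypothetical, not the released data structures.

```python
# Minimal sketch of a multi-layered urban scene graph (illustrative only).
# Layers follow the abstract's description; field names are assumptions.
from dataclasses import dataclass, field

@dataclass
class Landmark:
    label: str          # e.g. "traffic_sign", "pole"
    position: tuple     # (x, y, z) in the map frame
    section_id: int     # road partition the landmark is assigned to

@dataclass
class VehicleObservation:
    track_id: int
    pose: tuple
    observed_by: str    # ego agent that observed this vehicle

@dataclass
class SceneGraph:
    lane_graph: dict = field(default_factory=dict)   # section_id -> waypoints
    landmarks: list = field(default_factory=list)    # Landmark entries
    vehicles: list = field(default_factory=list)     # VehicleObservation entries
    pose_graph: dict = field(default_factory=dict)   # keyframe -> (pose, point cloud)

    def landmarks_in_section(self, section_id):
        # Hierarchical query: everything assigned to one road partition.
        return [lm for lm in self.landmarks if lm.section_id == section_id]

# Usage: one landmark and one observed vehicle attached to section 3.
sg = SceneGraph()
sg.landmarks.append(Landmark("traffic_sign", (12.0, 4.5, 2.1), section_id=3))
sg.vehicles.append(VehicleObservation(track_id=7, pose=(15.0, 5.0, 0.0), observed_by="agent_a"))
print(sg.landmarks_in_section(3))
```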

Quantitative Insights and Empirical Analysis

The experimental evaluation in the CARLA simulator demonstrates that CURB-SG accurately constructs scene graphs across multiple urban scenarios. Localization and mapping errors decrease as more agents participate, and collaboration also improves exploration efficiency, supporting the system's efficacy in cooperative settings.

The constructed lane graphs, evaluated via metrics such as TOPO, GEO, and APLS, show improved recall and graph IoU when observations of other vehicles are incorporated, affirming the utility of integrating observed dynamic entities into the mapping process. The partitioning of the environment into meaningful urban structures, verified against ground-truth data, achieves high precision and recall, underscoring the viability of the proposed method for real-world applications.

Implications and Future Directions

The CURB-SG framework offers theoretical and practical implications for the field of automated driving. By leveraging collaborative and hierarchical representations, it bridges the gap between rich environmental semantics and real-time mapping demands. The framework's potential to integrate diverse data sources supports future scalability and adaptability to complex urban landscapes.

Looking forward, further developments could address decentralized implementations to improve real-time performance under resource constraints. Extending the framework to include pedestrian dynamics and additional topological features like road boundaries could provide a more comprehensive urban schema. The transition from simulation to real-world deployment remains a compelling challenge, necessitating robust handling of variable data fidelity and real-world uncertainties.

In summary, the CURB-SG approach presents a significant advancement in managing semantic-rich urban environments for automated driving. Its innovative blend of collaborative SLAM and semantic scene graph construction offers a robust platform for realizing high-level autonomous vehicle functionalities.
