Automatic Labeling to Generate Training Data for Online LiDAR-based Moving Object Segmentation (2201.04501v1)

Published 12 Jan 2022 in cs.RO

Abstract: Understanding the scene is key for autonomously navigating vehicles and the ability to segment the surroundings online into moving and non-moving objects is a central ingredient for this task. Often, deep learning-based methods are used to perform moving object segmentation (MOS). The performance of these networks, however, strongly depends on the diversity and amount of labeled training data, information that may be costly to obtain. In this paper, we propose an automatic data labeling pipeline for 3D LiDAR data to save the extensive manual labeling effort and to improve the performance of existing learning-based MOS systems by automatically generating labeled training data. Our proposed approach achieves this by processing the data offline in batches. It first exploits an occupancy-based dynamic object removal to detect possible dynamic objects coarsely. Second, it extracts segments among the proposals and tracks them using a Kalman filter. Based on the tracked trajectories, it labels the actually moving objects such as driving cars and pedestrians as moving. In contrast, the non-moving objects, e.g., parked cars, lamps, roads, or buildings, are labeled as static. We show that this approach allows us to label LiDAR data highly effectively and compare our results to those of other label generation methods. We also train a deep neural network with our auto-generated labels and achieve similar performance compared to the one trained with manual labels on the same data, and an even better performance when using additional datasets with labels generated by our approach. Furthermore, we evaluate our method on multiple datasets using different sensors and our experiments indicate that our method can generate labels in diverse environments.

Authors (7)

Xieyuanli Chen (76 papers)
Benedikt Mersch (13 papers)
Lucas Nunes (6 papers)
Rodrigo Marcuzzi (4 papers)
Ignacio Vizzo (8 papers)
Jens Behley (50 papers)
Cyrill Stachniss (98 papers)

Citations (65)

View on Semantic Scholar

Summary

Automatic Labeling to Generate Training Data for Online LiDAR-based Moving Object Segmentation

The presented paper discusses an innovative approach for generating labeled training data for moving object segmentation (MOS) in 3D LiDAR data, capitalizing on automatic labeling techniques to mitigate the prohibitive costs associated with manual data labeling. The core proposition is a pipeline for automatic data labeling, which enhances the performance of deep learning-based MOS models by relying on the intrinsic dynamics of LiDAR data, processed offline to build a comprehensive labeled dataset.

Methodology Overview

The paper elucidates a two-step procedure in the proposed pipeline. Initially, a coarse detection of dynamic objects uses occupancy-based dynamic object removal strategies, generating initial hypotheses for dynamically moving objects in the dataset. Subsequently, these hypotheses are refined through segmentation processes facilitated by a Kalman filter, which tracks potential moving object segments over time, distinguishing moving objects—such as vehicles and pedestrians—from static elements like buildings or stationary automobiles. The culmination of this process results in a labeled dataset, delineating moving and non-moving objects, leveraging a modular approach that integrates occupancy estimation techniques, clustering, and multi-object tracking.

Results and Analytical Insights

The application of this automated labeling pipeline to LiDAR data has demonstrated significant efficacy. Training a deep neural network with automatically generated labels yielded results comparable to networks trained on manually annotated datasets, thereby attesting to the robustness and accuracy of the proposed labeling technique. Moreover, the inclusion of additional datasets, labeled via the automatic pipeline, augmented network performance beyond baseline metrics. The versatility of the approach is underscored by its compatibility with various LiDAR datasets and sensors, illustrating its applicability to diverse environmental conditions and sensor setups.

Implications and Future Directions

This research holds substantial implications for the field of autonomous navigation, particularly in regard to the scalability of training data generation for learning-based MOS. The amalgamation of offline processing with real-time application potential highlights a pivotal step towards cost-effective data labeling methodologies. The automated pipeline not only addresses data scarcity but also offers a sustainable pathway to improve network robustness across varying operational environments.

Further avenues for exploration include refining the segmentation and tracking algorithms to enhance precision in dynamic object identification and extending the automated labeling techniques to more data types beyond LiDAR, potentially integrating vision-based sensors for richer environmental context. Moreover, investigating the integration of semantic scene understanding could amplify object-specific recognition and labeling accuracy, fostering advancements in training methodologies for complex urban environments.

By advancing the state of semi-supervised machine learning, this research delineates promising future trajectories in AI, particularly within autonomous driving and intelligent systems, where rapid, accurate training data generation remains a cornerstone for system efficacy and deployment.

PDF Markdown

Related Papers

Find Related Papers

YouTube

Show All Videos