Toronto-3D Dataset for Semantic Segmentation
The paper introduces Toronto-3D, a large-scale urban point cloud dataset intended to advance semantic segmentation, particularly for autonomous driving and urban HD mapping. The dataset covers roughly 1 km of urban streets in Toronto, Canada, captured with a vehicle-mounted Mobile Laser Scanning (MLS) system. It contains approximately 78.3 million points labeled into eight object classes: road, road marking, natural, building, utility line, pole, car, and fence.
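To make the class structure concrete, here is a minimal sketch of tallying the per-class share of points in a labeled tile. The integer-to-name mapping below (with 0 reserved for unclassified points) is an assumption about the label encoding, not something the summary above specifies.

```python
import numpy as np

# Assumed label ids for the eight Toronto-3D classes; 0 = unclassified.
CLASS_NAMES = {
    1: "Road", 2: "Road marking", 3: "Natural", 4: "Building",
    5: "Utility line", 6: "Pole", 7: "Car", 8: "Fence",
}

def class_distribution(labels: np.ndarray) -> dict:
    """Return the fraction of labeled points per class, ignoring label 0."""
    ids, counts = np.unique(labels[labels > 0], return_counts=True)
    total = counts.sum()
    return {CLASS_NAMES[i]: c / total
            for i, c in zip(ids.tolist(), counts.tolist())}

# Toy labels standing in for a real tile's per-point annotations.
labels = np.array([1, 1, 1, 4, 4, 3, 0, 7])
dist = class_distribution(labels)
```

In practice such a tally is useful because MLS datasets are heavily imbalanced: road points dominate while classes like fence contribute only a small fraction.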
Existing Point Cloud Datasets
Point cloud data is becoming increasingly accessible thanks to advances in LiDAR technology and growing interest in applications requiring 3D vision. However, existing datasets present challenges, including large data volumes and sensor noise, which complicate manual labeling. The paper compares popular datasets such as Oakland 3-D, iQmulus, Semantic3D, Paris-Lille-3D, and SemanticKITTI, each with its own scope, point density, and label granularity, and with corresponding limitations. Toronto-3D aims to complement these datasets by offering a high-quality resource that introduces new object classes and preserves a realistic range of point-density variation.
Toronto-3D Dataset Characteristics
Key characteristics of the Toronto-3D dataset include full coverage of the LiDAR measurement range (up to approximately 100 m) and point densities that vary because streets were scanned repeatedly during data collection. The dataset also uniquely includes challenging classes such as road markings and utility lines, which are underrepresented in existing datasets. Together, these properties make it a more demanding test of the robustness of semantic segmentation models.
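One standard way to tame the density variation caused by repeated MLS passes is voxel-grid downsampling, which keeps a single centroid per occupied voxel. The sketch below is a generic NumPy implementation of that technique, not part of the paper's pipeline; the 0.1 m voxel size is an illustrative choice.

```python
import numpy as np

def voxel_downsample(points: np.ndarray, voxel: float) -> np.ndarray:
    """Collapse an (N, 3) point array to one centroid per occupied voxel."""
    # Integer voxel coordinates for each point.
    keys = np.floor(points / voxel).astype(np.int64)
    # Group points that share a voxel and average each group.
    _, inverse = np.unique(keys, axis=0, return_inverse=True)
    n_voxels = inverse.max() + 1
    sums = np.zeros((n_voxels, 3))
    np.add.at(sums, inverse, points)
    counts = np.bincount(inverse, minlength=n_voxels).astype(float)
    return sums / counts[:, None]

# Two nearly coincident points collapse into one centroid at 0.1 m voxels.
pts = np.array([[0.01, 0.01, 0.0],
                [0.02, 0.03, 0.0],
                [1.00, 1.00, 1.0]])
down = voxel_downsample(pts, 0.1)
```

Whether to normalize density this way is a modeling decision: the paper deliberately keeps the raw density variation so that models are evaluated under realistic acquisition conditions.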
Baseline Evaluation
The paper evaluates several state-of-the-art point-based deep learning models on the Toronto-3D dataset, including PointNet++, DGCNN, KPFCNN, MS-PCNN, TGNet, and a newly proposed MS-TGNet. While KPFCNN achieved the highest overall accuracy and performed well across several object classes, the proposed MS-TGNet slightly outperformed it in mean Intersection over Union (mIoU) by better capturing the road marking and natural categories. Despite these baseline strengths, the road marking and fence classes remain difficult, leaving clear room for improvement.
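The mIoU metric used to rank these baselines is the per-class intersection over union, IoU_c = TP_c / (TP_c + FP_c + FN_c), averaged over classes. A minimal sketch of computing it from a confusion matrix:

```python
import numpy as np

def mean_iou(conf: np.ndarray) -> float:
    """mIoU from a C x C confusion matrix (rows: ground truth, cols: prediction)."""
    tp = np.diag(conf).astype(float)
    fp = conf.sum(axis=0) - tp  # predicted as class c but actually another class
    fn = conf.sum(axis=1) - tp  # actually class c but predicted as another class
    denom = tp + fp + fn
    # Guard against empty classes (denominator of zero).
    iou = np.where(denom > 0, tp / np.maximum(denom, 1), 0.0)
    return float(iou.mean())

# Two-class toy example: perfect on class 0, half-right on class 1.
conf = np.array([[10, 0],
                 [5, 5]])
score = mean_iou(conf)
```

Because mIoU weights every class equally regardless of point count, it rewards models that handle rare classes like road markings, which explains why MS-TGNet can lead on mIoU while KPFCNN leads on overall accuracy.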
Implications and Future Directions
The introduction of Toronto-3D as a comprehensive point cloud dataset offers the research community an opportunity to develop more innovative deep learning models for semantic segmentation. Given its inclusion of underrepresented object classes and realistic data collection conditions, future work can explore algorithms, such as RandLA-Net, capable of efficiently processing large-scale point clouds with variable densities. Furthermore, the dataset is expected to evolve with contributions from the research community, leading to improved labeling accuracy and expanded applications across smart city infrastructure and autonomous navigation systems.
Toronto-3D sets the stage for advancing point cloud semantics within urban environments, emphasizing the need for robust classification methods capable of generating high-quality annotations across diverse object classes.