Semantic Dense Reconstruction with Consistent Scene Segments (2109.14821v1)
Abstract: In this paper, a method for dense semantic 3D scene reconstruction from an RGB-D sequence is proposed to solve high-level scene understanding tasks. First, each RGB-D pair is consistently segmented into 2D semantic maps based on a camera-tracking backbone that propagates object labels with high probability from full scans to the corresponding regions of partial views. Then, a dense 3D mesh model of the unknown environment is incrementally generated from the input RGB-D sequence. Benefiting from the 2D consistent semantic segments and the 3D model, a novel semantic projection block (SP-Block) is proposed to extract deep feature volumes from 2D segments of different views. These semantic volumes are then fused with deep volumes from a point cloud encoder to produce the final semantic segmentation. Extensive experimental evaluations on public datasets show that our system simultaneously achieves accurate 3D dense reconstruction and state-of-the-art semantic prediction performance.
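The abstract describes projecting multi-view 2D semantic features onto 3D geometry and fusing them with point-cloud features. The sketch below illustrates that general idea under stated assumptions: it projects 3D query points into each camera view, bilinearly samples per-view feature maps, averages across views, and fuses the result with point features via concatenation and an MLP. All names (`SPBlockSketch`, `project_points`), feature dimensions, and the concatenation-based fusion are illustrative assumptions; the paper's actual SP-Block architecture is not specified in the abstract.

```python
# Minimal sketch of the projection-and-fusion idea from the abstract.
# The module name, dimensions, and fusion strategy are assumptions, not
# the paper's actual SP-Block design.
import torch
import torch.nn as nn
import torch.nn.functional as F

def project_points(points, K, T_cam_world):
    """Project world-space points (N, 3) into pixel coordinates (N, 2)."""
    N = points.shape[0]
    homo = torch.cat([points, torch.ones(N, 1)], dim=1)           # (N, 4)
    cam = (T_cam_world @ homo.T).T[:, :3]                         # camera frame
    uv = (K @ cam.T).T                                            # (N, 3)
    return uv[:, :2] / uv[:, 2:3].clamp(min=1e-6)                 # perspective divide

class SPBlockSketch(nn.Module):
    """Samples 2D semantic features at projected 3D points, averages over
    views, and fuses with per-point features from a point cloud encoder."""
    def __init__(self, feat2d_dim=32, point_dim=64, out_dim=64):
        super().__init__()
        self.fuse = nn.Sequential(
            nn.Linear(feat2d_dim + point_dim, out_dim), nn.ReLU(),
            nn.Linear(out_dim, out_dim),
        )

    def forward(self, points, point_feats, view_feats, Ks, poses, hw):
        H, W = hw
        sampled = []
        for feat, K, T in zip(view_feats, Ks, poses):             # loop over views
            uv = project_points(points, K, T)                     # (N, 2) pixels
            # Normalize pixel coordinates to [-1, 1] for grid_sample.
            grid = torch.stack([uv[:, 0] / (W - 1), uv[:, 1] / (H - 1)], dim=1)
            grid = (grid * 2 - 1).view(1, -1, 1, 2)
            f = F.grid_sample(feat[None], grid, align_corners=True)  # (1, C, N, 1)
            sampled.append(f[0, :, :, 0].T)                       # (N, C)
        sem = torch.stack(sampled).mean(dim=0)                    # average across views
        return self.fuse(torch.cat([sem, point_feats], dim=1))    # (N, out_dim)

# Toy usage: 2 views, 100 query points.
pts = torch.rand(100, 3)
pfeat = torch.rand(100, 64)
feats = [torch.rand(32, 120, 160) for _ in range(2)]
K = torch.tensor([[100.0, 0, 80], [0, 100.0, 60], [0, 0, 1]])
T = torch.eye(4)
block = SPBlockSketch()
out = block(pts, pfeat, feats, [K, K], [T, T], (120, 160))
print(out.shape)  # torch.Size([100, 64])
```

Averaging across views is only one plausible aggregation choice; the paper may weight views differently or build dense voxel volumes rather than per-point samples.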
- Yingcai Wan (1 paper)
- Yanyan Li (86 papers)
- Yingxuan You (8 papers)
- Cheng Guo (97 papers)
- Lijin Fang (1 paper)
- Federico Tombari (214 papers)