
PanoSSC: Exploring Monocular Panoptic 3D Scene Reconstruction for Autonomous Driving (2406.07037v1)

Published 11 Jun 2024 in cs.CV

Abstract: Vision-centric occupancy networks, which represent the surrounding environment with uniform voxels with semantics, have become a new trend for safe driving of camera-only autonomous driving perception systems, as they are able to detect obstacles regardless of their shape and occlusion. Modern occupancy networks mainly focus on reconstructing visible voxels from object surfaces with voxel-wise semantic prediction. Usually, they suffer from inconsistent predictions within a single object and mixed predictions across adjacent objects. This confusion may harm the safety of downstream planning modules. To this end, we investigate panoptic segmentation on 3D voxel scenarios and propose an instance-aware occupancy network, PanoSSC. We predict foreground objects and backgrounds separately and merge both in post-processing. For foreground instance grouping, we propose a novel 3D instance mask decoder that can efficiently extract individual objects. We unify geometric reconstruction, 3D semantic segmentation, and 3D instance segmentation into the PanoSSC framework and propose new metrics for evaluating panoptic voxels. Extensive experiments show that our method achieves competitive results on the SemanticKITTI semantic scene completion benchmark.
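
The abstract says foreground objects and backgrounds are predicted separately and merged in post-processing, but gives no detail on the merge. The following is a minimal sketch of one plausible merge step, not the authors' implementation: the function name `merge_panoptic`, its arguments, the mask threshold, and the score-ordered overlap rule are all assumptions made for illustration.

```python
import numpy as np

def merge_panoptic(background_sem, instance_masks, instance_classes,
                   instance_scores, mask_threshold=0.5):
    """Hypothetical merge of background semantics and foreground instances.

    background_sem:   (X, Y, Z) int array, semantic label per voxel (0 = empty).
    instance_masks:   (N, X, Y, Z) float array, per-instance mask probabilities.
    instance_classes: (N,) int array, semantic class of each instance.
    instance_scores:  (N,) float array, confidence of each instance.

    Returns (semantic, instance_id), both of shape (X, Y, Z).
    instance_id is 0 where no foreground instance claims the voxel.
    """
    semantic = background_sem.copy()
    instance_id = np.zeros(background_sem.shape, dtype=np.int64)

    # Assumed rule: paste instances in order of decreasing confidence, so a
    # voxel claimed by two masks goes to the more confident instance.
    order = np.argsort(-instance_scores)
    next_id = 1
    for idx in order:
        # Keep only confident voxels that no earlier instance has claimed.
        mask = (instance_masks[idx] >= mask_threshold) & (instance_id == 0)
        if not mask.any():
            continue  # instance fully covered by more confident ones
        semantic[mask] = instance_classes[idx]
        instance_id[mask] = next_id
        next_id += 1
    return semantic, instance_id


# Toy 1 x 1 x 4 grid with two overlapping instances.
background = np.array([[[1, 1, 0, 2]]])
masks = np.array([
    [[[0.9, 0.9, 0.0, 0.0]]],   # instance 0, overlaps instance 1 at voxel 1
    [[[0.0, 0.8, 0.8, 0.0]]],   # instance 1
])
classes = np.array([5, 6])
scores = np.array([0.6, 0.9])

sem, inst = merge_panoptic(background, masks, classes, scores)
print(sem.ravel().tolist())   # [5, 6, 6, 2]
print(inst.ravel().tolist())  # [2, 1, 1, 0]
```

In the toy grid, the contested voxel goes to the higher-scoring instance, and the last voxel keeps its background label because no mask covers it. A real pipeline would take these inputs from the network's mask decoder and semantic head.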

Authors (7)
  1. Yining Shi (21 papers)
  2. Jiusi Li (4 papers)
  3. Kun Jiang (128 papers)
  4. Ke Wang (531 papers)
  5. Yunlong Wang (91 papers)
  6. Mengmeng Yang (35 papers)
  7. Diange Yang (37 papers)
Citations (4)
