Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Occ3D: A Large-Scale 3D Occupancy Prediction Benchmark for Autonomous Driving (2304.14365v3)

Published 27 Apr 2023 in cs.CV

Abstract: Robotic perception requires the modeling of both 3D geometry and semantics. Existing methods typically focus on estimating 3D bounding boxes, neglecting finer geometric details and struggling to handle general, out-of-vocabulary objects. 3D occupancy prediction, which estimates the detailed occupancy states and semantics of a scene, is an emerging task to overcome these limitations. To support 3D occupancy prediction, we develop a label generation pipeline that produces dense, visibility-aware labels for any given scene. This pipeline comprises three stages: voxel densification, occlusion reasoning, and image-guided voxel refinement. We establish two benchmarks, derived from the Waymo Open Dataset and the nuScenes Dataset, namely Occ3D-Waymo and Occ3D-nuScenes benchmarks. Furthermore, we provide an extensive analysis of the proposed dataset with various baseline models. Lastly, we propose a new model, dubbed Coarse-to-Fine Occupancy (CTF-Occ) network, which demonstrates superior performance on the Occ3D benchmarks. The code, data, and benchmarks are released at https://tsinghua-mars-lab.github.io/Occ3D/.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (8)
  1. Xiaoyu Tian (16 papers)
  2. Tao Jiang (274 papers)
  3. Longfei Yun (6 papers)
  4. Yucheng Mao (6 papers)
  5. Huitong Yang (7 papers)
  6. Yue Wang (676 papers)
  7. Yilun Wang (39 papers)
  8. Hang Zhao (156 papers)
Citations (150)

Summary

We haven't generated a summary for this paper yet.