Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

RBGNet: Ray-based Grouping for 3D Object Detection (2204.02251v1)

Published 5 Apr 2022 in cs.CV

Abstract: As a fundamental problem in computer vision, 3D object detection is experiencing rapid growth. To extract the point-wise features from the irregularly and sparsely distributed points, previous methods usually take a feature grouping module to aggregate the point features to an object candidate. However, these methods have not yet leveraged the surface geometry of foreground objects to enhance grouping and 3D box generation. In this paper, we propose the RBGNet framework, a voting-based 3D detector for accurate 3D object detection from point clouds. In order to learn better representations of object shape to enhance cluster features for predicting 3D boxes, we propose a ray-based feature grouping module, which aggregates the point-wise features on object surfaces using a group of determined rays uniformly emitted from cluster centers. Considering the fact that foreground points are more meaningful for box estimation, we design a novel foreground biased sampling strategy in downsample process to sample more points on object surfaces and further boost the detection performance. Our model achieves state-of-the-art 3D detection performance on ScanNet V2 and SUN RGB-D with remarkable performance gains. Code will be available at https://github.com/Haiyang-W/RBGNet.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (8)
  1. Haiyang Wang (47 papers)
  2. Shaoshuai Shi (39 papers)
  3. Ze Yang (51 papers)
  4. Rongyao Fang (18 papers)
  5. Qi Qian (54 papers)
  6. Hongsheng Li (340 papers)
  7. Bernt Schiele (210 papers)
  8. Liwei Wang (239 papers)
Citations (52)

Summary

We haven't generated a summary for this paper yet.