HS-Pose: Hybrid Scope Feature Extraction for Category-level Object Pose Estimation (2303.15743v1)

Published 28 Mar 2023 in cs.CV

Abstract: In this paper, we focus on the problem of category-level object pose estimation, which is challenging due to the large intra-category shape variation. 3D graph convolution (3D-GC) based methods have been widely used to extract local geometric features, but they have limitations for complex-shaped objects and are sensitive to noise. Moreover, the scale- and translation-invariant properties of 3D-GC restrict the perception of an object's size and translation information. We propose a simple network structure, the HS-layer, which extends 3D-GC to extract hybrid scope latent features from point cloud data for category-level object pose estimation tasks. The proposed HS-layer: 1) is able to perceive local-global geometric structure and global information, 2) is robust to noise, and 3) can encode size and translation information. Our experiments show that simply replacing the 3D-GC layer with the proposed HS-layer in the baseline method (GPV-Pose) achieves a significant improvement, increasing performance by 14.5% on the 5°2cm metric and 10.3% on IoU75. Our method outperforms the state-of-the-art methods by a large margin (8.3% on 5°2cm, 6.9% on IoU75) on the REAL275 dataset and runs in real time (50 FPS).
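
For readers unfamiliar with the 5°2cm metric quoted in the abstract: a predicted pose is commonly counted as correct when its rotation error is within 5 degrees and its translation error is within 2 cm of the ground truth. The sketch below illustrates that check for a single prediction. It is not the paper's evaluation code. The function names, the nested-list matrix representation, the use of metres for translations, and the inclusive thresholds are all assumptions made for illustration, and the special handling that benchmarks usually apply to symmetric objects is omitted.

```python
import math


def rotation_error_deg(R_pred, R_gt):
    """Geodesic angle in degrees between two 3x3 rotation matrices (nested lists)."""
    # trace(R_pred^T @ R_gt) equals the sum of element-wise products.
    trace = sum(R_pred[i][j] * R_gt[i][j] for i in range(3) for j in range(3))
    # Clamp to [-1, 1] to guard against floating-point drift before acos.
    cos_angle = max(-1.0, min(1.0, (trace - 1.0) / 2.0))
    return math.degrees(math.acos(cos_angle))


def translation_error_cm(t_pred, t_gt):
    """Euclidean distance between translations given in metres (assumed), in cm."""
    return 100.0 * math.dist(t_pred, t_gt)


def within_5deg_2cm(R_pred, t_pred, R_gt, t_gt, max_deg=5.0, max_cm=2.0):
    """True if both rotation and translation errors are within the thresholds."""
    return (rotation_error_deg(R_pred, R_gt) <= max_deg
            and translation_error_cm(t_pred, t_gt) <= max_cm)


def rot_z(deg):
    """Hypothetical helper: rotation about the z-axis by `deg` degrees."""
    c, s = math.cos(math.radians(deg)), math.sin(math.radians(deg))
    return [[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]]


IDENTITY = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]

# A prediction that is 3 degrees and 1 cm off passes the check.
print(within_5deg_2cm(rot_z(3.0), [0.0, 0.0, 0.51], IDENTITY, [0.0, 0.0, 0.50]))
# A prediction that is 10 degrees off fails it, even with a perfect translation.
print(within_5deg_2cm(rot_z(10.0), [0.0, 0.0, 0.50], IDENTITY, [0.0, 0.0, 0.50]))
```

A benchmark score on this metric is then derived from how many test predictions pass the check; the exact aggregation (for example, averaging per category) depends on the evaluation protocol and is not specified here.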

Authors (8)
  1. Linfang Zheng (5 papers)
  2. Chen Wang (600 papers)
  3. Yinghan Sun (4 papers)
  4. Esha Dasgupta (1 paper)
  5. Hua Chen (138 papers)
  6. Ales Leonardis (84 papers)
  7. Wei Zhang (1492 papers)
  8. Hyung Jin Chang (47 papers)
Citations (31)
