Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

6D Pose Estimation with Correlation Fusion (1909.12936v2)

Published 24 Sep 2019 in cs.CV and cs.RO

Abstract: 6D object pose estimation is widely applied in robotic tasks such as grasping and manipulation. Prior methods using RGB-only images are vulnerable to heavy occlusion and poor illumination, so it is important to complement them with depth information. However, existing methods using RGB-D data cannot adequately exploit consistent and complementary information between RGB and depth modalities. In this paper, we present a novel method to effectively consider the correlation within and across both modalities with attention mechanism to learn discriminative and compact multi-modal features. Then, effective fusion strategies for intra- and inter-correlation modules are explored to ensure efficient information flow between RGB and depth. To our best knowledge, this is the first work to explore effective intra- and inter-modality fusion in 6D pose estimation. The experimental results show that our method can achieve the state-of-the-art performance on LineMOD and YCB-Video dataset. We also demonstrate that the proposed method can benefit a real-world robot grasping task by providing accurate object pose estimation.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (9)
  1. Yi Cheng (78 papers)
  2. Hongyuan Zhu (36 papers)
  3. Ying Sun (155 papers)
  4. Cihan Acar (6 papers)
  5. Wei Jing (33 papers)
  6. Yan Wu (109 papers)
  7. Liyuan Li (11 papers)
  8. Cheston Tan (50 papers)
  9. Joo-Hwee Lim (10 papers)
Citations (15)

Summary

We haven't generated a summary for this paper yet.