Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

PhoCaL: A Multi-Modal Dataset for Category-Level Object Pose Estimation with Photometrically Challenging Objects (2205.08811v1)

Published 18 May 2022 in cs.CV

Abstract: Object pose estimation is crucial for robotic applications and augmented reality. Beyond instance level 6D object pose estimation methods, estimating category-level pose and shape has become a promising trend. As such, a new research field needs to be supported by well-designed datasets. To provide a benchmark with high-quality ground truth annotations to the community, we introduce a multimodal dataset for category-level object pose estimation with photometrically challenging objects termed PhoCaL. PhoCaL comprises 60 high quality 3D models of household objects over 8 categories including highly reflective, transparent and symmetric objects. We developed a novel robot-supported multi-modal (RGB, depth, polarisation) data acquisition and annotation process. It ensures sub-millimeter accuracy of the pose for opaque textured, shiny and transparent objects, no motion blur and perfect camera synchronisation. To set a benchmark for our dataset, state-of-the-art RGB-D and monocular RGB methods are evaluated on the challenging scenes of PhoCaL.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (9)
  1. Pengyuan Wang (15 papers)
  2. HyunJun Jung (14 papers)
  3. Yitong Li (95 papers)
  4. Siyuan Shen (22 papers)
  5. Rahul Parthasarathy Srikanth (1 paper)
  6. Lorenzo Garattoni (14 papers)
  7. Sven Meier (3 papers)
  8. Nassir Navab (459 papers)
  9. Benjamin Busam (82 papers)
Citations (38)

Summary

We haven't generated a summary for this paper yet.