Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

HouseCat6D -- A Large-Scale Multi-Modal Category Level 6D Object Perception Dataset with Household Objects in Realistic Scenarios (2212.10428v5)

Published 20 Dec 2022 in cs.CV

Abstract: Estimating 6D object poses is a major challenge in 3D computer vision. Building on successful instance-level approaches, research is shifting towards category-level pose estimation for practical applications. Current category-level datasets, however, fall short in annotation quality and pose variety. Addressing this, we introduce HouseCat6D, a new category-level 6D pose dataset. It features 1) multi-modality with Polarimetric RGB and Depth (RGBD+P), 2) encompasses 194 diverse objects across 10 household categories, including two photometrically challenging ones, and 3) provides high-quality pose annotations with an error range of only 1.35 mm to 1.74 mm. The dataset also includes 4) 41 large-scale scenes with comprehensive viewpoint and occlusion coverage, 5) a checkerboard-free environment, and 6) dense 6D parallel-jaw robotic grasp annotations. Additionally, we present benchmark results for leading category-level pose estimation networks.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (13)
  1. HyunJun Jung (14 papers)
  2. Guangyao Zhai (26 papers)
  3. Shun-Cheng Wu (11 papers)
  4. Patrick Ruhkamp (12 papers)
  5. Hannah Schieber (10 papers)
  6. Giulia Rizzoli (8 papers)
  7. Pengyuan Wang (15 papers)
  8. Hongcheng Zhao (2 papers)
  9. Lorenzo Garattoni (14 papers)
  10. Sven Meier (3 papers)
  11. Daniel Roth (13 papers)
  12. Nassir Navab (459 papers)
  13. Benjamin Busam (82 papers)
Citations (10)

Summary

We haven't generated a summary for this paper yet.