Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Towards Self-Supervised Category-Level Object Pose and Size Estimation (2203.02884v2)

Published 6 Mar 2022 in cs.CV and cs.RO

Abstract: In this work, we tackle the challenging problem of category-level object pose and size estimation from a single depth image. Although previous fully-supervised works have demonstrated promising performance, collecting ground-truth pose labels is generally time-consuming and labor-intensive. Instead, we propose a label-free method that learns to enforce the geometric consistency between category template mesh and observed object point cloud under a self-supervision manner. Specifically, our method consists of three key components: differentiable shape deformation, registration, and rendering. In particular, shape deformation and registration are applied to the template mesh to eliminate the differences in shape, pose and scale. A differentiable renderer is then deployed to enforce geometric consistency between point clouds lifted from the rendered depth and the observed scene for self-supervision. We evaluate our approach on real-world datasets and find that our approach outperforms the simple traditional baseline by large margins while being competitive with some fully-supervised approaches.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Yisheng He (14 papers)
  2. Haoqiang Fan (55 papers)
  3. Haibin Huang (60 papers)
  4. Qifeng Chen (187 papers)
  5. Jian Sun (415 papers)
Citations (15)

Summary

We haven't generated a summary for this paper yet.