Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Monocular Differentiable Rendering for Self-Supervised 3D Object Detection (2009.14524v1)

Published 30 Sep 2020 in cs.CV and cs.LG

Abstract: 3D object detection from monocular images is an ill-posed problem due to the projective entanglement of depth and scale. To overcome this ambiguity, we present a novel self-supervised method for textured 3D shape reconstruction and pose estimation of rigid objects with the help of strong shape priors and 2D instance masks. Our method predicts the 3D location and meshes of each object in an image using differentiable rendering and a self-supervised objective derived from a pretrained monocular depth estimation network. We use the KITTI 3D object detection dataset to evaluate the accuracy of the method. Experiments demonstrate that we can effectively use noisy monocular depth and differentiable rendering as an alternative to expensive 3D ground-truth labels or LiDAR information.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (7)
  1. Deniz Beker (5 papers)
  2. Hiroharu Kato (10 papers)
  3. Mihai Adrian Morariu (1 paper)
  4. Takahiro Ando (2 papers)
  5. Toru Matsuoka (2 papers)
  6. Wadim Kehl (14 papers)
  7. Adrien Gaidon (84 papers)
Citations (38)

Summary

We haven't generated a summary for this paper yet.

Youtube Logo Streamline Icon: https://streamlinehq.com