Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Photorealistic Image Synthesis for Object Instance Detection (1902.03334v1)

Published 9 Feb 2019 in cs.CV, cs.AI, and cs.RO

Abstract: We present an approach to synthesize highly photorealistic images of 3D object models, which we use to train a convolutional neural network for detecting the objects in real images. The proposed approach has three key ingredients: (1) 3D object models are rendered in 3D models of complete scenes with realistic materials and lighting, (2) plausible geometric configuration of objects and cameras in a scene is generated using physics simulations, and (3) high photorealism of the synthesized images achieved by physically based rendering. When trained on images synthesized by the proposed approach, the Faster R-CNN object detector achieves a 24% absolute improvement of [email protected] on Rutgers APC and 11% on LineMod-Occluded datasets, compared to a baseline where the training images are synthesized by rendering object models on top of random photographs. This work is a step towards being able to effectively train object detectors without capturing or annotating any real images. A dataset of 600K synthetic images with ground truth annotations for various computer vision tasks will be released on the project website: thodan.github.io/objectsynth.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (9)
  1. Tomas Hodan (22 papers)
  2. Vibhav Vineet (58 papers)
  3. Ran Gal (3 papers)
  4. Emanuel Shalev (1 paper)
  5. Jon Hanzelka (1 paper)
  6. Treb Connell (1 paper)
  7. Pedro Urbina (1 paper)
  8. Sudipta N. Sinha (13 papers)
  9. Brian Guenter (4 papers)
Citations (136)

Summary

We haven't generated a summary for this paper yet.

Github Logo Streamline Icon: https://streamlinehq.com