Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

NerfDiff: Single-image View Synthesis with NeRF-guided Distillation from 3D-aware Diffusion (2302.10109v1)

Published 20 Feb 2023 in cs.CV and cs.LG

Abstract: Novel view synthesis from a single image requires inferring occluded regions of objects and scenes whilst simultaneously maintaining semantic and physical consistency with the input. Existing approaches condition neural radiance fields (NeRF) on local image features, projecting points to the input image plane, and aggregating 2D features to perform volume rendering. However, under severe occlusion, this projection fails to resolve uncertainty, resulting in blurry renderings that lack details. In this work, we propose NerfDiff, which addresses this issue by distilling the knowledge of a 3D-aware conditional diffusion model (CDM) into NeRF through synthesizing and refining a set of virtual views at test time. We further propose a novel NeRF-guided distillation algorithm that simultaneously generates 3D consistent virtual views from the CDM samples, and finetunes the NeRF based on the improved virtual views. Our approach significantly outperforms existing NeRF-based and geometry-free approaches on challenging datasets, including ShapeNet, ABO, and Clevr3D.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (7)
  1. Jiatao Gu (84 papers)
  2. Alex Trevithick (8 papers)
  3. Kai-En Lin (7 papers)
  4. Josh Susskind (38 papers)
  5. Christian Theobalt (251 papers)
  6. Lingjie Liu (79 papers)
  7. Ravi Ramamoorthi (65 papers)
Citations (151)

Summary

We haven't generated a summary for this paper yet.