Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

HiFi-123: Towards High-fidelity One Image to 3D Content Generation (2310.06744v3)

Published 10 Oct 2023 in cs.CV

Abstract: Recent advances in diffusion models have enabled 3D generation from a single image. However, current methods often produce suboptimal results for novel views, with blurred textures and deviations from the reference image, limiting their practical applications. In this paper, we introduce HiFi-123, a method designed for high-fidelity and multi-view consistent 3D generation. Our contributions are twofold: First, we propose a Reference-Guided Novel View Enhancement (RGNV) technique that significantly improves the fidelity of diffusion-based zero-shot novel view synthesis methods. Second, capitalizing on the RGNV, we present a novel Reference-Guided State Distillation (RGSD) loss. When incorporated into the optimization-based image-to-3D pipeline, our method significantly improves 3D generation quality, achieving state-of-the-art performance. Comprehensive evaluations demonstrate the effectiveness of our approach over existing methods, both qualitatively and quantitatively. Video results are available on the project page.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (9)
  1. Wangbo Yu (15 papers)
  2. Li Yuan (142 papers)
  3. Yan-Pei Cao (58 papers)
  4. Xiangjun Gao (9 papers)
  5. Xiaoyu Li (348 papers)
  6. Long Quan (35 papers)
  7. Ying Shan (252 papers)
  8. Yonghong Tian (184 papers)
  9. Wenbo Hu (55 papers)
Citations (20)

Summary

We haven't generated a summary for this paper yet.

X Twitter Logo Streamline Icon: https://streamlinehq.com