
DreamComposer: Controllable 3D Object Generation via Multi-View Conditions (2312.03611v2)

Published 6 Dec 2023 in cs.CV, cs.AI, and cs.LG

Abstract: Utilizing pre-trained 2D large-scale generative models, recent works are capable of generating high-quality novel views from a single in-the-wild image. However, due to the lack of information from multiple views, these works encounter difficulties in generating controllable novel views. In this paper, we present DreamComposer, a flexible and scalable framework that can enhance existing view-aware diffusion models by injecting multi-view conditions. Specifically, DreamComposer first uses a view-aware 3D lifting module to obtain 3D representations of an object from multiple views. Then, it renders the latent features of the target view from 3D representations with the multi-view feature fusion module. Finally, the target view features extracted from multi-view inputs are injected into a pre-trained diffusion model. Experiments show that DreamComposer is compatible with state-of-the-art diffusion models for zero-shot novel view synthesis, further enhancing them to generate high-fidelity novel view images with multi-view conditions, ready for controllable 3D object reconstruction and various other applications.
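
The three stages named in the abstract (lifting each input view, fusing to the target view, injecting into the diffusion model) can be illustrated with a toy data-flow sketch. This is not the authors' implementation: every function name here is hypothetical, the "3D representation" is reduced to a feature vector tagged with a camera azimuth, the distance-based softmax weighting is an assumed fusion rule, and the residual addition is an assumed injection mechanism. It only shows how multi-view inputs could be combined into a single target-view condition.

```python
import math
from typing import List, Sequence, Tuple

# Hypothetical stand-in for a lifted view: (camera azimuth in degrees, feature vector).
LiftedView = Tuple[float, List[float]]


def angular_distance(a_deg: float, b_deg: float) -> float:
    """Smallest absolute angle between two azimuths, in degrees (0 to 180)."""
    d = abs(a_deg - b_deg) % 360.0
    return min(d, 360.0 - d)


def lift_views(features: Sequence[Sequence[float]],
               azimuths: Sequence[float]) -> List[LiftedView]:
    """Stage 1 stand-in: pair each view's features with its camera azimuth."""
    if len(features) != len(azimuths):
        raise ValueError("need exactly one azimuth per view")
    return [(float(az), list(f)) for az, f in zip(azimuths, features)]


def fuse_to_target(lifted: Sequence[LiftedView],
                   target_az: float,
                   temperature: float = 30.0) -> List[float]:
    """Stage 2 stand-in: softmax-weighted average of view features.

    Assumption: views whose azimuth is closer to the target get larger weights.
    """
    if not lifted:
        raise ValueError("need at least one input view")
    logits = [-angular_distance(az, target_az) / temperature for az, _ in lifted]
    peak = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    weights = [e / total for e in exps]
    dim = len(lifted[0][1])
    return [sum(w * feat[i] for w, (_, feat) in zip(weights, lifted))
            for i in range(dim)]


def inject(base_condition: Sequence[float],
           target_features: Sequence[float],
           scale: float = 1.0) -> List[float]:
    """Stage 3 stand-in: add fused features to the base conditioning as a residual."""
    if len(base_condition) != len(target_features):
        raise ValueError("condition and features must have the same length")
    return [b + scale * t for b, t in zip(base_condition, target_features)]


# Example: two input views (front and back), target view from the side.
views = lift_views([[1.0, 0.0], [0.0, 1.0]], azimuths=[0.0, 180.0])
fused = fuse_to_target(views, target_az=90.0)
conditioned = inject([0.0, 0.0], fused, scale=1.0)
```

In this sketch a target view equidistant from both inputs receives an even blend of their features, while a target near one input is dominated by that input. The real framework operates on learned latent features and a pre-trained diffusion backbone rather than plain vectors.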

Authors (8)
  1. Yunhan Yang (11 papers)
  2. Yukun Huang (39 papers)
  3. Xiaoyang Wu (28 papers)
  4. Yuan-Chen Guo (31 papers)
  5. Song-Hai Zhang (41 papers)
  6. Hengshuang Zhao (118 papers)
  7. Tong He (124 papers)
  8. Xihui Liu (92 papers)
Citations (7)
