ControlDreamer: Blending Geometry and Style in Text-to-3D (2312.01129v3)

Published 2 Dec 2023 in cs.CV

Abstract: Recent advancements in text-to-3D generation have significantly contributed to the automation and democratization of 3D content creation. Building upon these developments, we aim to address the limitations of current methods in blending geometries and styles in text-to-3D generation. We introduce multi-view ControlNet, a novel depth-aware multi-view diffusion model trained on generated datasets from a carefully curated text corpus. Our multi-view ControlNet is then integrated into our two-stage pipeline, ControlDreamer, enabling text-guided generation of stylized 3D models. Additionally, we present a comprehensive benchmark for 3D style editing, encompassing a broad range of subjects, including objects, animals, and characters, to further facilitate research on diverse 3D generation. Our comparative analysis reveals that this new pipeline outperforms existing text-to-3D methods as evidenced by human evaluations and CLIP score metrics. Project page: https://controldreamer.github.io

Authors (6)
  1. Yeongtak Oh (5 papers)
  2. Jooyoung Choi (21 papers)
  3. Yongsung Kim (6 papers)
  4. Minjun Park (4 papers)
  5. Chaehun Shin (12 papers)
  6. Sungroh Yoon (163 papers)