
IDE-3D: Interactive Disentangled Editing for High-Resolution 3D-aware Portrait Synthesis (2205.15517v1)

Published 31 May 2022 in cs.CV

Abstract: Existing 3D-aware facial generation methods face a dilemma between quality and editability: they either generate editable results at low resolution or high-quality results with no editing flexibility. In this work, we propose a new approach that brings the best of both worlds together. Our system consists of three major components: (1) a 3D-semantics-aware generative model that produces view-consistent, disentangled face images and semantic masks; (2) a hybrid GAN inversion approach that initializes the latent codes from the semantic and texture encoder and further optimizes them for faithful reconstruction; and (3) a canonical editor that enables efficient manipulation of semantic masks in the canonical view and produces high-quality editing results. Our approach supports many applications, e.g., free-view face drawing, editing, and style control. Both quantitative and qualitative results show that our method reaches the state of the art in terms of photorealism, faithfulness, and efficiency.
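
The hybrid inversion idea in component (2), an encoder supplying an initial latent code that is then refined by optimization, can be illustrated with a toy sketch. This is not the IDE-3D implementation: the linear `generator`, the crude `encoder`, the matrix `A`, and the step size are all hypothetical stand-ins chosen so the example runs with the standard library alone.

```python
# Toy illustration of hybrid inversion: encoder initialization followed by
# latent optimization. All names and values are hypothetical stand-ins,
# not the IDE-3D model or its API.

# Assumed "generator" weights: a fixed linear map from a 2-D latent to a 3-D output.
A = [[1.0, 0.5],
     [0.0, 1.0],
     [0.5, -0.5]]


def generator(w):
    """Stand-in generator: maps a latent code w to an 'image' A @ w."""
    return [row[0] * w[0] + row[1] * w[1] for row in A]


def encoder(x):
    """Stand-in encoder: a cheap, deliberately imperfect guess of the latent."""
    return [0.8 * x[0], 0.8 * x[1]]


def loss(w, x):
    """Squared reconstruction error between generator(w) and the target x."""
    return sum((g - t) ** 2 for g, t in zip(generator(w), x))


def grad(w, x):
    """Gradient of the squared error with respect to w: 2 * A^T (A w - x)."""
    residual = [g - t for g, t in zip(generator(w), x)]
    return [2.0 * sum(A[i][j] * residual[i] for i in range(3)) for j in range(2)]


def hybrid_invert(x, steps=200, lr=0.1):
    """Initialize the latent from the encoder, then refine it by gradient descent."""
    w = encoder(x)
    for _ in range(steps):
        g = grad(w, x)
        w = [wi - lr * gi for wi, gi in zip(w, g)]
    return w


if __name__ == "__main__":
    target = generator([1.0, -2.0])   # an output whose true latent is known
    w_init = encoder(target)          # fast but inexact starting point
    w_opt = hybrid_invert(target)     # refined latent
    print("loss after encoder only:", loss(w_init, target))
    print("loss after optimization:", loss(w_opt, target))
```

In this toy setting the encoder guess alone leaves a visible reconstruction error, and the optimization stage drives it toward zero; the real system operates on latent codes of a 3D-aware GAN rather than a linear map.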

Authors (6)
  1. Jingxiang Sun (20 papers)
  2. Xuan Wang (205 papers)
  3. Yichun Shi (40 papers)
  4. Lizhen Wang (20 papers)
  5. Jue Wang (204 papers)
  6. Yebin Liu (115 papers)
Citations (10)
