Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

ReGANIE: Rectifying GAN Inversion Errors for Accurate Real Image Editing (2301.13402v1)

Published 31 Jan 2023 in cs.CV and eess.IV

Abstract: The StyleGAN family succeed in high-fidelity image generation and allow for flexible and plausible editing of generated images by manipulating the semantic-rich latent style space.However, projecting a real image into its latent space encounters an inherent trade-off between inversion quality and editability. Existing encoder-based or optimization-based StyleGAN inversion methods attempt to mitigate the trade-off but see limited performance. To fundamentally resolve this problem, we propose a novel two-phase framework by designating two separate networks to tackle editing and reconstruction respectively, instead of balancing the two. Specifically, in Phase I, a W-space-oriented StyleGAN inversion network is trained and used to perform image inversion and editing, which assures the editability but sacrifices reconstruction quality. In Phase II, a carefully designed rectifying network is utilized to rectify the inversion errors and perform ideal reconstruction. Experimental results show that our approach yields near-perfect reconstructions without sacrificing the editability, thus allowing accurate manipulation of real images. Further, we evaluate the performance of our rectifying network, and see great generalizability towards unseen manipulation types and out-of-domain images.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (7)
  1. Bingchuan Li (10 papers)
  2. Tianxiang Ma (12 papers)
  3. Peng Zhang (642 papers)
  4. Miao Hua (9 papers)
  5. Wei Liu (1135 papers)
  6. Qian He (65 papers)
  7. Zili Yi (21 papers)
Citations (5)

Summary

We haven't generated a summary for this paper yet.