LinkGAN: Linking GAN Latents to Pixels for Controllable Image Synthesis (2301.04604v2)

Published 11 Jan 2023 in cs.CV

Abstract: This work presents an easy-to-use regularizer for GAN training, which helps explicitly link some axes of the latent space to a set of pixels in the synthesized image. Establishing such a connection facilitates a more convenient local control of GAN generation, where users can alter the image content only within a spatial area simply by partially resampling the latent code. Experimental results confirm four appealing properties of our regularizer, which we call LinkGAN. (1) The latent-pixel linkage is applicable to either a fixed region (i.e., same for all instances) or a particular semantic category (i.e., varying across instances), like the sky. (2) Two or multiple regions can be independently linked to different latent axes, which further supports joint control. (3) Our regularizer can improve the spatial controllability of both 2D and 3D-aware GAN models, barely sacrificing the synthesis performance. (4) The models trained with our regularizer are compatible with GAN inversion techniques and maintain editability on real images.
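
The "partial resampling" described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the latent size (512), the choice of linked axes (the first 64), and the helper name `resample_linked_axes` are all hypothetical, and no generator is involved. The sketch only shows the latent-side operation, redrawing the linked axes from a standard normal while leaving every other axis untouched.

```python
import random


def resample_linked_axes(z, linked_axes, rng):
    """Return a copy of z with only the linked axes redrawn from N(0, 1).

    z           -- latent code as a list of floats
    linked_axes -- indices assumed to be linked to an image region (hypothetical)
    rng         -- random.Random instance used for the new samples
    """
    linked = set(linked_axes)
    return [rng.gauss(0.0, 1.0) if i in linked else v for i, v in enumerate(z)]


rng = random.Random(0)
latent_dim = 512             # hypothetical latent size
linked_axes = range(0, 64)   # hypothetical axes linked to one region

# Original latent code, then an edited copy with only the linked axes changed.
z = [rng.gauss(0.0, 1.0) for _ in range(latent_dim)]
z_edit = resample_linked_axes(z, linked_axes, rng)
```

With a model trained so that these axes control a given region, feeding `z` and `z_edit` to the generator would be expected to produce images that differ mainly inside that region. Passing a different index set for each region would correspond to the joint control mentioned in property (2).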

Authors (7)
  1. Jiapeng Zhu (26 papers)
  2. Ceyuan Yang (51 papers)
  3. Yujun Shen (111 papers)
  4. Zifan Shi (19 papers)
  5. Bo Dai (245 papers)
  6. Deli Zhao (66 papers)
  7. Qifeng Chen (187 papers)
Citations (19)
