Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Discovering Class-Specific GAN Controls for Semantic Image Synthesis (2212.01455v1)

Published 2 Dec 2022 in cs.CV

Abstract: Prior work has extensively studied the latent space structure of GANs for unconditional image synthesis, enabling global editing of generated images by the unsupervised discovery of interpretable latent directions. However, the discovery of latent directions for conditional GANs for semantic image synthesis (SIS) has remained unexplored. In this work, we specifically focus on addressing this gap. We propose a novel optimization method for finding spatially disentangled class-specific directions in the latent space of pretrained SIS models. We show that the latent directions found by our method can effectively control the local appearance of semantic classes, e.g., changing their internal structure, texture or color independently from each other. Visual inspection and quantitative evaluation of the discovered GAN controls on various datasets demonstrate that our method discovers a diverse set of unique and semantically meaningful latent directions for class-specific edits.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Edgar Schönfeld (21 papers)
  2. Julio Borges (3 papers)
  3. Vadim Sushko (7 papers)
  4. Bernt Schiele (210 papers)
  5. Anna Khoreva (27 papers)
Citations (1)

Summary

We haven't generated a summary for this paper yet.