Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Learning Disentangled Prompts for Compositional Image Synthesis (2306.00763v1)

Published 1 Jun 2023 in cs.CV and cs.AI

Abstract: We study domain-adaptive image synthesis, the problem of teaching pretrained image generative models a new style or concept from as few as one image to synthesize novel images, to better understand the compositional image synthesis. We present a framework that leverages a pretrained class-conditional generation model and visual prompt tuning. Specifically, we propose a novel source class distilled visual prompt that learns disentangled prompts of semantic (e.g., class) and domain (e.g., style) from a few images. Learned domain prompt is then used to synthesize images of any classes in the style of target domain. We conduct studies on various target domains with the number of images ranging from one to a few to many, and show qualitative results which show the compositional generalization of our method. Moreover, we show that our method can help improve zero-shot domain adaptation classification accuracy.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (8)
  1. Kihyuk Sohn (54 papers)
  2. Albert Shaw (6 papers)
  3. Yuan Hao (5 papers)
  4. Han Zhang (338 papers)
  5. Luisa Polania (4 papers)
  6. Huiwen Chang (28 papers)
  7. Lu Jiang (90 papers)
  8. Irfan Essa (91 papers)
Citations (6)

Summary

We haven't generated a summary for this paper yet.