DreamDistribution: Prompt Distribution Learning for Text-to-Image Diffusion Models (2312.14216v1)

Published 21 Dec 2023 in cs.CV

Abstract: The popularization of Text-to-Image (T2I) diffusion models enables the generation of high-quality images from text descriptions. However, generating diverse customized images with reference visual attributes remains challenging. This work focuses on personalizing T2I diffusion models at a more abstract concept or category level, adapting commonalities from a set of reference images while creating new instances with sufficient variations. We introduce a solution that allows a pretrained T2I diffusion model to learn a set of soft prompts, enabling the generation of novel images by sampling prompts from the learned distribution. These prompts offer text-guided editing capabilities and additional flexibility in controlling variation and mixing between multiple distributions. We also show the adaptability of the learned prompt distribution to other tasks, such as text-to-3D. Finally we demonstrate effectiveness of our approach through quantitative analysis including automatic evaluation and human assessment. Project website: https://briannlongzhao.github.io/DreamDistribution

PDF HTML Abstract

Summarize PDF Markdown Bookmark Chat (Pro)

References (59)

Authors (9)

Brian Nlong Zhao (5 papers)
Yuhang Xiao (5 papers)
Jiashu Xu (21 papers)
Xinyang Jiang (40 papers)
Yifan Yang (578 papers)
Dongsheng Li (240 papers)
Laurent Itti (57 papers)
Vibhav Vineet (58 papers)
Yunhao Ge (29 papers)

Citations (6)

View on Semantic Scholar

GitHub

DreamDistribution: Prompt Distribution Learning for Text-to-Image Diffusion Models

HackerNews

Prompt Distribution Learning for Text-to-Image Diffusion Models (1 point, 0 comments)

DreamDistribution: Prompt Distribution Learning for Text-to-Image Diffusion Models (2312.14216v1)

Related Papers

GitHub

HackerNews