Diffusion Cocktail: Mixing Domain-Specific Diffusion Models for Diversified Image Generations (2312.08873v2)

Published 12 Dec 2023 in cs.CV and cs.AI

Abstract: Diffusion models, capable of high-quality image generation, receive unparalleled popularity for their ease of extension. Active users have created a massive collection of domain-specific diffusion models by fine-tuning base models on self-collected datasets. Recent work has focused on improving a single diffusion model by uncovering semantic and visual information encoded in various architecture components. However, those methods overlook the vastly available set of fine-tuned diffusion models and, therefore, miss the opportunity to utilize their combined capacity for novel generation. In this work, we propose Diffusion Cocktail (Ditail), a training-free method that transfers style and content information between multiple diffusion models. This allows us to perform diversified generations using a set of diffusion models, resulting in novel images unobtainable by a single model. Ditail also offers fine-grained control of the generation process, which enables flexible manipulations of styles and contents. With these properties, Ditail excels in numerous applications, including style transfer guided by diffusion models, novel-style image generation, and image manipulation via prompts or collage inputs.

Authors (4)

Haoming Liu (7 papers)
Yuanhe Guo (3 papers)
Shengjie Wang (29 papers)
Hongyi Wen (5 papers)

Summary

We haven't generated a summary for this paper yet.

Summarize Now

GitHub

Ditail

Diffusion Cocktail: Mixing Domain-Specific Diffusion Models for Diversified Image Generations (2312.08873v2)

Summary

Related Papers

GitHub