Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Adaptively-Realistic Image Generation from Stroke and Sketch with Diffusion Model (2208.12675v2)

Published 26 Aug 2022 in cs.CV

Abstract: Generating images from hand-drawings is a crucial and fundamental task in content creation. The translation is difficult as there exist infinite possibilities and the different users usually expect different outcomes. Therefore, we propose a unified framework supporting a three-dimensional control over the image synthesis from sketches and strokes based on diffusion models. Users can not only decide the level of faithfulness to the input strokes and sketches, but also the degree of realism, as the user inputs are usually not consistent with the real images. Qualitative and quantitative experiments demonstrate that our framework achieves state-of-the-art performance while providing flexibility in generating customized images with control over shape, color, and realism. Moreover, our method unleashes applications such as editing on real images, generation with partial sketches and strokes, and multi-domain multi-modal synthesis.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Shin-I Cheng (3 papers)
  2. Yu-Jie Chen (13 papers)
  3. Wei-Chen Chiu (54 papers)
  4. Hung-Yu Tseng (31 papers)
  5. Hsin-Ying Lee (60 papers)
Citations (50)

Summary

We haven't generated a summary for this paper yet.