Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Deep Image Style Transfer from Freeform Text (2212.06868v1)

Published 13 Dec 2022 in cs.CV, cs.CL, and cs.LG

Abstract: This paper creates a novel method of deep neural style transfer by generating style images from freeform user text input. The LLM and style transfer model form a seamless pipeline that can create output images with similar losses and improved quality when compared to baseline style transfer methods. The LLM returns a closely matching image given a style text and description input, which is then passed to the style transfer model with an input content image to create a final output. A proof-of-concept tool is also developed to integrate the models and demonstrate the effectiveness of deep image style transfer from freeform text.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Tejas Santanam (8 papers)
  2. Mengyang Liu (16 papers)
  3. Jiangyue Yu (2 papers)
  4. Zhaodong Yang (2 papers)

Summary

We haven't generated a summary for this paper yet.