2000 character limit reached
Deep Image Style Transfer from Freeform Text (2212.06868v1)
Published 13 Dec 2022 in cs.CV, cs.CL, and cs.LG
Abstract: This paper creates a novel method of deep neural style transfer by generating style images from freeform user text input. The LLM and style transfer model form a seamless pipeline that can create output images with similar losses and improved quality when compared to baseline style transfer methods. The LLM returns a closely matching image given a style text and description input, which is then passed to the style transfer model with an input content image to create a final output. A proof-of-concept tool is also developed to integrate the models and demonstrate the effectiveness of deep image style transfer from freeform text.
- Tejas Santanam (8 papers)
- Mengyang Liu (16 papers)
- Jiangyue Yu (2 papers)
- Zhaodong Yang (2 papers)