WordArt Designer API: User-Driven Artistic Typography Synthesis with Large Language Models on ModelScope (2401.01699v2)
Abstract: This paper introduces the WordArt Designer API, a novel framework for user-driven artistic typography synthesis utilizing LLMs on ModelScope. We address the challenge of simplifying artistic typography for non-professionals by offering a dynamic, adaptive, and computationally efficient alternative to traditional rigid templates. Our approach leverages the power of LLMs to understand and interpret user input, facilitating a more intuitive design process. We demonstrate through various case studies how users can articulate their aesthetic preferences and functional requirements, which the system then translates into unique and creative typographic designs. Our evaluations indicate significant improvements in user satisfaction, design flexibility, and creative expression over existing systems. The WordArt Designer API not only democratizes the art of typography but also opens up new possibilities for personalized digital communication and design.
- Jennifer Amar, Olivier Droulers and Patrick Legohérel “Typography in destination advertising: An exploratory study and research perspectives” In Tourism Management 63, 2017, pp. 77–86 DOI: https://doi.org/10.1016/j.tourman.2017.06.002
- “Video ecommerce: Towards online video advertising” In Proceedings of the 24th ACM international conference on Multimedia, 2016, pp. 1365–1374
- “Video ecommerce++: Toward large scale online video advertising” In IEEE transactions on multimedia 19.6 IEEE, 2017, pp. 1170–1183
- “Video2shop: Exact matching clothes in videos to online shopping images” In Proceedings of the IEEE conference on computer vision and pattern recognition, 2017, pp. 4048–4056
- David Turner, Robert Wilhelm and Werner Lemberg “FreeType 2”, 1996 FreeType URL: https://freetype.org/index.html
- “WordArt Designer: User-Driven Artistic Typography Synthesis using Large Language Models” In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
- “Deep Residual Learning for Image Recognition” In 2016 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016, Las Vegas, NV, USA, June 27-30, 2016, 2016, pp. 770–778
- “Differentiable vector graphics rasterization for editing and learning” In SIGGRAPH 39.6, 2020, pp. 193:1–193:15
- “High-Resolution Image Synthesis With Latent Diffusion Models” In CVPR, 2022, pp. 10684–10695
- Sompatu Vungthong, Emilia Djonov and Jane Torr “Images as a Resource for Supporting Vocabulary Learning: A Multimodal Analysis of Thai EFL Tablet Apps for Primary School Children” In TESOL Quarterly 51.1, 2017, pp. 32–58 DOI: https://doi.org/10.1002/tesq.274
- “Adding Conditional Control to Text-to-Image Diffusion Models” In arXiv preprint abs/2302.05543, 2023