Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Human Aesthetic Preference-Based Large Text-to-Image Model Personalization: Kandinsky Generation as an Example (2402.06389v1)

Published 9 Feb 2024 in cs.AI, cs.HC, and cs.MM

Abstract: With the advancement of neural generative capabilities, the art community has actively embraced GenAI (generative artificial intelligence) for creating painterly content. Large text-to-image models can quickly generate aesthetically pleasing outcomes. However, the process can be non-deterministic and often involves tedious trial-and-error, as users struggle with formulating effective prompts to achieve their desired results. This paper introduces a prompting-free generative approach that empowers users to automatically generate personalized painterly content that incorporates their aesthetic preferences in a customized artistic style. This approach involves utilizing ``semantic injection'' to customize an artist model in a specific artistic style, and further leveraging a genetic algorithm to optimize the prompt generation process through real-time iterative human feedback. By solely relying on the user's aesthetic evaluation and preference for the artist model-generated images, this approach creates the user a personalized model that encompasses their aesthetic preferences and the customized artistic style.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Aven-Le Zhou (4 papers)
  2. Yu-Ao Wang (2 papers)
  3. Wei Wu (482 papers)
  4. Kang Zhang (46 papers)
X Twitter Logo Streamline Icon: https://streamlinehq.com