
PromptCrafter: Crafting Text-to-Image Prompt through Mixed-Initiative Dialogue with LLM (2307.08985v1)

Published 18 Jul 2023 in cs.HC and cs.AI

Abstract: Text-to-image generation models can generate images across a diverse range of subjects and styles from a single prompt. Recent work has proposed a variety of interaction methods that help users understand and use the capabilities of these models. However, how to help users efficiently explore a model's capabilities and create effective prompts remains an open research question. In this paper, we present PromptCrafter, a novel mixed-initiative system that supports step-by-step crafting of text-to-image prompts. Through this iterative process, users can efficiently explore the model's capabilities and clarify their intent. PromptCrafter also helps users refine prompts by giving various responses to clarifying questions generated by an LLM. Lastly, users can review the work history and revert to a desired step. In this workshop paper, we discuss the design process of PromptCrafter and our plans for follow-up studies.

Authors (5)
  1. Seungho Baek (2 papers)
  2. Hyerin Im (1 paper)
  3. Jiseung Ryu (1 paper)
  4. Juhyeong Park (3 papers)
  5. Takyeon Lee (1 paper)
Citations (4)