Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
80 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

The Infinite Index: Information Retrieval on Generative Text-To-Image Models (2212.07476v2)

Published 14 Dec 2022 in cs.IR, cs.CL, and cs.CV

Abstract: Conditional generative models such as DALL-E and Stable Diffusion generate images based on a user-defined text, the prompt. Finding and refining prompts that produce a desired image has become the art of prompt engineering. Generative models do not provide a built-in retrieval model for a user's information need expressed through prompts. In light of an extensive literature review, we reframe prompt engineering for generative models as interactive text-based retrieval on a novel kind of "infinite index". We apply these insights for the first time in a case study on image generation for game design with an expert. Finally, we envision how active learning may help to guide the retrieval of generated images.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (7)
  1. Niklas Deckers (9 papers)
  2. Maik Fröbe (20 papers)
  3. Johannes Kiesel (8 papers)
  4. Gianluca Pandolfo (1 paper)
  5. Christopher Schröder (9 papers)
  6. Benno Stein (44 papers)
  7. Martin Potthast (64 papers)
Citations (16)