2000 character limit reached
The Infinite Index: Information Retrieval on Generative Text-To-Image Models (2212.07476v2)
Published 14 Dec 2022 in cs.IR, cs.CL, and cs.CV
Abstract: Conditional generative models such as DALL-E and Stable Diffusion generate images based on a user-defined text, the prompt. Finding and refining prompts that produce a desired image has become the art of prompt engineering. Generative models do not provide a built-in retrieval model for a user's information need expressed through prompts. In light of an extensive literature review, we reframe prompt engineering for generative models as interactive text-based retrieval on a novel kind of "infinite index". We apply these insights for the first time in a case study on image generation for game design with an expert. Finally, we envision how active learning may help to guide the retrieval of generated images.
- Niklas Deckers (9 papers)
- Maik Fröbe (20 papers)
- Johannes Kiesel (8 papers)
- Gianluca Pandolfo (1 paper)
- Christopher Schröder (9 papers)
- Benno Stein (44 papers)
- Martin Potthast (64 papers)