Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

FairyTailor: A Multimodal Generative Framework for Storytelling (2108.04324v1)

Published 13 Jul 2021 in cs.CL, cs.AI, and cs.CV

Abstract: Storytelling is an open-ended task that entails creative thinking and requires a constant flow of ideas. Natural language generation (NLG) for storytelling is especially challenging because it requires the generated text to follow an overall theme while remaining creative and diverse to engage the reader. In this work, we introduce a system and a web-based demo, FairyTailor, for human-in-the-loop visual story co-creation. Users can create a cohesive children's fairytale by weaving generated texts and retrieved images with their input. FairyTailor adds another modality and modifies the text generation process to produce a coherent and creative sequence of text and images. To our knowledge, this is the first dynamic tool for multimodal story generation that allows interactive co-formation of both texts and images. It allows users to give feedback on co-created stories and share their results.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Eden Bensaid (1 paper)
  2. Mauro Martino (10 papers)
  3. Benjamin Hoover (18 papers)
  4. Hendrik Strobelt (43 papers)
Citations (16)

Summary

We haven't generated a summary for this paper yet.