Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Plot and Rework: Modeling Storylines for Visual Storytelling (2105.06950v3)

Published 14 May 2021 in cs.CL and cs.AI

Abstract: Writing a coherent and engaging story is not easy. Creative writers use their knowledge and worldview to put disjointed elements together to form a coherent storyline, and work and rework iteratively toward perfection. Automated visual storytelling (VIST) models, however, make poor use of external knowledge and iterative generation when attempting to create stories. This paper introduces PR-VIST, a framework that represents the input image sequence as a story graph in which it finds the best path to form a storyline. PR-VIST then takes this path and learns to generate the final story via an iterative training process. This framework produces stories that are superior in terms of diversity, coherence, and humanness, per both automatic and human evaluations. An ablation study shows that both plotting and reworking contribute to the model's superiority.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Chi-Yang Hsu (5 papers)
  2. Yun-Wei Chu (13 papers)
  3. Ting-Hao 'Kenneth' Huang (42 papers)
  4. Lun-Wei Ku (35 papers)
Citations (25)

Summary

We haven't generated a summary for this paper yet.