The Next Chapter: A Study of Large Language Models in Storytelling (2301.09790v3)
Abstract: To enhance the quality of generated stories, recent story generation models have been investigating the utilization of higher-level attributes like plots or commonsense knowledge. The application of prompt-based learning with LLMs, exemplified by GPT-3, has exhibited remarkable performance in diverse NLP tasks. This paper conducts a comprehensive investigation, utilizing both automatic and human evaluation, to compare the story generation capacity of LLMs with recent models across three datasets with variations in style, register, and length of stories. The results demonstrate that LLMs generate stories of significantly higher quality compared to other story generation models. Moreover, they exhibit a level of performance that competes with human authors, albeit with the preliminary observation that they tend to replicate real stories in situations involving world knowledge, resembling a form of plagiarism.
- Zhuohan Xie (15 papers)
- Trevor Cohn (105 papers)
- Jey Han Lau (67 papers)