Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Evaluating Large Language Model Creativity from a Literary Perspective (2312.03746v1)

Published 30 Nov 2023 in cs.CL, cs.AI, and cs.LG

Abstract: This paper assesses the potential for LLMs to serve as assistive tools in the creative writing process, by means of a single, in-depth case study. In the course of the study, we develop interactive and multi-voice prompting strategies that interleave background descriptions (scene setting, plot elements), instructions that guide composition, samples of text in the target style, and critical discussion of the given samples. We qualitatively evaluate the results from a literary critical perspective, as well as from the standpoint of computational creativity (a sub-field of artificial intelligence). Our findings lend support to the view that the sophistication of the results that can be achieved with an LLM mirrors the sophistication of the prompting.

Insights into Evaluating LLM Creativity from a Literary Perspective

This paper, titled "Evaluating LLM Creativity from a Literary Perspective," conducts a case paper to explore the potential of LLMs as tools for assisting in creative writing, evaluating them from both literary and computational creativity angles. The paper highlights three main approaches: creative dialogue, varying sampling temperature, and a multi-voice generation experiment. The findings suggest that the sophistication of the output from an LLM closely correlates with the complexity and creativity embedded in the user prompts.

Summary of the Investigations

The focal point of the research is a speculative fiction project, involving a 16th-century character, Effie, who experiences a time-travel-induced encounter with the 21st century. The authors used this narrative as a testbed to examine various interactions with a LLM (GPT-4), seeking to derive insights into the model's creative capacities.

Techniques Explored

  1. Creative Dialogue: Here, the LLM interacts in a dialogue format where the human plays the critic or mentor, creating an iterative process of drafting and refinement. The paper emphasizes the potential of LLMs to improve textual drafts through structured feedback and stylistic suggestions. Analysis revealed the ability of the model to produce increasingly sophisticated text as interactions progressed.
  2. Raising the Temperature: This technique involves altering the temperature parameter of the model to change the randomness of the text output. Lower temperatures result in more deterministic responses, while higher temperatures foster creative, albeit sometimes risky and unintelligible outputs. This parameter tweaking unleashes the model’s capability to generate text that ventures into unpredictable, experimental linguistic territory, useful for sparking creative ideas.
  3. Multi-Voice Generation: The authors created a dialogue within the model itself by simulating both an "author" and a "mentor" role. This approach demonstrated the model's capacity to provide self-assessment and critique, showcasing its potential for autonomous role-playing and multi-character narrative construction, which introduced a new character, Margaret, into the narrative effortlessly.

Critical Evaluation

The paper critically evaluates the model outputs using traditional literary criticism methods, focusing on style, imagery, narrative consistency, and thematic elements. The research underscores the importance of interaction between human and AI, positing that LLMs can significantly contribute to creative writing when effectively guided by sophisticated, contextual prompts.

The paper argues that this interactive potential exhibits a form of computational creativity, where the model reveals a degree of autonomy and introduces novel elements into its narratives, pushing the usual boundaries of human–AI collaboration in literature.

Implications and Future Outlook

The implications of this research extend into multiple domains. Practically, this paper suggests that LLMs could be incorporated as interactive tools in creative industries to supplement and enhance human creativity. Theoretically, it raises questions about the role of computational entities in narrative creation, blurring the lines between human-authored and machine-authored texts.

Looking forward, such studies encourage further development of AI's role in co-creative processes. The potential for AI models to autonomously critique and creatively collaborate opens new paths in interactive storytelling and content generation, necessitating continued inquiry into ethical and copyright considerations and the impacts on traditional creative sectors.

Conclusion

In all, this paper demonstrates that careful prompting and interactive engagement with LLMs can facilitate creative processes akin to those exhibited by humans, without undermining the intrinsic human value of narrative art. It highlights the dual capacity of LLMs to both imitate existing literary techniques and introduce novel narrative variations, expanding our understanding of machine-aided creativity.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (25)
  1. Automatic story generation: A survey of approaches. ACM Computing Surveys, 54(5), 2021.
  2. On the dangers of stochastic parrots: Can language models be too big? In Proceedings of the 2021 ACM Conference on Fairness, Accountability, and Transparency, pages 610–623, 2021.
  3. H. Bloom. The Anxiety of Influence: A Theory of Poetry. Oxford University Press, New York and Oxford, second edition, 1997. p. 11.
  4. M. Boden. The Creative Mind: Myths and Mechanisms. Taylor & Francis, second edition, 2004.
  5. Rethink reporting of evaluation results in ai. Science, 380(6641):136–138, 2023.
  6. Converging on the divergent: The history (and future) of the international joint workshops in computational creativity. AI Magazine, 30(3):15–22, July 2009.
  7. G. Chaucer. The parliament of fowls. In Benson, Larry D. (ed.), The Riverside Chaucer: A New Edition, pages 383–394, lines 22–23 and 24. Oxford University Press, 1987.
  8. E. Cherry and C. Latulipe. Quantifying the creativity support of digital tools through the creativity support index. ACM Transactions on Computer-Human Interaction, 21(4), 2014.
  9. S. Colton and G. A. Wiggins. Computational creativity: The final frontier? In Proceedings of the 20th European Conference on Artificial Intelligence, pages 21–26, 2012.
  10. G. Franceschelli and M. Musolesi. DeepCreativity: measuring creativity with deep learning techniques. Intelligenza Artificiale, 16(2):151–163, 2022.
  11. A. Jordanous. Four PPPPerspectives on computational creativity in theory and in practice. Connection Science, 28(2):194–216, 2016.
  12. Evaluating computational creativity: An interdisciplinary tutorial. ACM Computing Surveys, 51(2), 2018.
  13. CoAuthor: Designing a human-AI collaborative writing dataset for exploring language model capabilities. In Proceedings of the 2022 CHI Conference on Human Factors in Computing Systems, CHI ’22, 2022.
  14. A. B. Lord. The singer of tales. Harvard University Press, 1960.
  15. F. P. J. Magoun. Oral-formulaic character of anglo-saxon narrative poetry. Speculum, 28(3):446–467, 1953.
  16. A. Minnis. Medieval Theory of Authorship: Scholastic Literary Attitudes in the Later Middle Ages. University of Pennsylvania Press, Philadelphia, second edition, 2012.
  17. Co-writing screenplays and theatre scripts with language models: Evaluation by industry professionals. In Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems, 2023.
  18. OpenAI. GPT-4 Technical Report. arXiv preprint, arXiv:2303.08774, 2023.
  19. M. Parry. L’epithèt traditionnelle dans Homère. Société d’éditions, Paris, 1928.
  20. M. Rhodes. An analysis of creativity. The Phi Delta Kappan, 42(7):305–310, 1961.
  21. M. Shanahan. Talking about large language models. arXiv preprint, arXiv:2212.03551, 2023.
  22. Role-play with large language models. Nature, 623:493–498, 2023.
  23. LaMDA: Language models for dialog applications. arXiv preprint, arXiv:2201.08239, 2022.
  24. M. Ward. Victorian fictions of computational creativity. In S. Cave, K. Dihal, and S. Dillon, editors, AI Narratives: A History of Imaginative Thinking about Intelligent Machines, pages 144–164. Oxford University Press, 2020.
  25. Ethical and social risks of harm from language models. arXiv preprint arXiv:2112.04359, 2021.
User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (2)
  1. Murray Shanahan (46 papers)
  2. Catherine Clarke (1 paper)
Citations (8)