
Measuring Psychological Depth in Language Models (2406.12680v2)

Published 18 Jun 2024 in cs.CL

Abstract: Evaluations of creative stories generated by LLMs often focus on objective properties of the text, such as its style, coherence, and diversity. While these metrics are indispensable, they do not speak to a story's subjective, psychological impact from a reader's perspective. We introduce the Psychological Depth Scale (PDS), a novel framework rooted in literary theory that measures an LLM's ability to produce authentic and narratively complex stories that provoke emotion, empathy, and engagement. We empirically validate our framework by showing that humans can consistently evaluate stories based on PDS (0.72 Krippendorff's alpha). We also explore techniques for automating the PDS to easily scale future analyses. GPT-4o, combined with a novel Mixture-of-Personas (MoP) prompting strategy, achieves an average Spearman correlation of 0.51 with human judgment, while Llama-3-70B with constrained decoding scores as high as 0.68 for empathy. Finally, we compare the depth of stories authored by both humans and LLMs. Surprisingly, GPT-4 stories either surpassed or were statistically indistinguishable from highly-rated human-written stories sourced from Reddit. By shifting the focus from text to reader, the Psychological Depth Scale is a validated, automated, and systematic means of measuring the capacity of LLMs to connect with humans through the stories they tell.
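The abstract reports agreement between automated and human ratings as a Spearman correlation. As a rough illustration of what that statistic measures, the sketch below computes Spearman's rho in plain Python by ranking both score lists (ties share an average rank) and taking the Pearson correlation of the ranks. The function names and the two score lists are hypothetical and made up for illustration; they are not the paper's code or data.

```python
from statistics import mean


def average_ranks(values):
    """Assign ranks 1..n; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with values[order[i]].
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1  # mean of 1-based positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation of two equal-length numeric sequences."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    return cov / (var_x * var_y) ** 0.5


def spearman(x, y):
    """Spearman's rho: Pearson correlation computed on the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


# Hypothetical per-story scores on one PDS component (illustrative only).
human_scores = [4.0, 2.5, 3.0, 5.0, 1.5]
model_scores = [3.5, 2.0, 3.5, 4.5, 2.0]
rho = spearman(human_scores, model_scores)
print(f"Spearman rho = {rho:.2f}")
```

Because only the rank order matters, an automated rater can score highly on this measure even when its absolute scores are offset from the human ones.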

Authors (7)
  1. Fabrice Harel-Canada (6 papers)
  2. Hanyu Zhou (19 papers)
  3. Zeynep Yildiz (2 papers)
  4. Amit Sahai (10 papers)
  5. Nanyun Peng (205 papers)
  6. Sreya Muppalla (3 papers)
  7. Miryung Kim (17 papers)
Citations (3)