Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
41 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

A fine-grained comparison of pragmatic language understanding in humans and language models (2212.06801v2)

Published 13 Dec 2022 in cs.CL and cs.AI

Abstract: Pragmatics and non-literal language understanding are essential to human communication, and present a long-standing challenge for artificial LLMs. We perform a fine-grained comparison of LLMs and humans on seven pragmatic phenomena, using zero-shot prompting on an expert-curated set of English materials. We ask whether models (1) select pragmatic interpretations of speaker utterances, (2) make similar error patterns as humans, and (3) use similar linguistic cues as humans to solve the tasks. We find that the largest models achieve high accuracy and match human error patterns: within incorrect responses, models favor literal interpretations over heuristic-based distractors. We also find preliminary evidence that models and humans are sensitive to similar linguistic cues. Our results suggest that pragmatic behaviors can emerge in models without explicitly constructed representations of mental states. However, models tend to struggle with phenomena relying on social expectation violations.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Jennifer Hu (22 papers)
  2. Sammy Floyd (1 paper)
  3. Olessia Jouravlev (1 paper)
  4. Evelina Fedorenko (19 papers)
  5. Edward Gibson (7 papers)
Citations (41)