Language models align with human judgments on key grammatical constructions (2402.01676v2)
Published 19 Jan 2024 in cs.CL and cs.AI
Abstract: Do LLMs make human-like linguistic generalizations? Dentella et al. (2023) ("DGL") prompt several LLMs ("Is the following sentence grammatically correct in English?") to elicit grammaticality judgments of 80 English sentences, concluding that LLMs demonstrate a "yes-response bias" and a "failure to distinguish grammatical from ungrammatical sentences". We re-evaluate LLM performance using well-established practices and find that DGL's data in fact provide evidence for just how well LLMs capture human behaviors. Models not only achieve high accuracy overall, but also capture fine-grained variation in human linguistic judgments.
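The "well-established practices" the abstract refers to (cf. Hu & Levy 2023; the BLiMP benchmark) contrast prompting with direct probability measurement: instead of asking a model whether a sentence is grammatical, one checks whether it assigns higher probability to the grammatical member of a minimal pair. Below is a minimal sketch of that evaluation logic using a toy add-one-smoothed bigram model in place of an LLM; the corpus, sentences, and function names are illustrative assumptions, not material from the paper.

```python
import math
from collections import Counter

def train_bigram(corpus):
    """Return a log-probability scorer from a tiny corpus (toy stand-in for an LM)."""
    unigrams, bigrams, vocab = Counter(), Counter(), set()
    for sent in corpus:
        toks = ["<s>"] + sent.split() + ["</s>"]
        vocab.update(toks)
        unigrams.update(toks[:-1])          # contexts
        bigrams.update(zip(toks[:-1], toks[1:]))
    V = len(vocab)

    def logprob(sent):
        toks = ["<s>"] + sent.split() + ["</s>"]
        # Add-one (Laplace) smoothing so unseen bigrams get nonzero probability.
        return sum(
            math.log((bigrams[(a, b)] + 1) / (unigrams[a] + V))
            for a, b in zip(toks[:-1], toks[1:])
        )
    return logprob

corpus = ["the dogs bark", "the dog barks", "a dog barks", "the dogs run"]
logprob = train_bigram(corpus)

# Minimal-pair evaluation: the model is scored "correct" on a pair if the
# grammatical variant receives higher log-probability than its ungrammatical twin.
grammatical, ungrammatical = "the dogs bark", "the dogs barks"
correct = logprob(grammatical) > logprob(ungrammatical)
```

With a real LLM, `logprob` would be replaced by the model's summed token log-probabilities; the comparison step stays the same, which is what makes this method robust to the yes-response bias that direct prompting can induce.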
- D. Birdsong. Metalinguistic Performance and Interlinguistic Competence. Springer Series in Language and Communication. Springer Berlin Heidelberg, 1989. ISBN 978-3-642-74124-1.
- J. Bresnan, A. Cueni, T. Nikitina, and R. H. Baayen. Predicting the dative alternation. In Cognitive Foundations of Interpretation, pages 69–94. KNAW, 2007.
- N. Chomsky. Knowledge of Language: Its Nature, Origin, and Use. Praeger Scientific, 1986.
- A. Clark and S. Lappin. Linguistic Nativism and the Poverty of the Stimulus. John Wiley & Sons, 2010.
- V. Dentella, F. Günther, and E. Leivada. Systematic testing of three Language Models reveals low language accuracy, absence of response stability, and a yes-response bias. Proceedings of the National Academy of Sciences, 120(51):e2309583120, Dec. 2023. doi: 10.1073/pnas.2309583120. URL https://doi.org/10.1073/pnas.2309583120.
- Y. Han. Grammaticality Judgment Tests: How Reliable and Valid Are They? In L. Woytak, editor, Applied Language Learning, volume 11, pages 177–204. 2000.
- J. Hu and R. Levy. Prompting is not a substitute for probability measurements in large language models. In H. Bouamor, J. Pino, and K. Bali, editors, Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, pages 5040–5060, Singapore, Dec. 2023. Association for Computational Linguistics. doi: 10.18653/v1/2023.emnlp-main.306. URL https://aclanthology.org/2023.emnlp-main.306.
- J. Hu, J. Gauthier, P. Qian, E. Wilcox, and R. Levy. A Systematic Assessment of Syntactic Generalization in Neural Language Models. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pages 1725–1744, Online, July 2020. Association for Computational Linguistics. doi: 10.18653/v1/2020.acl-main.158. URL https://aclanthology.org/2020.acl-main.158.
- R. Marvin and T. Linzen. Targeted Syntactic Evaluation of Language Models. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, pages 1192–1202, Brussels, Belgium, Oct. 2018. Association for Computational Linguistics. doi: 10.18653/v1/D18-1151. URL https://aclanthology.org/D18-1151.
- A. Warstadt, A. Parrish, H. Liu, A. Mohananey, W. Peng, S.-F. Wang, and S. R. Bowman. BLiMP: The Benchmark of Linguistic Minimal Pairs for English. Transactions of the Association for Computational Linguistics, 8, 2020. URL https://doi.org/10.1162/tacl_a_00321.
- Jennifer Hu
- Kyle Mahowald
- Gary Lupyan
- Anna Ivanova
- Roger Levy