- The paper finds that human word prediction improves gradually with repeated text, while GPT-2 achieves near-perfect accuracy after a single repeat.
- Detailed analysis identifies specific GPT-2 attention heads that drive its rapid pattern recognition, highlighting a contrast with human short-term memory limitations.
- Introducing a human-like recency bias in GPT-2 leads to more aligned predictions with human behavior, though it reduces overall accuracy in non-repeating contexts.
A recently published paper compared human cognitive behavior with the performance of language models (LMs), specifically GPT-2, on word prediction. It centered on a compelling question: do humans and artificial intelligence diverge when predicting repeating text?
The paper first set up an experiment in which human participants were asked to predict the next word in sequences that were repeated up to four times. As expected, human performance improved gradually with each repeat, as growing familiarity with the text helped refine their predictions.
In stark contrast, the paper found that GPT-2 excelled after just one repetition, achieving near-perfect accuracy from then on. This sharp divergence points to a fundamental difference in memory mechanisms: humans rely on relatively fallible short-term memory, whereas GPT-2 can recognize and recall repeated sequences with almost flawless precision.
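To make that comparison concrete, here is a minimal sketch of how one might measure GPT-2's top-1 next-word accuracy on a passage repeated several times, using the Hugging Face `transformers` checkpoint. The passage, repeat count, and scoring here are illustrative stand-ins, not the paper's actual stimuli or metric.

```python
# Minimal sketch (not the paper's exact stimuli or metric): measure GPT-2's
# top-1 next-word accuracy on a short passage repeated several times.
import torch
from transformers import GPT2LMHeadModel, GPT2TokenizerFast

tokenizer = GPT2TokenizerFast.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2").eval()

passage = "the quick brown fox jumps over the lazy dog "  # hypothetical text
n_repeats = 4
ids = tokenizer(passage * n_repeats, return_tensors="pt").input_ids

with torch.no_grad():
    logits = model(ids).logits  # [1, seq_len, vocab]

preds = logits[0, :-1].argmax(-1)   # prediction for each next token
targets = ids[0, 1:]
tokens_per_repeat = ids.shape[1] // n_repeats

# Rough top-1 accuracy per repetition of the passage.
for r in range(n_repeats):
    lo, hi = r * tokens_per_repeat, (r + 1) * tokens_per_repeat
    hi = min(hi, targets.shape[0])
    acc = (preds[lo:hi] == targets[lo:hi]).float().mean().item()
    print(f"repeat {r + 1}: top-1 accuracy = {acc:.2f}")
```

On text like this, accuracy on the first pass is typically modest and jumps sharply once the passage starts repeating, which is the qualitative pattern the paper contrasts with human behavior.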
Upon further analysis, the researchers identified specific attention heads within GPT-2's architecture that drive this pattern recognition. These findings cast doubt on the assumption that LMs closely mimic human cognitive functions.
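One common way to look for such heads, sketched below as an assumed analysis rather than the paper's exact procedure, is to feed the model a repeated random token sequence and score each head by how strongly positions in the second copy attend to the token just after the matching position in the first copy, the classic induction-head pattern.

```python
# Minimal sketch (hypothetical scoring, not the paper's exact analysis):
# score each GPT-2 head by how strongly positions in the second copy of a
# repeated token sequence attend to the position one copy back, offset by
# one -- the classic "induction" attention pattern.
import torch
from transformers import GPT2LMHeadModel

model = GPT2LMHeadModel.from_pretrained("gpt2", output_attentions=True).eval()

L = 50                                    # length of one copy
seq = torch.randint(1000, 5000, (1, L))   # random token ids (assumed valid)
ids = torch.cat([seq, seq], dim=1)        # the sequence repeated once

with torch.no_grad():
    attentions = model(ids).attentions    # tuple of [1, heads, seq, seq]

scores = {}
for layer, attn in enumerate(attentions):
    for head in range(attn.shape[1]):
        # Attention from token i in the second copy to token (i - L + 1),
        # i.e. the token that followed the same token in the first copy.
        rows = torch.arange(L, 2 * L)
        cols = rows - L + 1
        scores[(layer, head)] = attn[0, head, rows, cols].mean().item()

# Heads with the highest induction-like scores.
for (layer, head), s in sorted(scores.items(), key=lambda kv: -kv[1])[:5]:
    print(f"layer {layer}, head {head}: induction score = {s:.2f}")
```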
Seeking to bridge this gap, the researchers introduced a modification that skews the model's attention toward recent information over older context, simulating a recency bias akin to human memory. With this adjustment, the model's behavior more closely resembled that of the human participants, suggesting that such modifications could make LMs better proxies for human cognition.
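The sketch below illustrates the general idea of a recency bias in attention, again as an assumption rather than the paper's exact mechanism: attention scores are penalised in proportion to how far back the attended token sits, so the softmax redistributes weight toward recent context.

```python
# Minimal sketch (a generic recency bias, not necessarily the paper's exact
# mechanism): penalise attention scores in proportion to how far back the
# attended token is, so recent context dominates.
import torch

def recency_biased_attention(q, k, v, decay=0.05):
    """Causal scaled dot-product attention with a linear distance penalty.

    q, k, v: [batch, heads, seq, dim]; decay: penalty per token of distance.
    """
    d = q.shape[-1]
    scores = q @ k.transpose(-2, -1) / d ** 0.5        # [b, h, seq, seq]
    seq = q.shape[-2]
    pos = torch.arange(seq)
    dist = (pos[:, None] - pos[None, :]).clamp(min=0)  # distance to each key
    future = pos[None, :] > pos[:, None]               # mask future tokens
    scores = scores - decay * dist
    scores = scores.masked_fill(future, float("-inf"))
    return torch.softmax(scores, dim=-1) @ v
```

A small `decay` leaves the model close to its original behavior, while a large one effectively confines attention to a short window, loosely mirroring human short-term memory constraints.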
However, this human-like behavior came at a cost: the model's overall word-prediction accuracy dropped on non-repeating text. The trade-off points to an intriguing insight: the LM's exceptional prediction capability may be rooted more in its superior memory recall than in any mimicry of human thought processes.
In conclusion, the paper not only shed light on the distinct memory operations of humans and LMs but also proposed potential next steps. The work implies that, by refining LMs to exhibit behavior closer to human memory patterns, we may move nearer to AI that genuinely reflects human cognitive processes. The findings also hint at optimization opportunities in LM design, perhaps leading to more efficient and effective models in the future.