Mission: Impossible Language Models (2401.06416v2)

Published 12 Jan 2024 in cs.CL, cs.AI, and cs.LG

Abstract: Chomsky and others have very directly claimed that LLMs are equally capable of learning languages that are possible and impossible for humans to learn. However, there is very little published experimental evidence to support such a claim. Here, we develop a set of synthetic impossible languages of differing complexity, each designed by systematically altering English data with unnatural word orders and grammar rules. These languages lie on an impossibility continuum: at one end are languages that are inherently impossible, such as random and irreversible shuffles of English words, and on the other, languages that may not be intuitively impossible but are often considered so in linguistics, particularly those with rules based on counting word positions. We report on a wide range of evaluations to assess the capacity of GPT-2 small models to learn these uncontroversially impossible languages, and crucially, we perform these assessments at various stages throughout training to compare the learning process for each language. Our core finding is that GPT-2 struggles to learn impossible languages when compared to English as a control, challenging the core claim. More importantly, we hope our approach opens up a productive line of inquiry in which different LLM architectures are tested on a variety of impossible languages in an effort to learn more about how LLMs can be used as tools for these cognitive and typological investigations.

Summary

  • The paper shows that GPT-2 struggles to learn synthetically generated impossible languages compared to natural English, challenging previous claims.
  • The paper employs perplexity evaluation, surprisal analysis, and causal abstraction analysis to show that the statistical patterns of natural English align better with GPT-2’s representations than those of the impossible languages.
  • The paper finds that GPT-2 learns token-counting verb marker placement (TokenHop) more readily than word-counting placement (WordHop), pointing to information locality as an inductive bias.

This paper, "Mission: Impossible LLMs," investigates the claim that LLMs are equally capable of learning both possible and impossible human languages. The authors challenge this assertion by training GPT-2 small models on a set of synthetically generated "impossible" languages and comparing their performance to that of models trained on English. The core finding is that GPT-2 struggles to learn these impossible languages compared to English, thus questioning the initial claim.

The paper defines a spectrum of impossible languages based on their complexity, ranging from entirely random word orderings to more subtly altered languages with unnatural grammar rules, specifically those dependent on counting word positions. The authors generate these languages by systematically perturbing the BabyLM dataset, an English-language dataset designed to simulate the linguistic input available to a child.

The impossible languages are categorized into three classes (a sketch of representative perturbation functions follows the list):

  1. *Shuffle Languages: These involve different ways of shuffling the tokens of English sentences, including:
    • NoShuffle (English, as a control)
    • NondeterministicShuffle (random shuffling)
    • DeterministicShuffle (shuffling based on sentence length and a random seed)
    • LocalShuffle (shuffling within a fixed-size window)
    • EvenOddShuffle (even-indexed tokens followed by odd-indexed tokens)
  2. *Reverse Languages: These involve reversing all or part of sentences:
    • NoReverse (English with an inserted reversal marker, as a control)
    • PartialReverse (reversal marker followed by reversed tokens)
    • FullReverse (entire sentence reversed after a reversal marker)
  3. *Hop Languages: These languages manipulate verb inflection by placing number/tense markers at different positions relative to the verb:
    • NoHop (English-like with verb markers immediately after the verb, as a control)
    • TokenHop (verb marker placed 4 tokens after the verb)
    • WordHop (verb marker placed 4 words after the verb, skipping punctuation)
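
The paper's released data pipeline is not reproduced here; the following minimal Python sketch only illustrates what such perturbation functions might look like, operating on sentences that are already tokenized into lists of strings. The function names, the reversal marker string, the default window size, and the clipping behavior are illustrative assumptions rather than the authors' exact implementation.

```python
import random

def nondeterministic_shuffle(tokens):
    # Irreversible: a fresh random permutation is drawn for every sentence.
    shuffled = list(tokens)
    random.shuffle(shuffled)
    return shuffled

def deterministic_shuffle(tokens, seed=0):
    # The permutation is a deterministic function of sentence length and a fixed seed,
    # so sentences of the same length are shuffled the same way.
    rng = random.Random(seed + len(tokens))
    shuffled = list(tokens)
    rng.shuffle(shuffled)
    return shuffled

def local_shuffle(tokens, window=3):
    # Shuffle tokens only within consecutive fixed-size windows.
    out = []
    for i in range(0, len(tokens), window):
        chunk = list(tokens[i:i + window])
        random.shuffle(chunk)
        out.extend(chunk)
    return out

def even_odd_shuffle(tokens):
    # Even-indexed tokens first, then odd-indexed tokens.
    return list(tokens[0::2]) + list(tokens[1::2])

def partial_reverse(tokens, marker="<REV>", position=None):
    # Insert a reversal marker and reverse everything after it
    # (the marker position here defaults to the midpoint for illustration).
    position = len(tokens) // 2 if position is None else position
    return list(tokens[:position]) + [marker] + list(tokens[position:])[::-1]

def token_hop(tokens, verb_index, marker, hop=4):
    # Place the verb's number/tense marker `hop` tokens after the verb
    # (clipped to the sentence end in this simplified version).
    out = list(tokens)
    out.insert(min(verb_index + 1 + hop, len(out)), marker)
    return out
```

For example, even_odd_shuffle(["the", "cat", "sat", "down"]) yields ["the", "sat", "cat", "down"], illustrating the kind of position-counting rule the paper treats as impossible.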

The authors conduct three main experiments:

  1. Perplexity Evaluation: GPT-2 models are trained on each language, and their perplexities on held-out test sets are measured throughout training (see the evaluation sketch after this list). Models trained on the possible (control) languages reach lower perplexities more quickly, indicating greater learning efficiency. The NondeterministicShuffle language is the hardest to learn, while the *Hop languages are nearly as easy to learn as their control.
  2. Surprisal Analysis: This experiment focuses on the *Hop languages and measures the surprisal of the singular/plural verb marker tokens. The NoHop model exhibits the largest surprisal difference between expected and unexpected marker positions, suggesting that it has learned the natural grammatical pattern better than the models trained on the impossible languages. The TokenHop model performs better than the WordHop model, indicating that GPT-2 learns the verb-marking rule more easily when the counting units are tokens rather than words.
  3. Causal Abstraction Analysis: This experiment uses interchange interventions to identify representations within the *Hop models that causally mediate subject-verb agreement (a sketch of such an intervention follows below). The analysis reveals that all three *Hop models develop similar modular solutions, tracking agreement through representations at the relevant positions, but the NoHop model reaches higher intervention accuracy earlier in training.
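
On the evaluation side, the paper trains GPT-2 small from scratch on each perturbed corpus and tracks perplexity over the course of training. The snippet below is only a sketch of how per-token surprisal and perplexity can be read off a causal language model's logits, using a pretrained HuggingFace GPT-2 checkpoint as a stand-in for the paper's from-scratch models.

```python
import torch
from transformers import GPT2LMHeadModel, GPT2TokenizerFast

tokenizer = GPT2TokenizerFast.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2")
model.eval()

def token_surprisals(text):
    # Surprisal (in bits) of each token given its left context.
    ids = tokenizer(text, return_tensors="pt").input_ids
    with torch.no_grad():
        logits = model(ids).logits            # [1, seq_len, vocab]
    log_probs = torch.log_softmax(logits[0, :-1], dim=-1)
    targets = ids[0, 1:]
    nll_nats = -log_probs[torch.arange(targets.size(0)), targets]
    return nll_nats / torch.log(torch.tensor(2.0))  # nats -> bits

def perplexity(text):
    # Perplexity is 2 ** (mean per-token surprisal in bits).
    return float(2 ** token_surprisals(text).mean())
```

In the surprisal analysis, the quantity of interest is the difference between such per-token surprisals at expected versus unexpected verb-marker positions.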
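
For the causal abstraction analysis, the basic operation is an interchange intervention. The paper uses its own analysis tooling; the sketch below is a hypothetical illustration using a PyTorch forward hook: copy the hidden state at one layer and token position from a "source" run into a "base" run and inspect how the next-token distribution changes. The layer and position defaults are placeholders, and the position must index a token present in both sentences.

```python
import torch
from transformers import GPT2LMHeadModel, GPT2TokenizerFast

tokenizer = GPT2TokenizerFast.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2")
model.eval()

def interchange(base_text, source_text, layer=6, position=3):
    base = tokenizer(base_text, return_tensors="pt").input_ids
    source = tokenizer(source_text, return_tensors="pt").input_ids

    # Cache the source run's hidden state after transformer block `layer`
    # at the chosen token position (hidden_states[0] is the embedding output).
    with torch.no_grad():
        src_hidden = model(source, output_hidden_states=True).hidden_states[layer + 1]
    patch = src_hidden[:, position, :]

    def hook(module, inputs, output):
        # GPT-2 blocks return a tuple; output[0] is the hidden-state tensor.
        hidden = output[0]
        hidden[:, position, :] = patch
        return (hidden,) + output[1:]

    handle = model.transformer.h[layer].register_forward_hook(hook)
    try:
        with torch.no_grad():
            logits = model(base).logits
    finally:
        handle.remove()

    # Next-token distribution at the last position of the base sentence.
    return torch.softmax(logits[0, -1], dim=-1)
```

Intervention accuracy can then be estimated over many base/source pairs by checking whether the patched model's prediction at the marker position follows the number of the source sentence's subject.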

The paper concludes that GPT-2 models struggle to learn impossible languages compared to natural ones, contradicting claims that LLMs cannot distinguish between possible and impossible languages. The authors suggest that information locality, the tendency for statistical correlations to be short-range, might be an inductive bias in GPT models that matches natural language and explains these results. They propose further exploration of the boundaries between possible and impossible languages, treating LLMs as a comparative system for understanding human language.

The appendix provides details on the dataset filtering, model hyperparameters, additional results for models trained without positional encodings, constituency probing experiments, and detailed results for DeterministicShuffle experiments.
