Measuring Meaning Composition in the Human Brain with Composition Scores from Large Language Models (2403.04325v3)

Published 7 Mar 2024 in cs.CL and cs.AI

Abstract: The process of meaning composition, wherein smaller units like morphemes or words combine to form the meaning of phrases and sentences, is essential for human sentence comprehension. Despite extensive neurolinguistic research into the brain regions involved in meaning composition, a computational metric to quantify the extent of composition is still lacking. Drawing on the key-value memory interpretation of transformer feed-forward network blocks, we introduce the Composition Score, a novel model-based metric designed to quantify the degree of meaning composition during sentence comprehension. Experimental findings show that this metric correlates with brain clusters associated with word frequency, structural processing, and general sensitivity to words, suggesting the multifaceted nature of meaning composition during human sentence comprehension.


Summary

  • The paper introduces the Composition Score to quantify meaning composition in human sentence comprehension using transformer-based LLM insights.
  • It demonstrates that the Composition Score correlates with brain regions involved in word frequency, structural, and general linguistic processing.
  • This approach bridges computational models with neurolinguistic research, offering a new tool for investigating semantic processing in the brain.

The paper "Measuring Meaning Composition in the Human Brain with Composition Scores from LLMs" introduces an innovative approach to quantify the process of meaning composition in human sentence comprehension. This process is fundamental in linguistics, as it involves combining smaller language units into meaningful phrases and sentences, a subject extensively studied in neurolinguistics. However, quantifying this cognitive process computationally has proven challenging.

The authors propose a novel metric called the Composition Score. This metric leverages key insights from transformer architectures, specifically the key-value memory mechanism found in the feed-forward network blocks of LLMs. By interpreting these network blocks, the Composition Score is designed to evaluate the degree of meaning composition occurring during the comprehension of a sentence.
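The key-value memory view the metric builds on can be sketched in code. In this interpretation (Geva et al., 2021), each column of the FFN's first weight matrix acts as a "key" whose activation coefficient weights a corresponding "value" row in the second matrix. The snippet below is a minimal illustrative sketch with random toy weights, not the paper's implementation; the `composition_score` proxy here (the fraction of activated memories) is an assumption for illustration, not the authors' exact definition.

```python
import numpy as np

def ffn_key_value(h, W_k, W_v):
    """Key-value memory view of a transformer FFN block.

    Each column of W_k is a 'key'; its (ReLU) activation coefficient
    weights the corresponding 'value' row of W_v in the output.
    """
    coeffs = np.maximum(h @ W_k, 0.0)   # memory activation coefficients
    out = coeffs @ W_v                  # output = weighted sum of values
    return out, coeffs

def composition_score(coeffs):
    """Illustrative proxy only: fraction of memories activated for a
    token, as a crude stand-in for how much stored knowledge the FFN
    is composing at this step."""
    return float((coeffs > 0).mean())

# Toy dimensions and random weights stand in for a trained LLM layer.
rng = np.random.default_rng(0)
d_model, d_ff = 16, 64
W_k = rng.normal(size=(d_model, d_ff))
W_v = rng.normal(size=(d_ff, d_model))
h = rng.normal(size=(d_model,))        # hidden state for one token

out, coeffs = ffn_key_value(h, W_k, W_v)
score = composition_score(coeffs)
```

In the actual study, such per-token quantities derived from a trained model's FFN blocks are aggregated into the Composition Score and regressed against fMRI activity during naturalistic sentence comprehension.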

Key findings from the paper include:

  • Correlation with Brain Activity: The Composition Score was found to correlate with specific brain clusters that are involved in linguistic processing. These clusters are associated with:
    • Word Frequency: Areas sensitive to how often words occur in language use.
    • Structural Processing: Regions involved in understanding the grammar and syntax of sentences.
    • General Sensitivity to Words: Brain areas that respond to words in a broad linguistic context.

These correlations suggest that the Composition Score captures a multifaceted aspect of meaning composition linked to several distinct cognitive processes in the brain. This underscores the complexity of meaning-making in human language comprehension and opens new avenues for research into the computational modeling of semantic processing in the brain.

Overall, the development of the Composition Score represents a significant step in bridging computational models and neurolinguistic research, providing a new tool for exploring how meaning is constructed and represented in the human brain.