Causal Graph in Language Model Rediscovers Cortical Hierarchy in Human Narrative Processing

Published 17 Nov 2023 in cs.CL | (2311.10431v1)

Abstract: Understanding how humans process natural language has long been a vital research direction. The field of NLP has recently experienced a surge in the development of powerful LLMs. These models have proven to be invaluable tools for studying another complex system known to process human language: the brain. Previous studies have demonstrated that the features of LLMs can be mapped to fMRI brain activity. This raises the question: is there a commonality between information processing in LLMs and the human brain? To estimate information flow patterns in a LLM, we examined the causal relationships between different layers. Drawing inspiration from the workspace framework for consciousness, we hypothesized that features integrating more information would more accurately predict higher hierarchical brain activity. To validate this hypothesis, we classified LLM features into two categories based on causal network measures: 'low in-degree' and 'high in-degree'. We subsequently compared the brain prediction accuracy maps for these two groups. Our results reveal that the difference in prediction accuracy follows a hierarchical pattern, consistent with the cortical hierarchy map revealed by activity time constants. This finding suggests a parallel between how LLMs and the human brain process linguistic information.