fMRI predictors based on language models of increasing complexity recover brain left lateralization
Abstract: Over the past decade, studies of naturalistic language processing where participants are scanned while listening to continuous text have flourished. Using word embeddings at first, then LLMs, researchers have created encoding models to analyze the brain signals. Presenting these models with the same text as the participants allows to identify brain areas where there is a significant correlation between the functional magnetic resonance imaging (fMRI) time series and the ones predicted by the models' artificial neurons. One intriguing finding from these studies is that they have revealed highly symmetric bilateral activation patterns, somewhat at odds with the well-known left lateralization of language processing. Here, we report analyses of an fMRI dataset where we manipulate the complexity of LLMs, testing 28 pretrained models from 8 different families, ranging from 124M to 14.2B parameters. First, we observe that the performance of models in predicting brain responses follows a scaling law, where the fit with brain activity increases linearly with the logarithm of the number of parameters of the model (and its performance on natural language processing tasks). Second, although this effect is present in both hemispheres, it is stronger in the left than in the right hemisphere. Specifically, the left-right difference in brain correlation follows a scaling law with the number of parameters. This finding reconciles computational analyses of brain activity using LLMs with the classic observation from aphasic patients showing left hemisphere dominance for language.
- Scaling laws for language encoding models in fMRI. Advances in Neural Information Processing Systems, 36.
- Qwen technical report. arXiv preprint arXiv:2309.16609.
- Stable lm 2 1.6 b technical report. arXiv preprint arXiv:2402.17834.
- The neurobiology of semantic memory. Trends in Cognitive Sciences, 15(11):527–536.
- Determination of language dominance using functional MRI: a comparison with the Wada test. Neurology, 46(4):978–984.
- Bookheimer, S. (2002). Functional MRI of Language: New Approaches to Understanding the Cortical Organization of Semantic Processing. Annual Review of Neuroscience, 25(1):151–188.
- Measuring language lateralisation with different language tasks: a systematic review. PeerJ, 5:e3929.
- Broca, P. (1865). Sur le siège de la faculté du langage articulé. Bulletins et Mémoires de la Société d’Anthropologie de Paris, 6(1):377–393.
- Brains and algorithms partially converge in natural language processing. Communications biology, 5(1):1–10.
- Information flow across the cortical timescale hierarchy during narrative construction. Proceedings of the National Academy of Sciences, 119(51):e2209307119. Publisher: Proceedings of the National Academy of Sciences.
- Dax, M. D. (1865). Lésions de la moitié gauche de l’encéphale coïncidant avec l’oubli des signes de la pensée: Lu au Congrès méridional tenu à Montpellier en 1836, par le docteur Marc Dax. Gazette Hebdomadaire de Médecine et de Chirurgie, 17:259–260.
- The role of coherence and cohesion in text comprehension: an event-related fMRI study. Cognitive Brain Research, 11(3):325–340.
- Language after section of the cerebral commissures. Brain, 90(1):131–148. Publisher: Oxford University Press.
- Glover, G. H. (1999). Deconvolution of impulse response in event-related BOLD fMRI. Neuroimage, 9(4):416–429.
- Mamba: Linear-time sequence modeling with selective state spaces. arXiv preprint arXiv:2312.00752.
- Language lateralisation measured across linguistic and national boundaries. Cortex, 111:134–147.
- The Hierarchical Cortical Organization of Human Speech Processing. Journal of Neuroscience, 37(27):6539–6557. Publisher: Society for Neuroscience Section: Research Articles.
- Hunter, J. D. (2007). Matplotlib: A 2D graphics environment. Computing in Science & Engineering, 9(3):90–95.
- Natural speech reveals the semantic maps that tile human cerebral cortex. Nature, 532(7600):453–458.
- Incorporating context into language encoding models for fmri. Advances in neural information processing systems, 31.
- Mistral 7B. arXiv preprint arXiv:2310.06825.
- Hemispheric specialization for language. Brain Research Reviews, 44(1):1–12.
- Jung-Beeman, M. (2005). Bilateral brain processes for comprehending natural language. Trends in Cognitive Sciences, 9(11):512–518.
- Brain activation modulated by sentence comprehension. Science, 274(5284):114–116.
- Scaling laws for neural language models. arXiv preprint arXiv:2001.08361.
- A task-optimized neural network replicates human auditory behavior, predicts brain responses, and reveals a cortical processing hierarchy. Neuron, 98(3):630–644.
- The role of the angular gyrus in semantic cognition: a synthesis of five functional neuroimaging studies. Brain Structure and Function, 228(1):273–291.
- A solution to Plato’s problem: The latent semantic analysis theory of acquisition, induction, and representation of knowledge. Psychological review, 104(2):211. Publisher: American Psychological Association.
- Topographic Mapping of a Hierarchy of Temporal Receptive Windows Using a Narrated Story. Journal of Neuroscience, 31(8):2906–2915.
- Le petit prince multilingual naturalistic fmri corpus. Scientific data, 9(1):530.
- An investigation across 45 languages and 12 language families reveals a universal language network. Nature Neuroscience, 25(8):1014–1019.
- Marc Dax and the discovery of the lateralisation of language in the left cerebral hemisphere. Revue Neurologique, 167(12):868–872.
- The Cortical Organization of Syntax. Cerebral Cortex, 30(3):1481–1498.
- McKinney, W. et al. (2010). Data structures for statistical computing in python. In Proceedings of the 9th Python in Science Conference, volume 445, pages 51–56. Austin, TX.
- Pointer sentinel mixture models. arXiv preprint arXiv:1609.07843.
- The neural basis of language development: Changes in lateralization over age. Proceedings of the National Academy of Sciences, 117(38):23477–23483.
- Precision fMRI reveals that the language network exhibits adult-like left-hemispheric lateralization by 4 years of age.
- Cortical representation of the constituent structure of sentences. Proceedings of the National Academy of Sciences, 108(6):2522–2527.
- Neural language models are not born equal to fit brain data, but training helps. arXiv preprint arXiv:2207.03380.
- Information-Restricted Neural Language Models Reveal Different Brain Regions’ Sensitivity to Semantics, Syntax, and Context. Neurobiology of Language, 4(4):611–636.
- Pytorch: An imperative style, high-performance deep learning library. Advances in neural information processing systems, 32.
- The Hub-and-Spoke Hypothesis of Semantic Memory. In Neurobiology of Language, pages 765–775. Elsevier.
- Scikit-learn: Machine learning in Python. Journal of machine learning research, 12(Oct):2825–2830.
- Speech and Brain Mechanisms. Princeton University Press, Princeton, NJ.
- Glove: Global vectors for word representation. In Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP), pages 1532–1543.
- Amodal semantic representations depend on both anterior temporal lobes: Evidence from repetitive transcranial magnetic stimulation. Neuropsychologia, 48(5):1336–1342.
- What the Hand reveals about the Brain. MIT Press, Cambridge, MA.
- Poldrack, R. A. (2011). Inferring mental states from neuroimaging data: From reverse inference to large-scale decoding. Neuron, 72(5):692–697.
- Converging evidence for the neuroanatomic basis of combinatorial semantics in the angular gyrus. Journal of Neuroscience, 35(7):3276–3284.
- Pylkkänen, L. (2019). The neural basis of combinatory syntax and semantics. Science, 366(6461):62–66.
- Language models are unsupervised multitask learners. OpenAI blog, 1(8):9.
- The neural architecture of language: Integrative modeling converges on predictive processing. Proceedings of the National Academy of Sciences, 118(45).
- Brain-score: Which artificial neural network for object recognition is most brain-like? BioRxiv, page 407007.
- statsmodels: Econometric and statistical modeling with python. In 9th Python in Science Conference.
- Functional subdivisions in the left angular gyrus where the semantic system meets and diverges from the default network. The Journal of Neuroscience: The Official Journal of the Society for Neuroscience, 30(50):16809–16817.
- Dynamic reconfiguration of the default mode network during narrative comprehension. Nature Communications, 7(1):12141. Number: 1 Publisher: Nature Publishing Group.
- Semantic dementia and the left and right temporal lobes. Cortex, 107:188–203.
- Localization of syntactic comprehension by positron emission tomography. Brain and language, 52(3):452–473.
- fMRI study of language lateralization in children and adults. Human brain mapping, 27(3):202–212.
- Gemma: Open models based on gemini research and technology. arXiv preprint arXiv:2403.08295.
- Interpreting and improving natural-language processing (in machines) with natural language-processing (in the brain). Advances in neural information processing systems, 32.
- Llama 2: Open foundation and fine-tuned chat models. arXiv preprint arXiv:2307.09288.
- Multi-factorial modulation of hemispheric specialization and plasticity for language in healthy and pathological conditions: A review. Cortex, 86:314–339.
- The numpy array: a structure for efficient numerical computation. Computing in Science & Engineering, 13(2):22.
- Attention is all you need. Advances in neural information processing systems, 30.
- What is right-hemisphere contribution to phonological, lexico-semantic, and sentence processing? NeuroImage, 54(1):577–593.
- Intracarotid Injection of Sodium Amytal for the Lateralization of Cerebral Speech Dominance: Experimental and Clinical Observations. Journal of Neurosurgery, 17(2):266–282. Publisher: Journal of Neurosurgery Publishing Group Section: Journal of Neurosurgery.
- Waskom, M. L. (2021). seaborn: statistical data visualization. Journal of Open Source Software, 6(60):3021.
- Wernicke, C. (1874). Der aphasische Symptomencomplex: eine psychologische Studie auf anatomischer Basis. Cohn & Weigert.
- Cerebral processing of linguistic and emotional prosody: fMRI studies. Progress in brain research, 156:249–268. Publisher: Elsevier.
- Left hemisphere specialization for language in the newborn: Neuroanatomical evidence of asymmetry. Brain, 96(3):641–646.
- Transformers: State-of-the-art natural language processing. In Proceedings of the 2020 conference on empirical methods in natural language processing: system demonstrations, pages 38–45.
- Language cortex activation in normal children. Neurology, 63(6):1035–1044.
- Language in context: emergent features of word, sentence, and narrative comprehension. NeuroImage, 25(3):1002–1015.
- The neurobiological nature of syntactic hierarchies. Neuroscience & Biobehavioral Reviews.
- Reviewing the functional basis of the syntactic Merge mechanism for language: A coordinate-based activation likelihood estimation meta-analysis. Neuroscience & Biobehavioral Reviews, 80:646–656.
- Hellaswag: Can a machine really finish your sentence? arXiv preprint arXiv:1905.07830.
- Opt: Open pre-trained transformer language models. arXiv preprint arXiv:2205.01068.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.