Papers
Topics
Authors
Recent
2000 character limit reached

The neural dynamics of auditory word recognition and integration

Published 22 May 2023 in cs.CL and q-bio.NC | (2305.13388v2)

Abstract: Listeners recognize and integrate words in rapid and noisy everyday speech by combining expectations about upcoming content with incremental sensory evidence. We present a computational model of word recognition which formalizes this perceptual process in Bayesian decision theory. We fit this model to explain scalp EEG signals recorded as subjects passively listened to a fictional story, revealing both the dynamics of the online auditory word recognition process and the neural correlates of the recognition and integration of words. The model reveals distinct neural processing of words depending on whether or not they can be quickly recognized. While all words trigger a neural response characteristic of probabilistic integration -- voltage modulations predicted by a word's surprisal in context -- these modulations are amplified for words which require more than roughly 150 ms of input to be recognized. We observe no difference in the latency of these neural responses according to words' recognition times. Our results are consistent with a two-part model of speech comprehension, combining an eager and rapid process of word recognition with a temporally independent process of word integration. However, we also developed alternative models of the scalp EEG signal not incorporating word recognition dynamics which showed similar performance improvements. We discuss potential future modeling steps which may help to separate these hypotheses.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (42)
  1. Optuna: A next-generation hyperparameter optimization framework. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining.
  2. Tracking the time course of spoken word recognition using eye movements: Evidence for continuous mapping models. Journal of memory and language, 38(4):419–439.
  3. Algorithms for hyper-parameter optimization. Advances in neural information processing systems, 24.
  4. GPT-Neo: Large Scale Autoregressive Language Modeling with Mesh-Tensorflow.
  5. Trevor Brothers and Gina R Kuperberg. 2021. Word predictability effects are linear, not logarithmic: Implications for probabilistic models of sentence comprehension. Journal of Memory and Language, 116:104174.
  6. Multiple predictions during language comprehension: Friends, foes, or indifferent companions? Cognition, 241:105602.
  7. Colin Brown and Peter Hagoort. 1993. The Processing Nature of the N400: Evidence from Masked Priming. Journal of Cognitive Neuroscience, 5(1):34–44.
  8. Language models are few-shot learners. Advances in neural information processing systems, 33:1877–1901.
  9. Marc Brysbaert and Boris New. 2009. Moving beyond Kučera and Francis: A critical evaluation of current word frequency norms and the introduction of a new and improved word frequency measure for American English. Behavior Research Methods, 41(4):977–990.
  10. Charlotte Caucheteux and Jean-Rémi King. 2022. Brains and algorithms partially converge in natural language processing. Communications biology, 5(1):134.
  11. The multivariate temporal response function (mtrf) toolbox: A matlab toolbox for relating neural signals to continuous stimuli. Frontiers in Human Neuroscience, 10.
  12. Probabilistic word pre-activation during language comprehension inferred from electrical brain activity. Nature Neuroscience, 8(8):1117–1121.
  13. Peter W. Donhauser and Sylvain Baillet. 2020. Two Distinct Neural Timescales for Predictive Speech Processing. Neuron, 105(2):385–393.e9.
  14. Kara D Federmeier. 2007. Thinking ahead: The role and roots of prediction in language comprehension. Psychophysiology, 44(4):491–505.
  15. Kara D Federmeier and Marta Kutas. 1999. A rose by any other name: Long-term memory structure and sentence processing. Journal of memory and Language, 41(4):469–495.
  16. Kara D Federmeier and Sarah Laszlo. 2009. Time for meaning: Electrophysiology provides insights into the dynamics of representation and processing in semantic memory. Psychology of learning and motivation, 51:1–44.
  17. The ERP response to the amount of information conveyed by words in sentences. Brain and Language, 140:1–11.
  18. Neural Markers of Speech Comprehension: Measuring EEG Tracking of Linguistic Speech Representations, Controlling the Speech Acoustics. Journal of Neuroscience, 41(50):10316–10329.
  19. Shared computational principles for language processing in humans and deep language models. Nature Neuroscience, 25(3):369–380.
  20. François Grosjean. 1980. Spoken word recognition processes and the gating paradigm. Perception & Psychophysics, 28(4):267–283.
  21. Peter Hagoort. 2008. The fractionation of spoken language understanding by measuring electrical and magnetic brain signals. Philosophical Transactions of the Royal Society B: Biological Sciences, 363(1493):1055–1069.
  22. A hierarchy of linguistic predictions during natural language comprehension. Proceedings of the National Academy of Sciences, 119(32):e2201968119.
  23. A tale of two positivities and the n400: Distinct neural signatures are evoked by confirmed and violated predictions at different levels of representation. Journal of Cognitive Neuroscience, 32(1):12–35.
  24. Gina R. Kuperberg and T. Florian Jaeger. 2016. What do we mean by prediction in language comprehension? Language, Cognition and Neuroscience, 31(1):32–59.
  25. Marta Kutas and Kara D. Federmeier. 2011. Thirty Years and Counting: Finding Meaning in the N400 Component of the Event-Related Brain Potential (ERP). Annual Review of Psychology, 62(1):621–647.
  26. Marta Kutas and Steven A. Hillyard. 1984. Brain potentials during reading reflect word expectancy and semantic association. Nature, 307(5947):161–163.
  27. Resolving precise temporal processing properties of the auditory system using continuous stimuli. Journal of Neurophysiology, 102(1):349–359. PMID: 19439675.
  28. Falk Lieder and Thomas L Griffiths. 2020. Resource-rational analysis: Understanding human cognition as the optimal use of limited computational resources. Behavioral and brain sciences, 43:e1.
  29. William D Marslen-Wilson. 1987. Functional parallelism in spoken word-recognition. Cognition, 25(1-2):71–102.
  30. D. Norris and J. McQueen. 2008. Shortlist B: A Bayesian model of continuous speech recognition. Psychological review.
  31. Timothy B O'Rourke and Phillip J Holcomb. 2002. Electrophysiological evidence for the efficiency of spoken word processing. Biological psychology, 60(2-3):121–150.
  32. Language models are unsupervised multitask learners. OpenAI blog, 1(8):9.
  33. The neural architecture of language: Integrative modeling converges on predictive processing. Proceedings of the National Academy of Sciences, 118(45):e2105646118.
  34. Herbert A Simon. 1955. A behavioral model of rational choice. The quarterly journal of economics, pages 99–118.
  35. Nathaniel J Smith and Roger Levy. 2013. The effect of word predictability on reading time is logarithmic. Cognition, 128(3):302–319.
  36. How inappropriate high-pass filters can produce artifactual effects and incorrect conclusions in erp studies of language and cognition. Psychophysiology, 52(8):997–1009.
  37. Diana, a process-oriented model of human auditory word recognition. Brain Sciences, 12(5):681.
  38. The massive auditory lexical decision (mald) database. Behavior Research Methods, 51:1187 – 1204.
  39. The cascaded nature of lexical selection and integration in auditory sentence processing. Journal of Experimental Psychology: Learning, Memory, and Cognition, 32(2):364–372.
  40. Speech understanding oppositely affects acoustic and linguistic neural tracking in a speech rate manipulation paradigm. Journal of Neuroscience, 42(39):7442–7453.
  41. Specific lexico-semantic predictions are associated with unique spatial and temporal patterns of neural activity. Elife, 7:e39061.
  42. Andrea Weber and Roel Smits. 2003. Consonant And Vowel Confusion Patterns By American English Listeners. page 4.

Summary

We haven't generated a summary for this paper yet.

Whiteboard

Paper to Video (Beta)

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Continue Learning

We haven't generated follow-up questions for this paper yet.

Authors (2)

Collections

Sign up for free to add this paper to one or more collections.