Towards auditory attention decoding with noise-tagging: A pilot study (2403.15523v2)
Abstract: Auditory attention decoding (AAD) aims to extract from brain activity the attended speaker amidst candidate speakers, offering promising applications for neuro-steered hearing devices and brain-computer interfacing. This pilot study makes a first step towards AAD using the noise-tagging stimulus protocol, which evokes reliable code-modulated evoked potentials, but is minimally explored in the auditory modality. Participants were sequentially presented with two Dutch speech stimuli that were amplitude-modulated with a unique binary pseudo-random noise-code, effectively tagging these with additional decodable information. We compared the decoding of unmodulated audio against audio modulated with various modulation depths, and a conventional AAD method against a standard method to decode noise-codes. Our pilot study revealed higher performances for the conventional method with 70 to 100 percent modulation depths compared to unmodulated audio. The noise-code decoder did not further improve these results. These fundamental insights highlight the potential of integrating noise-codes in speech to enhance auditory speaker detection when multiple speakers are presented simultaneously.
- E. Colin Cherry “Some Experiments on the Recognition of Speech, with One and with Two Ears” In The Journal of the Acoustical Society of America 25.5, 2005, pp. 975–979 DOI: 10.1121/1.1907229
- Nai Ding and Jonathan Z. Simon “Emergence of neural encoding of auditory objects while listening to competing speakers” In Proceedings of the National Academy of Sciences 109.29, 2012, pp. 11854–11859 DOI: 10.1073/pnas.1205381109
- “Electroencephalography-Based Auditory Attention Decoding: Toward Neurosteered Hearing Devices” In IEEE Signal Processing Magazine 38.4, 2021, pp. 89–102 DOI: 10.1109/MSP.2021.3075932
- “Attentional selection in a cocktail party environment can be decoded from single-trial EEG” In Cerebral cortex 25.7 Oxford University Press, 2015, pp. 1697–1706
- Edmund C. Lalor and John J. Foxe “Neural responses to uninterrupted natural speech can be extracted with precise temporal resolution” In European Journal of Neuroscience 31.1, 2010, pp. 189–193 DOI: https://doi.org/10.1111/j.1460-9568.2009.07055.x
- Nai Ding and Jonathan Z. Simon “Neural coding of continuous speech in auditory cortex during monaural and dichotic listening” PMID: 21975452 In Journal of Neurophysiology 107.1, 2012, pp. 78–89 DOI: 10.1152/jn.00297.2011
- “Decoding the auditory brain with canonical component analysis” In NeuroImage 172, 2018, pp. 206–216 DOI: https://doi.org/10.1016/j.neuroimage.2018.01.033
- “Extracting multidimensional stimulus-response correlations using hybrid encoding-decoding of neural activity” New advances in encoding and decoding of brain signals In NeuroImage 180, 2018, pp. 134–146 DOI: https://doi.org/10.1016/j.neuroimage.2017.05.037
- “Brain–Computer Interfaces based on Code-Modulated Visual Evoked Potentials (c-VEP): A Literature Review” In Journal of Neural Engineering 18.6 IOP Publishing, 2021, pp. 061002
- “From full calibration to zero training for a code-modulated visual evoked potentials for brain–computer interface” In Journal of Neural Engineering 18.5 IOP Publishing, 2021, pp. 056007
- “Estimating and approaching the maximum information rate of noninvasive visual brain-computer interface” In NeuroImage Elsevier, 2024, pp. 120548 DOI: 10.1016/j.neuroimage.2024.120548
- “Towards a noise-tagging auditory BCI-paradigm” In Proceedings of the 4th International Brain–Computer Interface Workshop and Training Course 2008. Graz, Austria, 2008 URL: https://www.tugraz.at/fileadmin/user_upload/Institute/INE/Proceedings/Proceedings_BCI_Conference_2008.pdf
- “Radioboeken voor Kinderden” In Radioboeken • deBuren, 2007 URL: https://deburen.eu/project/4/radioboeken
- “The Effect of Head-Related Filtering and Ear-Specific Decoding Bias on Auditory Attention Detection” In Journal of Neural Engineering 13, 2016, pp. 056014 DOI: 10.1088/1741-2560/13/5/056014
- R. Gold “Optimal Binary Sequences for Spread Spectrum Multiplexing” In IEEE Transactions on Information Theory 13.4 IEEE, 1967, pp. 619–621
- “Broad-Band visually evoked potentials: re(con)volution in brain-computer interfacing” In PLOS ONE 10.7 Public Library of Science, 2015, pp. e0133797 DOI: 10.1371/journal.pone.0133797
- “Auditory-Inspired Speech Envelope Extraction Methods for Improved EEG-Based Auditory Attention Detection in a Cocktail Party Scenario” In IEEE Transactions on Neural Systems and Rehabilitation Engineering 25.5, 2017, pp. 402–412 DOI: 10.1109/TNSRE.2016.2571900