Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
144 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

SPEED: Scalable Preprocessing of EEG Data for Self-Supervised Learning (2408.08065v3)

Published 15 Aug 2024 in eess.SP and cs.AI

Abstract: Electroencephalography (EEG) research typically focuses on tasks with narrowly defined objectives, but recent studies are expanding into the use of unlabeled data within larger models, aiming for a broader range of applications. This addresses a critical challenge in EEG research. For example, Kostas et al. (2021) show that self-supervised learning (SSL) outperforms traditional supervised methods. Given the high noise levels in EEG data, we argue that further improvements are possible with additional preprocessing. Current preprocessing methods often fail to efficiently manage the large data volumes required for SSL, due to their lack of optimization, reliance on subjective manual corrections, and validation processes or inflexible protocols that limit SSL. We propose a Python-based EEG preprocessing pipeline optimized for self-supervised learning, designed to efficiently process large-scale data. This optimization not only stabilizes self-supervised training but also enhances performance on downstream tasks compared to training with raw data.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (23)
  1. “Bendr: using transformers and a contrastive self-supervised learning task to learn from massive amounts of eeg data,” Frontiers in Human Neuroscience, vol. 15, pp. 653659, 2021.
  2. Iyad Obeid, Joseph Picone, “The temple university hospital eeg data corpus,” Frontiers in neuroscience, vol. 10, pp. 195498, 2016.
  3. Arnaud Delorme, Scott Makeig, “Eeglab: an open source toolbox for analysis of single-trial eeg dynamics including independent component analysis,” Journal of neuroscience methods, vol. 134, no. 1, pp. 9–21, 2004.
  4. “Brainstorm: a user-friendly application for meg/eeg analysis,” Computational intelligence and neuroscience, vol. 2011, pp. 1–13, 2011.
  5. “Fieldtrip: open source software for advanced analysis of meg, eeg, and invasive electrophysiological data,” Computational intelligence and neuroscience, vol. 2011, pp. 1–9, 2011.
  6. “The PREP pipeline: standardized preprocessing for large-scale EEG analysis,” Front. Neuroinform., vol. 9, pp. 16, June 2015.
  7. “Automagic: Standardized preprocessing of big EEG data,” Neuroimage, vol. 200, pp. 460–473, Oct. 2019.
  8. “Meg and eeg data analysis with mne-python,” Frontiers in neuroscience, vol. 7, pp. 70133, 2013.
  9. “Faster: fully automated statistical thresholding for eeg artifact rejection,” Journal of neuroscience methods, vol. 192, no. 1, pp. 152–162, 2010.
  10. “Autoreject: Automated artifact rejection for meg and eeg data,” NeuroImage, vol. 159, pp. 417–429, 2017.
  11. “Uncovering the structure of clinical eeg signals with self-supervised learning,” Journal of Neural Engineering, vol. 18, no. 4, pp. 046020, 2021.
  12. “Neuro-gpt: Developing a foundation model for eeg,” arXiv preprint arXiv:2311.03764, 2023.
  13. “Bci2000: a general-purpose brain-computer interface (bci) system,” IEEE Transactions on Biomedical Engineering, vol. 51, no. 6, pp. 1034–1043, 2004.
  14. Wendy Kan Jérémie Mattout, “Bci challenge @ ner 2015,” 2014.
  15. Alain de Cheveigné, “Zapline: A simple and effective method to remove power line artifacts,” NeuroImage, vol. 207, pp. 116356, 2020.
  16. Martin A. Fischler, Robert C. Bolles, “Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography,” Commun. ACM, vol. 24, no. 6, pp. 381–395, jun 1981.
  17. Dezhong Yao, “A method to standardize a reference of scalp eeg recordings to a point at infinity,” Physiological Measurement, vol. 22, no. 4, pp. 693, oct 2001.
  18. “High-pass filters and baseline correction in m/eeg analysis. commentary on: “how inappropriate high-pass filters can produce artefacts and incorrect conclusions in erp studies of language and cognition”,” Journal of Neuroscience Methods, vol. 266, pp. 164–165, 2016.
  19. “Independent component analysis using an extended infomax algorithm for mixed subgaussian and supergaussian sources,” Neural Comput., vol. 11, no. 2, pp. 417–441, Feb. 1999.
  20. “Iclabel: An automated electroencephalographic independent component classifier, dataset, and website,” NeuroImage, vol. 198, pp. 181–197, 2019.
  21. “Spherical splines for scalp potential and current density mapping,” Electroencephalography and Clinical Neurophysiology, vol. 72, no. 2, pp. 184–187, 1989.
  22. “False-positive psychology: Undisclosed flexibility in data collection and analysis allows presenting anything as significant,” Psychological Science, vol. 22, no. 11, pp. 1359–1366, 2011, PMID: 22006061.
  23. Cheng Li, Bingyu Wang, “Fisher linear discriminant analysis,” CCIS Northeastern University, vol. 6, 2014.

Summary

We haven't generated a summary for this paper yet.

X Twitter Logo Streamline Icon: https://streamlinehq.com