BrainWave: A Brain Signal Foundation Model for Clinical Applications (2402.10251v6)
Abstract: Neural electrical activity is fundamental to brain function, underlying a range of cognitive and behavioral processes, including movement, perception, decision-making, and consciousness. Abnormal patterns of neural signaling often indicate the presence of underlying brain diseases. The variability among individuals, the diverse array of clinical symptoms from various brain disorders, and the limited availability of diagnostic classifications, have posed significant barriers to formulating reliable model of neural signals for diverse application contexts. Here, we present BrainWave, the first foundation model for both invasive and non-invasive neural recordings, pretrained on more than 40,000 hours of electrical brain recordings (13.79 TB of data) from approximately 16,000 individuals. Our analysis show that BrainWave outperforms all other competing models and consistently achieves state-of-the-art performance in the diagnosis and identification of neurological disorders. We also demonstrate robust capabilities of BrainWave in enabling zero-shot transfer learning across varying recording conditions and brain diseases, as well as few-shot classification without fine-tuning, suggesting that BrainWave learns highly generalizable representations of neural signals. We hence believe that open-sourcing BrainWave will facilitate a wide range of clinical applications in medicine, paving the way for AI-driven approaches to investigate brain disorders and advance neuroscience research.
- EEG Signal Analysis for Diagnosing Neurological Disorders Using Discrete Wavelet Transform and Intelligent Techniques. Sensors 20 (2020).
- Diego Alvarez-Estevez and Roselyne Rijsman. 2022. Haaglanden Medisch Centrum sleep staging database (version 1.1). https://doi.org/10.13026/t79q-fr32.
- Diego Alvarez-Estevez and Roselyne M Rijsman. 2021. Inter-database validation of a deep learning approach for automatic sleep scoring. PloS one 16 (2021).
- Sequential Modeling Enables Scalable Learning for Large Vision Models. arXiv:2312.00785 [cs.CV]
- AASM scoring manual updates for 2017 (version 2.4).
- On the Opportunities and Risks of Foundation Models. arXiv:2108.07258 [cs.LG]
- MBrain: A Multi-Channel Self-Supervised Learning Framework for Brain Signals. In Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining.
- EEG/SEEG Signal Modelling using Frequency and Fractal Analysis.. In BIOSIGNALS.
- BrainNet: Epileptic Wave Detection from SEEG with Hierarchical Graph Diffusion Learning. In Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining.
- Previous, current, and future stereotactic EEG techniques for localising epileptic foci. Expert Review of Medical Devices 19 (2022), 571–580.
- Paolo Detti. 2020. Siena Scalp EEG Database (version 1.0.0). https://doi.org/10.13026/5d4a-j060.
- EEG Synchronization Analysis for Seizure Prediction: A Study on Data of Noninvasive Recordings. Processes 8 (2020).
- Improved manual annotation of EEG signals through convolutional neural network guidance. Eneuro 9 (2022).
- SimMTM: A Simple Pre-Training Framework for Masked Time-Series Modeling. In Thirty-seventh Conference on Neural Information Processing Systems.
- Differential entropy feature for EEG-based emotion classification. In 6th International IEEE/EMBS Conference on Neural Engineering (NER).
- Stereotactic EEG Practices: A Survey of United States Tertiary Referral Epilepsy Centers. Journal of Clinical Neurophysiology 39 (2022).
- PhysioBank, PhysioToolkit, and PhysioNet: Components of a New Research Resource for Complex Physiologic Signals. Circulation 101 (2000).
- John Guttag. 2010. CHB-MIT Scalp EEG Database. PhysioNet. https://doi.org/10.13026/C2K01R
- The TUH EEG CORPUS: A big data resource for automated EEG interpretation. In 2014 IEEE signal processing in medicine and biology symposium (SPMB).
- Exploiting Interactivity and Heterogeneity for Sleep Stage Classification Via Heterogeneous Graph Neural Network. In ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
- Analysis of EEG Signal for the Detection of Brain Abnormalities.
- Kranti Kamble and Joydeep Sengupta. 2023. A comprehensive survey on emotion recognition based on electroencephalograph (EEG) signals. Multimedia Tools and Applications (2023), 1–36.
- Analysis of a sleep-dependent neuronal feedback loop: the slow-wave microcontinuity of the EEG. IEEE Transactions on Biomedical Engineering 47 (2000).
- Cost-effectiveness analysis of invasive EEG monitoring in drug-resistant epilepsy. Epilepsy & Behavior 114 (2021).
- Beyond Scale: the Diversity Coefficient as a Data Quality Metric Demonstrates LLMs are Pre-trained on Formally Diverse Data. arXiv:2306.13840 [cs.CL]
- From Quantity to Quality: Boosting LLM Performance with Self-Guided Data Selection for Instruction Tuning. arXiv:2308.12032 [cs.CL]
- Removing artefacts and periodically retraining improve performance of neural network-based seizure prediction models. Scientific Reports (2023).
- Ilya Loshchilov and Frank Hutter. 2017. Decoupled weight decay regularization. arXiv preprint arXiv:1711.05101 abs/1711.05101 (2017).
- Zhengqing Miao and Meirong Zhao. 2023a. Time-space-frequency feature Fusion for 3-channel motor imagery classification. arXiv preprint arXiv:2304.01461 abs/2304.01461 (2023).
- Zhengqing Miao and Meirong Zhao. 2023b. Time-space-frequency feature Fusion for 3-channel motor imagery classification. arXiv:2304.01461 [cs.LG]
- Santiago Morales and Maureen Bowers. 2022. Time-frequency analysis methods and their application in developmental EEG data. Developmental Cognitive Neuroscience 54 (2022).
- Multicenter intracranial EEG dataset for classification of graphoelements and artifactual signals. Scientific data 7 (2020).
- A Time Series is Worth 64 Words: Long-term Forecasting with Transformers. In The Eleventh International Conference on Learning Representations.
- XSleepNet: Multi-view sequential model for automatic sleep staging. IEEE Transactions on Pattern Analysis and Machine Intelligence 44 (2021).
- Huy Phan and Kaare Mikkelsen. 2022. Automatic sleep staging of EEG signals: recent development, challenges, and future directions. Physiological Measurement 43 (2022).
- L-SeqSleepNet: Whole-cycle Long Sequence Modeling for Automatic Sleep Staging. IEEE Journal of Biomedical and Health Informatics 27 (2023).
- Language models are unsupervised multitask learners. OpenAI blog 1 (2019).
- Searching for Activation Functions. CoRR abs/1710.05941 (2017).
- BCI2000: A General-Purpose Brain-Computer Interface (BCI) System. IEEE Transactions on Biomedical Engineering 51 (2004).
- Ali Hossam Shoeb. 2009. Application of machine learning to epileptic seizure onset detection and treatment. Ph. D. Dissertation. Massachusetts Institute of Technology.
- EEG Emotion Recognition Using Dynamical Graph Convolutional Neural Networks. IEEE Transactions on Affective Computing 11 (2020).
- EEG Conformer: Convolutional Transformer for EEG Decoding and Visualization. IEEE Transactions on Neural Systems and Rehabilitation Engineering 31 (2023).
- Akara Supratak and Yike Guo. 2020. TinySleepNet: An efficient deep learning model for sleep stage scoring based on raw single-channel EEG. In 2020 42nd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC).
- LLaMA: Open and Efficient Foundation Language Models. arXiv:2302.13971 [cs.CL]
- Llama 2: Open Foundation and Fine-Tuned Chat Models. arXiv:2307.09288 [cs.CL]
- Attention Is All You Need. arXiv:1706.03762 [cs.CL]
- BrainBERT: Self-supervised representation learning for intracranial recordings. In The Eleventh International Conference on Learning Representations.
- InternImage: Exploring Large-Scale Vision Foundation Models With Deformable Convolutions. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
- TimesNet: Temporal 2D-Variation Modeling for General Time Series Analysis. In The Eleventh International Conference on Learning Representations.
- Baichuan 2: Open Large-scale Language Models. arXiv:2309.10305 [cs.CL]
- Learning Topology-Agnostic EEG Representations with Geometry-Aware Modeling. In Thirty-seventh Conference on Neural Information Processing Systems.
- An overview of power spectral density (PSD) calculations. In Optical Manufacturing and Testing VI.
- Florence: A New Foundation Model for Computer Vision. CoRR abs/2111.11432 (2021).
- PPi: Pretraining Brain Signal Model for Patient-independent Seizure Detection. In Thirty-seventh Conference on Neural Information Processing Systems.
- Biao Zhang and Rico Sennrich. 2019. Root Mean Square Layer Normalization. CoRR abs/1910.07467 (2019).
- Brant: Foundation Model for Intracranial Neural Signal. In Thirty-seventh Conference on Neural Information Processing Systems.
- A survey on deep learning-based non-invasive brain signals: recent advances and new frontiers. Journal of Neural Engineering 18 (2021).
- Self-supervised contrastive pre-training for time series via time-frequency consistency. Advances in Neural Information Processing Systems 35 (2022).
- ScatterFormer: Locally-Invariant Scattering Transformer for Patient-Independent Multispectral Detection of Epileptiform Discharges. arXiv:2304.14919 [eess.SP]
- Wei-Long Zheng and Bao-Liang Lu. 2015. Investigating Critical Frequency Bands and Channels for EEG-based Emotion Recognition with Deep Neural Networks. IEEE Transactions on Autonomous Mental Development 7, 3 (2015).
- One Fits All: Power General Time Series Analysis by Pretrained LM. In Thirty-seventh Conference on Neural Information Processing Systems.