EEGNet: A Compact Convolutional Neural Network for EEG-based Brain-Computer Interfaces
Introduction
In the paper "EEGNet: A Compact Convolutional Neural Network for EEG-based Brain-Computer Interfaces," the authors address the challenge of designing a single convolutional neural network (CNN) architecture that can be applied across different EEG-based Brain-Computer Interface (BCI) paradigms while remaining compact in its number of parameters. Traditional BCI pipelines typically rely on feature extraction and classification methods tailored to a specific paradigm, which limits how broadly they can be applied. EEGNet aims to overcome these limitations by providing a generalizable and interpretable model capable of handling multiple EEG signal types across paradigms.
Methodology
EEGNet leverages depthwise and separable convolutions to capture EEG-specific features while minimizing the number of learnable parameters. The architecture consists of two main blocks followed by a classification layer (a minimal sketch follows the list below):
- Temporal Convolution and Depthwise Convolution: This block applies a temporal convolution to learn frequency filters, followed by a depthwise convolution to learn frequency-specific spatial filters. This is inspired by EEG-specific strategies like the Filter-Bank Common Spatial Pattern (FBCSP) approach.
- Separable Convolution: This block combines a depthwise convolution, which summarizes each feature map individually over time, with pointwise convolutions that learn how to mix the feature maps together.
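To make the two blocks concrete, below is a minimal sketch of an EEGNet-style model in tf.keras. The function name and the hyperparameters (F1, D, F2, kernel lengths, pooling sizes, dropout rate) are illustrative assumptions chosen to approximate the defaults described in the paper; in practice they would be matched to the sampling rate and dataset, and this is not the authors' reference implementation.

```python
# Minimal EEGNet-style model sketch in tf.keras (illustrative, not the
# authors' reference code). Hyperparameters are assumptions.
import tensorflow as tf
from tensorflow.keras import layers, models, constraints

def eegnet_sketch(n_channels=64, n_samples=128, n_classes=4,
                  F1=8, D=2, F2=16, dropout_rate=0.5):
    inputs = layers.Input(shape=(n_channels, n_samples, 1))

    # Block 1: temporal convolution learns frequency filters ...
    x = layers.Conv2D(F1, (1, 64), padding='same', use_bias=False)(inputs)
    x = layers.BatchNormalization()(x)
    # ... followed by a depthwise convolution that learns D spatial filters
    # per temporal filter, spanning all EEG channels.
    x = layers.DepthwiseConv2D((n_channels, 1), use_bias=False,
                               depth_multiplier=D,
                               depthwise_constraint=constraints.max_norm(1.0))(x)
    x = layers.BatchNormalization()(x)
    x = layers.Activation('elu')(x)
    x = layers.AveragePooling2D((1, 4))(x)
    x = layers.Dropout(dropout_rate)(x)

    # Block 2: separable convolution = depthwise temporal summary of each
    # feature map, then pointwise (1x1) mixing across feature maps.
    x = layers.SeparableConv2D(F2, (1, 16), padding='same', use_bias=False)(x)
    x = layers.BatchNormalization()(x)
    x = layers.Activation('elu')(x)
    x = layers.AveragePooling2D((1, 8))(x)
    x = layers.Dropout(dropout_rate)(x)

    # Classification layer
    x = layers.Flatten()(x)
    outputs = layers.Dense(n_classes, activation='softmax',
                           kernel_constraint=constraints.max_norm(0.25))(x)
    return models.Model(inputs, outputs)
```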
The compact design results in a drastic reduction in the number of parameters compared to existing CNN architectures, making EEGNet highly efficient and easier to train, even on smaller datasets.
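As a quick illustration of that compactness claim, the sketch above can be instantiated and its trainable parameters counted. The exact count depends on the channel count, trial length, and number of classes, but with the small hyperparameter values assumed above it stays on the order of a few thousand, while the paper reports far larger counts for the comparison CNNs.

```python
# Instantiate the sketch above and count trainable parameters.
# The exact count depends on n_channels, n_samples, and n_classes.
model = eegnet_sketch(n_channels=64, n_samples=128, n_classes=4)
model.summary()                                   # layer-by-layer breakdown
print("trainable parameters:", model.count_params())
```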
Datasets and Evaluation
The authors evaluate EEGNet across four distinct BCI paradigms:
- P300 Visual-Evoked Potentials
- Error-Related Negativity (ERN)
- Movement-Related Cortical Potentials (MRCP)
- Sensory Motor Rhythms (SMR)
These datasets cover a wide range of ERP and oscillatory features, enabling a comprehensive assessment of EEGNet's versatility.
Performance Comparison
EEGNet is compared against state-of-the-art methods, including traditional approaches (e.g., xDAWN + Riemannian Geometry for ERP tasks and FBCSP for SMR) and two existing CNN architectures (DeepConvNet and ShallowConvNet):
- Within-Subject Classification: EEGNet matches or exceeds the performance of the competing models on most ERP-based tasks, and shows clear gains on the MRCP dataset, suggesting it copes well with data containing both ERP and oscillatory features.
- Cross-Subject Classification: Here too, EEGNet performs favorably, particularly against DeepConvNet. Cross-subject training pools data from many subjects, and although DeepConvNet benefits from that larger training set, EEGNet remains competitive with substantially fewer parameters (a small sketch contrasting the two evaluation protocols follows this list).
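To make the two evaluation settings concrete, the generic sketch below contrasts a within-subject split with a cross-subject (leave-one-subject-out) split. The arrays and subject labels are synthetic placeholders, and this is not the paper's exact fold scheme.

```python
# Generic illustration of within-subject vs. cross-subject evaluation;
# data and subject labels are synthetic placeholders.
import numpy as np
from sklearn.model_selection import StratifiedKFold, LeaveOneGroupOut

rng = np.random.default_rng(0)
n_trials, n_channels, n_samples = 300, 64, 128
X = rng.standard_normal((n_trials, n_channels, n_samples))
y = rng.integers(0, 2, size=n_trials)            # binary trial labels
subjects = rng.integers(0, 10, size=n_trials)    # which subject produced each trial

# Within-subject: train and test on trials from the same subject.
subj0 = subjects == 0
skf = StratifiedKFold(n_splits=4, shuffle=True, random_state=0)
for train_idx, test_idx in skf.split(X[subj0], y[subj0]):
    pass  # fit on X[subj0][train_idx], evaluate on X[subj0][test_idx]

# Cross-subject: hold out all trials from one subject at a time.
logo = LeaveOneGroupOut()
for train_idx, test_idx in logo.split(X, y, groups=subjects):
    pass  # fit on the remaining subjects' trials, evaluate on the held-out subject
```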
Feature Explainability
An essential aspect of EEGNet is its ability to produce interpretable features:
- Spatial Filter Analysis: The depthwise convolutions facilitate extraction of frequency-specific spatial filters, which can be analyzed using traditional EEG techniques, demonstrating the model's interpretability.
- Convolutional Kernel Visualization: Visualizing the learned kernels, especially in the time-frequency domain, gives insight into the types of neural activity the model captures (a brief sketch of extracting these filters follows this list).
- Single-Trial Relevance Analysis: Utilizing methods like DeepLIFT, the authors demonstrate how EEGNet identifies relevant features in single trials, offering a clear view of the model's decision-making process.
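The sketch below illustrates, assuming the tf.keras model sketched earlier, how the learned temporal and spatial filters could be pulled out of the weights for inspection: the temporal kernels' frequency responses via an FFT, and the depthwise weights as per-channel spatial patterns that can be plotted as scalp topographies. The layer indices and weight shapes follow that sketch and are assumptions, not the authors' analysis code.

```python
# Inspect learned filters from the model sketched earlier (illustrative only).
# In practice, load or train the model first; a fresh model is used here
# only to show the weight shapes involved.
import numpy as np

model = eegnet_sketch(n_channels=64, n_samples=128, n_classes=4)
fs = 128.0  # assumed sampling rate in Hz

# Temporal filters: Conv2D kernel of shape (1, kernel_len, 1, F1).
temporal = model.layers[1].get_weights()[0]        # first Conv2D after the Input layer
kernels = temporal[0, :, 0, :].T                   # (F1, kernel_len)
freqs = np.fft.rfftfreq(kernels.shape[1], d=1.0 / fs)
responses = np.abs(np.fft.rfft(kernels, axis=1))   # frequency response of each filter

# Spatial filters: DepthwiseConv2D kernel of shape (n_channels, 1, F1, D).
depthwise = model.layers[3].get_weights()[0]
spatial = depthwise[:, 0, :, :]                    # (n_channels, F1, D)
# Each spatial[:, f, d] is a channel weighting that can be drawn as a scalp
# map with standard EEG topography tools.
print(responses.shape, spatial.shape)
```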
Implications and Future Research
EEGNet presents substantial implications for the development of generalizable and interpretable BCI systems. Its capacity to adapt across various paradigms without significant performance loss indicates potential applications in a variety of future BCI scenarios, both clinical and non-clinical. As BCI technology moves towards more complex and non-specific use cases, models like EEGNet that minimize the need for paradigm-specific adjustments will become increasingly important.
Continued research may focus on expanding EEGNet's applicability to other EEG features and paradigms, improving its resilience to artifacts, and optimizing its architecture for real-time BCI applications. Additionally, integrating transfer learning techniques could enhance cross-subject performance, further cementing EEGNet's status as a versatile and efficient model for EEG analysis.
Conclusion
EEGNet sets a new standard for the cross-paradigm application of CNNs to EEG-based BCIs. Its compact design, coupled with strong performance and interpretability, underscores the potential of modern convolutional techniques to streamline and enhance EEG signal processing. This work pushes BCI research towards more flexible, efficient, and interpretable machine learning solutions, encouraging broader adoption and continued innovation in the field.