
Alljoined1 -- A dataset for EEG-to-Image decoding (2404.05553v3)

Published 8 Apr 2024 in q-bio.NC and cs.AI

Abstract: We present Alljoined1, a dataset built specifically for EEG-to-Image decoding. Recognizing that an extensive and unbiased sampling of neural responses to visual stimuli is crucial for image reconstruction efforts, we collected data from 8 participants looking at 10,000 natural images each. We have currently gathered 46,080 epochs of brain responses recorded with a 64-channel EEG headset. The dataset combines response-based stimulus timing, repetition between blocks and sessions, and diverse image classes with the goal of improving signal quality. For transparency, we also provide data quality scores. We publicly release the dataset and all code at https://linktr.ee/alljoined1.


Summary

  • The paper introduces a novel EEG dataset recording 46,080 epochs from eight participants to enhance image reconstruction research.
  • It employs tailored stimulus presentation and rigorous preprocessing to boost signal quality and achieve a high signal-to-noise ratio.
  • The study’s comprehensive dataset supports advanced EEG-to-image decoding applications in real-time BCIs and clinical diagnostics.

Overview of "Alljoined1 -- A dataset for EEG-to-Image decoding"

The paper "Alljoined1 -- A dataset for EEG-to-Image decoding" introduces a comprehensive dataset tailored for EEG-to-image decoding applications. This dataset, named Alljoined1, addresses several limitations of previous EEG-to-image datasets and is designed to support robust, generalizable image reconstruction. It is significant for cognitive neuroscience and brain-computer interface (BCI) research, offering insight into how the human brain encodes and processes visual information.

Key Dataset Features

The authors have compiled data from eight participants, each viewing 10,000 natural images, with 46,080 epochs of brain responses recorded to date via a 64-channel EEG headset. The stimuli, drawn from the MS-COCO dataset, were presented in a manner designed to maximize the signal-to-noise ratio (SNR), achieved through carefully chosen trial durations, repetitions across sessions and blocks, and a broad array of image classes. The dataset also includes detailed data quality scores to ensure transparency and reliability.

Methodological Contributions

Several methodological innovations are central to the dataset:

  1. Tailored Stimulus Presentation: Trial durations and repetitions within and between blocks are specifically designed to enhance the SNR (a toy illustration of how repetition improves SNR follows this list).
  2. Diverse Image Set: The inclusion of 9,000 unique naturalistic images per participant and 1,000 shared images among participants ensures a wide variety of stimuli, critical for generalizable image decoding.
  3. Qualitative Comparisons: Comparisons against existing datasets underscore the superior design and potential of Alljoined for advancing EEG-based image reconstruction.
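
Why repetition helps can be seen with a minimal numpy simulation (an illustrative sketch, not the paper's analysis): averaging N repetitions of the same stimulus suppresses independent trial noise, so the power SNR of the average grows roughly linearly with N (amplitude SNR scales with about sqrt(N)). The epoch length, noise level, and stand-in evoked waveform below are assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

n_times = 200                                          # samples per epoch (assumed)
signal = np.sin(np.linspace(0, 4 * np.pi, n_times))    # stand-in evoked response
noise_sd = 3.0                                         # single-trial noise dominates

def snr_of_average(n_reps: int, n_sims: int = 500) -> float:
    """Mean power SNR of the average over n_reps noisy repetitions of one stimulus."""
    snrs = []
    for _ in range(n_sims):
        trials = signal + rng.normal(0, noise_sd, size=(n_reps, n_times))
        avg = trials.mean(axis=0)
        snrs.append(signal.var() / (avg - signal).var())
    return float(np.mean(snrs))

for n in (1, 4, 16):
    print(f"{n:2d} repetitions -> SNR ~ {snr_of_average(n):.2f}")
# The printed SNR grows roughly linearly with the number of repetitions.
```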

Comparative Analysis

In comparison with existing datasets such as Brain2Image and ThoughtViz, Alljoined addresses several biases and limitations. Brain2Image, for instance, has been criticized for an acquisition design that inadvertently boosts model performance by introducing extraneous proxy information. The more diverse and extensive stimuli used in Alljoined reduce the risk of block-specific correlations and enhance the dataset's generalizability. Moreover, Alljoined's design mitigates the tendency of models to perform classification rather than genuine reconstruction, a phenomenon observed in datasets with only a small number of image classes.
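
To make the block-design concern concrete, the following toy simulation (not from the paper) shows how a classifier can reach far-above-chance accuracy when each class is recorded in its own contiguous block: a slow, block-specific drift acts as a proxy label even though the simulated trials carry no class information at all. The array sizes and drift model are illustrative assumptions.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score

rng = np.random.default_rng(1)

n_classes, trials_per_class, n_features = 5, 60, 64
X, y = [], []
for c in range(n_classes):
    # One contiguous block per class: a class-specific baseline drift is shared
    # by every trial in the block, while the per-trial "neural" part is pure noise.
    block_drift = rng.normal(0, 1.0, n_features)
    trials = block_drift + rng.normal(0, 1.0, (trials_per_class, n_features))
    X.append(trials)
    y.append(np.full(trials_per_class, c))

X, y = np.vstack(X), np.concatenate(y)
acc = cross_val_score(LogisticRegression(max_iter=1000), X, y, cv=5).mean()
print(f"decoding accuracy with block-confounded noise: {acc:.2f} (chance = 0.20)")
```

Diverse stimuli interleaved across blocks and sessions, as in Alljoined, remove this shortcut because no single block drift is predictive of an image class.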

Signal Quality and Data Processing

The EEG data underwent rigorous preprocessing, including band-pass filtering, ICA-based artifact removal, and baseline correction, with each step chosen to preserve the integrity of the neural signals. Preprocessing was carried out with the MNE-Python library, ensuring that well-established methods are employed.
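
A minimal sketch of this kind of MNE-Python pipeline is shown below; the file name, filter band, number of ICA components, excluded component, and epoch window are placeholder assumptions rather than the paper's actual settings.

```python
import mne

# Hypothetical file name and parameters; the paper's exact settings may differ.
raw = mne.io.read_raw_fif("subject01_raw.fif", preload=True)

# Band-pass filter to remove slow drifts and high-frequency noise.
raw.filter(l_freq=0.5, h_freq=95.0)

# ICA for artifact removal (e.g. eye blinks); component selection is manual here.
ica = mne.preprocessing.ICA(n_components=20, random_state=0)
ica.fit(raw)
ica.exclude = [0]          # assume the first component captures blink artifacts
ica.apply(raw)

# Epoch around stimulus onsets with baseline correction over the pre-stimulus window.
events = mne.find_events(raw)
epochs = mne.Epochs(raw, events, tmin=-0.2, tmax=0.8,
                    baseline=(None, 0), preload=True)
```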

Analysis of Results

The authors provide extensive analyses of the recorded EEG data, including event-related potentials (ERPs) and SNR metrics. Consistent neural activity patterns were observed across individuals, with strong ERP signals peaking between 250 and 300 ms post-stimulus. The SNR analysis highlights the importance of repeated measures across sessions for accurate signal quality assessment.
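
The quantities involved can be illustrated with a short sketch: an ERP is the average over repeated presentations of a stimulus, and a crude power SNR can be estimated from the variance of that average relative to the trial-to-trial residual. The array shapes and the SNR formula below are illustrative assumptions, not the paper's exact metric.

```python
import numpy as np

def erp_and_snr(epochs_data: np.ndarray) -> tuple[np.ndarray, float]:
    """ERP and a crude power-SNR estimate for repeated presentations of one image.

    epochs_data: shape (n_repetitions, n_channels, n_times).
    """
    erp = epochs_data.mean(axis=0)        # ERP: average over repetitions
    residual = epochs_data - erp          # per-trial deviation from the ERP
    return erp, float(erp.var() / residual.var())

# Synthetic check: a shared sinusoidal "response" buried in trial noise.
rng = np.random.default_rng(0)
response = np.sin(np.linspace(0, 6 * np.pi, 256))
fake = response + rng.normal(0, 1.0, size=(4, 64, 256))
erp, snr = erp_and_snr(fake)
print(erp.shape, round(snr, 2))           # (64, 256) and the SNR estimate
```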

Implications and Future Directions

Practically, the dataset could serve as a cornerstone for advances in real-time BCIs and clinical diagnostic tools; the portability and cost-effectiveness of EEG make it preferable to fMRI for such applications. Theoretically, the dataset enriches our understanding of brain dynamics, particularly how visual information is processed in real time. Proposed future directions include high-density EEG recordings focused on occipital and parietal regions and an exploration of generalization to imagined (rather than viewed) imagery.

Conclusion

Alljoined represents a substantial contribution to the domain of EEG-to-image decoding by addressing significant limitations of previous datasets and offering a robust, generalizable dataset. The dataset's potential applications span from enhancing our understanding of visual processing mechanisms to practical BCI implementations. The authors' future work promises to further extend the applicability and utility of this dataset in various cognitive and clinical settings.

Data and Code Availability

The Alljoined dataset, along with the necessary code for stimuli and preprocessing, is made publicly available to promote transparency and facilitate further research in the field. Researchers can access these resources via designated OSF and GitHub links.

By providing a well-documented, high-quality dataset, this paper establishes a new standard for EEG-to-image decoding research, likely spurring subsequent studies and innovative applications within the domain.