MindFormer: Semantic Alignment of Multi-Subject fMRI for Brain Decoding (2405.17720v2)

Published 28 May 2024 in cs.CV, cs.AI, and cs.LG

Abstract: Visual decoding from fMRI signals has attracted considerable attention in the research community. Still, multi-subject fMRI decoding with a single model has been considered intractable because fMRI signals vary drastically between subjects and even within the same subject across trials. To address these limitations, we introduce a novel semantic alignment method for multi-subject fMRI signals called MindFormer. The model is designed to generate fMRI-conditioned feature vectors that can condition a Stable Diffusion model for fMRI-to-image generation or an LLM for fMRI-to-text generation. More specifically, MindFormer incorporates two key innovations: 1) a subject-specific token that captures individual differences in fMRI signals while allowing multi-subject fMRI data to be combined for training, and 2) a novel feature embedding and training scheme based on the IP-Adapter to extract semantically meaningful features from fMRI signals. Our experimental results demonstrate that MindFormer generates semantically consistent images and text across different subjects. Because MindFormer maintains semantic fidelity by fully utilizing training data across subjects and significantly surpasses existing models in multi-subject brain decoding, it may help deepen our understanding of how neural processing varies among individuals.
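
The abstract does not say how the subject-specific token enters the model. As a rough illustration only, the sketch below assumes one vector per subject that is prepended to a sequence of embedded fMRI chunks before a shared encoder. Every name here (EMBED_DIM, NUM_SUBJECTS, linear_embed, build_sequence), the chunking, and the linear projection are hypothetical choices for this example, not details taken from the paper.

```python
import random

EMBED_DIM = 4       # hypothetical embedding width
NUM_SUBJECTS = 3    # hypothetical number of subjects
CHUNK_SIZE = 5      # hypothetical number of voxels per chunk

rng = random.Random(0)

# One vector per subject. In a trained model these would be learned;
# here they are random placeholders.
subject_tokens = [
    [rng.gauss(0.0, 1.0) for _ in range(EMBED_DIM)]
    for _ in range(NUM_SUBJECTS)
]

# Shared projection weights (assumed linear map, EMBED_DIM x CHUNK_SIZE).
weights = [
    [rng.gauss(0.0, 1.0) for _ in range(CHUNK_SIZE)]
    for _ in range(EMBED_DIM)
]


def linear_embed(voxels, weights):
    """Project one chunk of voxel values to an EMBED_DIM vector."""
    return [sum(w * v for w, v in zip(row, voxels)) for row in weights]


def build_sequence(subject_id, voxel_chunks, weights):
    """Return [subject token, embedded chunk 1, embedded chunk 2, ...]."""
    tokens = [linear_embed(chunk, weights) for chunk in voxel_chunks]
    return [subject_tokens[subject_id]] + tokens


# Toy input: two chunks of made-up voxel values for one trial.
chunks = [
    [0.1, 0.2, 0.3, 0.4, 0.5],
    [0.5, 0.4, 0.3, 0.2, 0.1],
]
seq_a = build_sequence(0, chunks, weights)
seq_b = build_sequence(1, chunks, weights)
```

Under these assumptions, the same fMRI input yields sequences that share every embedded chunk and differ only in the leading token, which is one way a single set of shared weights could be trained on data pooled from several subjects.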

Authors (3)

Inhwa Han (3 papers)
Jaayeon Lee (2 papers)
Jong Chul Ye (210 papers)

Tweets

https://twitter.com/CSVisionPapers/status/1795860899711930871

https://twitter.com/realmofresearch/status/1796520700318196061
