Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Unidirectional brain-computer interface: Artificial neural network encoding natural images to fMRI response in the visual cortex (2309.15018v1)

Published 26 Sep 2023 in cs.CV, cs.AI, cs.HC, and q-bio.NC

Abstract: While significant advancements in AI have catalyzed progress across various domains, its full potential in understanding visual perception remains underexplored. We propose an artificial neural network dubbed VISION, an acronym for "Visual Interface System for Imaging Output of Neural activity," to mimic the human brain and show how it can foster neuroscientific inquiries. Using visual and contextual inputs, this multimodal model predicts the brain's functional magnetic resonance imaging (fMRI) scan response to natural images. VISION successfully predicts human hemodynamic responses as fMRI voxel values to visual inputs with an accuracy exceeding state-of-the-art performance by 45%. We further probe the trained networks to reveal representational biases in different visual areas, generate experimentally testable hypotheses, and formulate an interpretable metric to associate these hypotheses with cortical functions. With both a model and evaluation metric, the cost and time burdens associated with designing and implementing functional analysis on the visual cortex could be reduced. Our work suggests that the evolution of computational models may shed light on our fundamental understanding of the visual cortex and provide a viable approach toward reliable brain-machine interfaces.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (29)
  1. “Intermediate acoustic-to-semantic representations link behavioral and neural responses to natural sounds,” Nature Neuroscience, vol. 26, no. 4, pp. 664–672, 2023.
  2. “Characterizing the Ventral Visual Stream with Response-Optimized Neural Encoding Models,” in Advances in Neural Information Processing Systems, 2022.
  3. “Survey of encoding and decoding of visual stimulus via FMRI: An image analysis perspective,” Brain Imaging and Behavior, vol. 8, no. 1, pp. 7–23, 2014.
  4. “A massive 7T fMRI dataset to bridge cognitive neuroscience and artificial intelligence,” Nature Neuroscience, vol. 25, no. 1, pp. 116–126, 2022.
  5. “BOLD5000, a public fMRI dataset while viewing 5000 visual images,” Scientific Data, vol. 6, no. 1, pp. 49, 2019.
  6. Vanvuren Christina, “What is the cost of an MRI?,” 2018.
  7. “Learnable latent embeddings for joint behavioural and neural analysis,” Nature, pp. 1–9, 2023.
  8. “Spiking neural networks,” International Journal of Neural Systems, vol. 19, no. 04, pp. 295–308, 2009.
  9. “Long Short-Term Memory,” Neural Computation, vol. 9, no. 8, pp. 1735–1780, 1997.
  10. “A new approach to extract fetal electrocardiogram using affine combination of adaptive filters,” in ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2023, pp. 1–5.
  11. Yu Takagi and Shinji Nishimoto, “High-resolution image reconstruction with latent diffusion models from human brain activity,” 2022.
  12. “The Algonauts Project 2023 Challenge: How the Human Brain Makes Sense of Natural Scenes,” arXiv:2301.03198, 2023.
  13. “Pqlm-multilingual decentralized portable quantum language model,” in ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2023, pp. 1–5.
  14. “Tatoo: vision-based joint tracking of anatomy and tool for skull-base surgery,” International Journal of Computer Assisted Radiology and Surgery, pp. 1–8, 2023.
  15. “Visual and linguistic semantic representations are aligned at the border of human visual cortex,” Nature Neuroscience, vol. 24, no. 11, pp. 1628–1636, 2021.
  16. “Hierarchical clustering to measure connectivity in fMRI resting-state data,” Magnetic Resonance Imaging, vol. 20, no. 4, pp. 305–317, 2002.
  17. “Score-CAM: Score-Weighted Visual Explanations for Convolutional Neural Networks,” arXiv:1910.01279, 2020.
  18. “BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation,” arXiv:2201.12086, 2022.
  19. “An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale,” in International Conference on Learning Representations, 2020.
  20. “BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding,” in Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), Minneapolis, Minnesota, 2019, pp. 4171–4186, Association for Computational Linguistics.
  21. “MLP-Mixer: An all-MLP Architecture for Vision,” in Advances in Neural Information Processing Systems. 2021, vol. 34, pp. 24261–24272, Curran Associates, Inc.
  22. “Gaussian Error Linear Units (GELUs),” arXiv:1606.08415, 2023.
  23. Laurens van der Maaten and Geoffrey Hinton, “Visualizing Data using t-SNE,” Journal of Machine Learning Research, vol. 9, no. 86, pp. 2579–2605, 2008.
  24. “UMAP: Uniform Manifold Approximation and Projection for Dimension Reduction,” arXiv:1802.03426, 2020.
  25. “Segment Anything,” arXiv:2304.02643, 2023.
  26. “Reproducible scaling laws for contrastive language-image learning,” arXiv:2212.07143, 2022.
  27. “Methods for computing the maximum performance of computational models of fMRI responses,” PLOS Computational Biology, vol. 15, no. 3, pp. e1006397, 2019.
  28. “Algorithms for Hyper-Parameter Optimization,” in Advances in Neural Information Processing Systems. 2011, vol. 24, Curran Associates, Inc.
  29. “Predicting brain activity using Transformers,” Preprint, Neuroscience, 2023.
User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (10)
  1. Ruixing Liang (77 papers)
  2. Xiangyu Zhang (328 papers)
  3. Qiong Li (86 papers)
  4. Lai Wei (68 papers)
  5. Hexin Liu (35 papers)
  6. Avisha Kumar (4 papers)
  7. Kelley M. Kempski Leadingham (1 paper)
  8. Joshua Punnoose (1 paper)
  9. Leibny Paola Garcia (14 papers)
  10. Amir Manbachi (3 papers)