Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Parallel Backpropagation for Shared-Feature Visualization (2405.09827v2)

Published 16 May 2024 in cs.CV and cs.LG

Abstract: High-level visual brain regions contain subareas in which neurons appear to respond more strongly to examples of a particular semantic category, like faces or bodies, rather than objects. However, recent work has shown that while this finding holds on average, some out-of-category stimuli also activate neurons in these regions. This may be due to visual features common among the preferred class also being present in other images. Here, we propose a deep-learning-based approach for visualizing these features. For each neuron, we identify relevant visual features driving its selectivity by modelling responses to images based on latent activations of a deep neural network. Given an out-of-category image which strongly activates the neuron, our method first identifies a reference image from the preferred category yielding a similar feature activation pattern. We then backpropagate latent activations of both images to the pixel level, while enhancing the identified shared dimensions and attenuating non-shared features. The procedure highlights image regions containing shared features driving responses of the model neuron. We apply the algorithm to novel recordings from body-selective regions in macaque IT cortex in order to understand why some images of objects excite these neurons. Visualizations reveal object parts which resemble parts of a macaque body, shedding light on neural preference of these objects.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (39)
  1. The Fusiform Face Area: A Module in Human Extrastriate Cortex Specialized for Face Perception. The Journal of Neuroscience, 17(11):4302–4311, June 1997. ISSN 0270-6474. doi: 10.1523/JNEUROSCI.17-11-04302.1997.
  2. A Cortical Region Consisting Entirely of Face-Selective Cells. Science, 311(5761):670–674, February 2006. ISSN 0036-8075, 1095-9203. doi: 10.1126/science.1119983.
  3. A Cortical Area Selective for Visual Processing of the Human Body. Science, 293(5539):2470–2473, September 2001. doi: 10.1126/science.1063414.
  4. Extrastriate body area in human occipital cortex responds to the performance of motor actions. Nature Neuroscience, 7(5):542–548, May 2004. ISSN 1097-6256. doi: 10.1038/nn1241.
  5. Rufin Vogels. More Than the Face: Representations of Bodies in the Inferior Temporal Cortex. Annual Review of Vision Science, 8(1):383–405, 2022. doi: 10.1146/annurev-vision-100720-113429.
  6. The neural code for “face cells” is not face-specific. Science Advances, 9(35):eadg1736, September 2023. ISSN 2375-2548. doi: 10.1126/sciadv.adg1736.
  7. Face neurons encode nonsemantic features. Proceedings of the National Academy of Sciences, 119(16):e2118705119, April 2022. ISSN 0027-8424, 1091-6490. doi: 10.1073/pnas.2118705119.
  8. A map of object space in primate inferotemporal cortex. Nature, 583(7814):103–108, July 2020. ISSN 1476-4687. doi: 10.1038/s41586-020-2350-5.
  9. When the whole is only the parts: Non-holistic object parts predominate face-cell responses to illusory faces. bioRxiv, 2023.
  10. Rapid, concerted switching of the neural code in inferotemporal cortex. bioRxiv, pages 2023–12, 2023.
  11. Stimulus features coded by single neurons of a macaque body category selective patch. Proceedings of the National Academy of Sciences of the United States of America, 113(17):E2450–2459, April 2016. ISSN 1091-6490. doi: 10.1073/pnas.1520371113.
  12. Visualizing and Understanding Convolutional Networks, November 2013.
  13. RISE: Randomized Input Sampling for Explanation of Black-box Models, September 2018.
  14. Learning Deep Features for Discriminative Localization, December 2015.
  15. Grad-CAM: Visual Explanations from Deep Networks via Gradient-based Localization. International Journal of Computer Vision, 128(2):336–359, February 2020. ISSN 0920-5691, 1573-1405. doi: 10.1007/s11263-019-01228-7.
  16. Deep Inside Convolutional Networks: Visualising Image Classification Models and Saliency Maps, April 2014.
  17. Striving for Simplicity: The All Convolutional Net, April 2015.
  18. Axiomatic Attribution for Deep Networks, June 2017.
  19. Sanity Checks for Saliency Maps. In Neural Information Processing Systems, October 2018.
  20. Towards Better Understanding Attribution Methods. In 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 10213–10222, June 2022. doi: 10.1109/CVPR52688.2022.00998.
  21. Visualizing Deep Similarity Networks, January 2019.
  22. Performance-optimized hierarchical models predict neural responses in higher visual cortex. Proceedings of the National Academy of Sciences, 111(23):8619–8624, 2014. doi: 10.1073/pnas.1403112111.
  23. Deep convolutional models improve predictions of macaque V1 responses to natural images. PLOS Computational Biology, 15(4):1–27, April 2019. doi: 10.1371/journal.pcbi.1006897.
  24. A Simple Framework for Contrastive Learning of Visual Representations. https://arxiv.org/abs/2002.05709v3, February 2020.
  25. Deep residual learning for image recognition. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 770–778, 2015.
  26. ImageNet: A large-scale hierarchical image database. In 2009 IEEE Conference on Computer Vision and Pattern Recognition, pages 248–255, 2009. doi: 10.1109/CVPR.2009.5206848.
  27. Do adversarially robust ImageNet models transfer better?, 2020.
  28. Deep learning-driven characterization of single cell tuning in primate visual area V4 unveils topological organization. bioRxiv : the preprint server for biology, 2023. doi: 10.1101/2023.05.12.540591.
  29. Towards robust vision by multi-task learning on monkey visual cortex, 2021.
  30. Model metamers illuminate divergences between biological and artificial neural networks. bioRxiv : the preprint server for biology, 2023. doi: 10.1101/2022.05.19.492678.
  31. Generalization in data-driven models of primary visual cortex. bioRxiv : the preprint server for biology, 2020. doi: 10.1101/2020.10.05.326256.
  32. Simultaneous recordings from posterior and anterior body-responsive regions in the macaque Superior Temporal Sulcus. Journal of Vision, 23(9):5403, August 2023. ISSN 1534-7362. doi: 10.1167/jov.23.9.5403.
  33. The Open Images Dataset V4: Unified image classification, object detection, and visual relationship detection at scale. International Journal of Computer Vision, 128(7):1956–1981, July 2020. ISSN 0920-5691, 1573-1405. doi: 10.1007/s11263-020-01316-z.
  34. Visual long-term memory has a massive storage capacity for object details. Proceedings of the National Academy of Sciences of the United States of America, 105(38):14325–14329, September 2008. ISSN 1091-6490. doi: 10.1073/pnas.0803390105.
  35. Micro-Valences: Perceiving Affective Valence in Everyday Objects. Frontiers in Psychology, 3:107, April 2012. ISSN 1664-1078. doi: 10.3389/fpsyg.2012.00107.
  36. Adam: A Method for Stochastic Optimization, January 2017.
  37. Tolerance of Macaque Middle STS Body Patch Neurons to Shape-preserving Stimulus Transformations. Journal of Cognitive Neuroscience, 27(5):1001–1016, May 2015. ISSN 0898-929X. doi: 10.1162/jocn_a_00762.
  38. PyTorch: An Imperative Style, High-Performance Deep Learning Library, December 2019.
  39. Exploring patterns enriched in a dataset with contrastive principal component analysis. Nature communications, 9(1):2134, 2018.
User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (7)
  1. Alexander Lappe (3 papers)
  2. Anna Bognár (1 paper)
  3. Ghazaleh Ghamkhari Nejad (1 paper)
  4. Albert Mukovskiy (1 paper)
  5. Lucas Martini (1 paper)
  6. Martin A. Giese (6 papers)
  7. Rufin Vogels (1 paper)

Summary

We haven't generated a summary for this paper yet.

X Twitter Logo Streamline Icon: https://streamlinehq.com