Parallel Backpropagation for Shared-Feature Visualization (2405.09827v2)
Abstract: High-level visual brain regions contain subareas in which neurons appear to respond more strongly to examples of a particular semantic category, like faces or bodies, rather than objects. However, recent work has shown that while this finding holds on average, some out-of-category stimuli also activate neurons in these regions. This may be due to visual features common among the preferred class also being present in other images. Here, we propose a deep-learning-based approach for visualizing these features. For each neuron, we identify relevant visual features driving its selectivity by modelling responses to images based on latent activations of a deep neural network. Given an out-of-category image which strongly activates the neuron, our method first identifies a reference image from the preferred category yielding a similar feature activation pattern. We then backpropagate latent activations of both images to the pixel level, while enhancing the identified shared dimensions and attenuating non-shared features. The procedure highlights image regions containing shared features driving responses of the model neuron. We apply the algorithm to novel recordings from body-selective regions in macaque IT cortex in order to understand why some images of objects excite these neurons. Visualizations reveal object parts which resemble parts of a macaque body, shedding light on neural preference of these objects.
- The Fusiform Face Area: A Module in Human Extrastriate Cortex Specialized for Face Perception. The Journal of Neuroscience, 17(11):4302–4311, June 1997. ISSN 0270-6474. doi: 10.1523/JNEUROSCI.17-11-04302.1997.
- A Cortical Region Consisting Entirely of Face-Selective Cells. Science, 311(5761):670–674, February 2006. ISSN 0036-8075, 1095-9203. doi: 10.1126/science.1119983.
- A Cortical Area Selective for Visual Processing of the Human Body. Science, 293(5539):2470–2473, September 2001. doi: 10.1126/science.1063414.
- Extrastriate body area in human occipital cortex responds to the performance of motor actions. Nature Neuroscience, 7(5):542–548, May 2004. ISSN 1097-6256. doi: 10.1038/nn1241.
- Rufin Vogels. More Than the Face: Representations of Bodies in the Inferior Temporal Cortex. Annual Review of Vision Science, 8(1):383–405, 2022. doi: 10.1146/annurev-vision-100720-113429.
- The neural code for “face cells” is not face-specific. Science Advances, 9(35):eadg1736, September 2023. ISSN 2375-2548. doi: 10.1126/sciadv.adg1736.
- Face neurons encode nonsemantic features. Proceedings of the National Academy of Sciences, 119(16):e2118705119, April 2022. ISSN 0027-8424, 1091-6490. doi: 10.1073/pnas.2118705119.
- A map of object space in primate inferotemporal cortex. Nature, 583(7814):103–108, July 2020. ISSN 1476-4687. doi: 10.1038/s41586-020-2350-5.
- When the whole is only the parts: Non-holistic object parts predominate face-cell responses to illusory faces. bioRxiv, 2023.
- Rapid, concerted switching of the neural code in inferotemporal cortex. bioRxiv, pages 2023–12, 2023.
- Stimulus features coded by single neurons of a macaque body category selective patch. Proceedings of the National Academy of Sciences of the United States of America, 113(17):E2450–2459, April 2016. ISSN 1091-6490. doi: 10.1073/pnas.1520371113.
- Visualizing and Understanding Convolutional Networks, November 2013.
- RISE: Randomized Input Sampling for Explanation of Black-box Models, September 2018.
- Learning Deep Features for Discriminative Localization, December 2015.
- Grad-CAM: Visual Explanations from Deep Networks via Gradient-based Localization. International Journal of Computer Vision, 128(2):336–359, February 2020. ISSN 0920-5691, 1573-1405. doi: 10.1007/s11263-019-01228-7.
- Deep Inside Convolutional Networks: Visualising Image Classification Models and Saliency Maps, April 2014.
- Striving for Simplicity: The All Convolutional Net, April 2015.
- Axiomatic Attribution for Deep Networks, June 2017.
- Sanity Checks for Saliency Maps. In Neural Information Processing Systems, October 2018.
- Towards Better Understanding Attribution Methods. In 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 10213–10222, June 2022. doi: 10.1109/CVPR52688.2022.00998.
- Visualizing Deep Similarity Networks, January 2019.
- Performance-optimized hierarchical models predict neural responses in higher visual cortex. Proceedings of the National Academy of Sciences, 111(23):8619–8624, 2014. doi: 10.1073/pnas.1403112111.
- Deep convolutional models improve predictions of macaque V1 responses to natural images. PLOS Computational Biology, 15(4):1–27, April 2019. doi: 10.1371/journal.pcbi.1006897.
- A Simple Framework for Contrastive Learning of Visual Representations. https://arxiv.org/abs/2002.05709v3, February 2020.
- Deep residual learning for image recognition. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 770–778, 2015.
- ImageNet: A large-scale hierarchical image database. In 2009 IEEE Conference on Computer Vision and Pattern Recognition, pages 248–255, 2009. doi: 10.1109/CVPR.2009.5206848.
- Do adversarially robust ImageNet models transfer better?, 2020.
- Deep learning-driven characterization of single cell tuning in primate visual area V4 unveils topological organization. bioRxiv : the preprint server for biology, 2023. doi: 10.1101/2023.05.12.540591.
- Towards robust vision by multi-task learning on monkey visual cortex, 2021.
- Model metamers illuminate divergences between biological and artificial neural networks. bioRxiv : the preprint server for biology, 2023. doi: 10.1101/2022.05.19.492678.
- Generalization in data-driven models of primary visual cortex. bioRxiv : the preprint server for biology, 2020. doi: 10.1101/2020.10.05.326256.
- Simultaneous recordings from posterior and anterior body-responsive regions in the macaque Superior Temporal Sulcus. Journal of Vision, 23(9):5403, August 2023. ISSN 1534-7362. doi: 10.1167/jov.23.9.5403.
- The Open Images Dataset V4: Unified image classification, object detection, and visual relationship detection at scale. International Journal of Computer Vision, 128(7):1956–1981, July 2020. ISSN 0920-5691, 1573-1405. doi: 10.1007/s11263-020-01316-z.
- Visual long-term memory has a massive storage capacity for object details. Proceedings of the National Academy of Sciences of the United States of America, 105(38):14325–14329, September 2008. ISSN 1091-6490. doi: 10.1073/pnas.0803390105.
- Micro-Valences: Perceiving Affective Valence in Everyday Objects. Frontiers in Psychology, 3:107, April 2012. ISSN 1664-1078. doi: 10.3389/fpsyg.2012.00107.
- Adam: A Method for Stochastic Optimization, January 2017.
- Tolerance of Macaque Middle STS Body Patch Neurons to Shape-preserving Stimulus Transformations. Journal of Cognitive Neuroscience, 27(5):1001–1016, May 2015. ISSN 0898-929X. doi: 10.1162/jocn_a_00762.
- PyTorch: An Imperative Style, High-Performance Deep Learning Library, December 2019.
- Exploring patterns enriched in a dataset with contrastive principal component analysis. Nature communications, 9(1):2134, 2018.
- Alexander Lappe (3 papers)
- Anna Bognár (1 paper)
- Ghazaleh Ghamkhari Nejad (1 paper)
- Albert Mukovskiy (1 paper)
- Lucas Martini (1 paper)
- Martin A. Giese (6 papers)
- Rufin Vogels (1 paper)