CT-2.5

Humans and CNNs see differently: Action affordances are represented in scene-selective visual cortex but not CNNs

Clemens G. Bartnik, Iris I.A. Groen, University of Amsterdam, Netherlands

Session:
Contributed Talks 2 Lecture

Track:
Cognitive science

Location:
South Schools / East Schools

Presentation Time:
Sat, 26 Aug, 17:30 - 17:45 United Kingdom Time

Abstract:
To navigate the immediate visual environment, humans use a variety of locomotive actions, such as walking, swimming or climbing. How does the brain represent such environmental action affordances and which visual features drive these representations? Here, we compared representations of visual properties derived from human annotations, fMRI measurements, and convolutional neural networks (CNNs) on a new set of real-world scenes that afford distinct locomotive actions in a diverse set of indoor and outdoor environments. Representational similarity analysis shows that scene-selective brain regions represent information about action affordances as well as materials and objects. In contrast, CNNs trained on scene classification show comparatively lower correlation with action affordances, instead most strongly representing global scene properties. Together, these results suggest that specialized models that incorporate action affordances may be needed to fully capture representations in scene-selective visual cortex.

Manuscript:
License:
Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 Unported License.
DOI:
10.32470/CCN.2023.1598-0
Publication:
2023 Conference on Cognitive Computational Neuroscience
Presentation
Discussion
Resources
No resources available.
Session CT-2
CT-2.1: Mental Imagery: Weak Vision or Compressed Vision?
Tiasha Saha Roy, Jesse Breedlove, Ghislain St-Yves, Kendrick Kay, Thomas Naselaris, University of Minnesota, United States
CT-2.2: Leveraging Artificial Neural Networks to Enhance Diagnostic Efficiency in Autism Spectrum Disorder: A Study on Facial Emotion Recognition
Kushin Mukherjee, University of Wisconsin-Madison, United States; Na Yeon Kim, California Institute of Technology, United States; Shirin Taghian Alamooti, York University, Canada; Ralph Adolphs, California Institite of Technology, United States; Kohitij Kar, York University, Canada
CT-2.3: Dropout as a tool for understanding information distribution in human and machine visual systems
Jacob S. Prince, Harvard University, United States; Gabriel Fajardo, Boston College, United States; George A. Alvarez, Talia Konkle, Harvard University, United States
CT-2.4: Humans and 3D neural field models make similar 3D shape judgements
Thomas O'Connell, MIT, United States; Tyler Bonnen, Stanford University, United States; Yoni Friedman, Ayush Tewari, Josh Tenenbaum, Vincent Sitzmann, Nancy Kanwisher, MIT, United States
CT-2.5: Humans and CNNs see differently: Action affordances are represented in scene-selective visual cortex but not CNNs
Clemens G. Bartnik, Iris I.A. Groen, University of Amsterdam, Netherlands
CT-2.6: Beyond Geometry: Comparing the Temporal Structure of Computation in Neural Circuits with Dynamic Mode Representational Similarity Analysis
Mitchell Ostrow, Adam Eisen, Leo Kozachkov, Ila Fiete, Massachusetts Institute of Technology, United States