Deep Problems with Neural Network Models of Human Vision

Jeffrey S Bowers; Gaurav Malhotra; Marin Dujmović; Milton Llera Montero; Christian Tsvetkov; Valerio Biscione; Guillermo Puebla; Federico Adolfi; John E Hummel; Rachel F Heaton; Benjamin D Evans; Jeffrey Mitchell; Ryan Blything

doi:10.1017/S0140525X22002813

Deep Problems with Neural Network Models of Human Vision

Jeffrey S Bowers, Gaurav Malhotra, Marin Dujmović, Milton Llera Montero, Christian Tsvetkov, Valerio Biscione, Guillermo Puebla, Federico Adolfi, John E Hummel, Rachel F Heaton, Benjamin D Evans, Jeffrey Mitchell, Ryan Blything

Research output: Contribution to journal › Article › peer-review

Abstract

Deep neural networks (DNNs) have had extraordinary successes in classifying photographic images of objects and are often described as the best models of biological vision. This conclusion is largely based on three sets of findings: (1) DNNs are more accurate than any other model in classifying images taken from various datasets, (2) DNNs do the best job in predicting the pattern of human errors in classifying objects taken from various behavioral datasets, and (3) DNNs do the best job in predicting brain signals in response to images taken from various brain datasets (e.g., single cell responses or fMRI data). However, these behavioral and brain datasets do not test hypotheses regarding what features are contributing to good predictions and we show that the predictions may be mediated by DNNs that share little overlap with biological vision. More problematically, we show that DNNs account for almost no results from psychological research. This contradicts the common claim that DNNs are good, let alone the best, models of human object recognition. We argue that theorists interested in developing biologically plausible models of human vision need to direct their attention to explaining psychological findings. More generally, theorists need to build models that explain the results of experiments that manipulate independent variables designed to test hypotheses rather than compete on making the best predictions. We conclude by briefly summarizing various promising modelling approaches that focus on psychological data.

Original language	English
Journal	Behavioral and Brain Sciences
Early online date	1 Dec 2022
DOIs	https://doi.org/10.1017/S0140525X22002813
Publication status	E-pub ahead of print - 1 Dec 2022

Bibliographical note

Copyright © The Author(s), 2022. Published by Cambridge University Press. This accepted manuscript version is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License [https://creativecommons.org/licenses/by-nc-nd/4.0/]. The final publication is available via Cambridge Journals Online at https://doi.org/10.1017/S0140525X22002813. Funding: This project has received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (grant agreement No 741134).

Keywords

Computational Neuroscience
Object recognition
Deep Neural Networks
Brain-Score
Human Vision

Access to Document

10.1017/S0140525X22002813

Bowersetal_2022_AAM
Copyright © The Author(s), 2022. Published by Cambridge University Press. This accepted manuscript version is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License [https://creativecommons.org/licenses/by-nc-nd/4.0/]. The final publication is available via Cambridge Journals Online at https://doi.org/10.1017/S0140525X22002813./S0140525X22002813.
Accepted author manuscript, 1.4 MBLicence: CC BY-NC-ND 4.0

Cite this

@article{9aae07ed604f487680b7a216d0941b0e,

title = "Deep Problems with Neural Network Models of Human Vision",

abstract = "Deep neural networks (DNNs) have had extraordinary successes in classifying photographic images of objects and are often described as the best models of biological vision. This conclusion is largely based on three sets of findings: (1) DNNs are more accurate than any other model in classifying images taken from various datasets, (2) DNNs do the best job in predicting the pattern of human errors in classifying objects taken from various behavioral datasets, and (3) DNNs do the best job in predicting brain signals in response to images taken from various brain datasets (e.g., single cell responses or fMRI data). However, these behavioral and brain datasets do not test hypotheses regarding what features are contributing to good predictions and we show that the predictions may be mediated by DNNs that share little overlap with biological vision. More problematically, we show that DNNs account for almost no results from psychological research. This contradicts the common claim that DNNs are good, let alone the best, models of human object recognition. We argue that theorists interested in developing biologically plausible models of human vision need to direct their attention to explaining psychological findings. More generally, theorists need to build models that explain the results of experiments that manipulate independent variables designed to test hypotheses rather than compete on making the best predictions. We conclude by briefly summarizing various promising modelling approaches that focus on psychological data.",

keywords = "Computational Neuroscience, Object recognition, Deep Neural Networks, Brain-Score, Human Vision",

author = "Bowers, {Jeffrey S} and Gaurav Malhotra and Marin Dujmovi{\'c} and Montero, {Milton Llera} and Christian Tsvetkov and Valerio Biscione and Guillermo Puebla and Federico Adolfi and Hummel, {John E} and Heaton, {Rachel F} and Evans, {Benjamin D} and Jeffrey Mitchell and Ryan Blything",

note = "Copyright {\textcopyright} The Author(s), 2022. Published by Cambridge University Press. This accepted manuscript version is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License [https://creativecommons.org/licenses/by-nc-nd/4.0/]. The final publication is available via Cambridge Journals Online at https://doi.org/10.1017/S0140525X22002813. Funding: This project has received funding from the European Research Council (ERC) under the European Union{\textquoteright}s Horizon 2020 research and innovation programme (grant agreement No 741134).",

year = "2022",

month = dec,

day = "1",

doi = "10.1017/S0140525X22002813",

language = "English",

journal = "Behavioral and Brain Sciences",

issn = "0140-525X",

publisher = "Cambridge University Press",

}

TY - JOUR

T1 - Deep Problems with Neural Network Models of Human Vision

AU - Bowers, Jeffrey S

AU - Malhotra, Gaurav

AU - Dujmović, Marin

AU - Montero, Milton Llera

AU - Tsvetkov, Christian

AU - Biscione, Valerio

AU - Puebla, Guillermo

AU - Adolfi, Federico

AU - Hummel, John E

AU - Heaton, Rachel F

AU - Evans, Benjamin D

AU - Mitchell, Jeffrey

AU - Blything, Ryan

N1 - Copyright © The Author(s), 2022. Published by Cambridge University Press. This accepted manuscript version is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License [https://creativecommons.org/licenses/by-nc-nd/4.0/]. The final publication is available via Cambridge Journals Online at https://doi.org/10.1017/S0140525X22002813. Funding: This project has received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (grant agreement No 741134).

PY - 2022/12/1

Y1 - 2022/12/1

N2 - Deep neural networks (DNNs) have had extraordinary successes in classifying photographic images of objects and are often described as the best models of biological vision. This conclusion is largely based on three sets of findings: (1) DNNs are more accurate than any other model in classifying images taken from various datasets, (2) DNNs do the best job in predicting the pattern of human errors in classifying objects taken from various behavioral datasets, and (3) DNNs do the best job in predicting brain signals in response to images taken from various brain datasets (e.g., single cell responses or fMRI data). However, these behavioral and brain datasets do not test hypotheses regarding what features are contributing to good predictions and we show that the predictions may be mediated by DNNs that share little overlap with biological vision. More problematically, we show that DNNs account for almost no results from psychological research. This contradicts the common claim that DNNs are good, let alone the best, models of human object recognition. We argue that theorists interested in developing biologically plausible models of human vision need to direct their attention to explaining psychological findings. More generally, theorists need to build models that explain the results of experiments that manipulate independent variables designed to test hypotheses rather than compete on making the best predictions. We conclude by briefly summarizing various promising modelling approaches that focus on psychological data.

AB - Deep neural networks (DNNs) have had extraordinary successes in classifying photographic images of objects and are often described as the best models of biological vision. This conclusion is largely based on three sets of findings: (1) DNNs are more accurate than any other model in classifying images taken from various datasets, (2) DNNs do the best job in predicting the pattern of human errors in classifying objects taken from various behavioral datasets, and (3) DNNs do the best job in predicting brain signals in response to images taken from various brain datasets (e.g., single cell responses or fMRI data). However, these behavioral and brain datasets do not test hypotheses regarding what features are contributing to good predictions and we show that the predictions may be mediated by DNNs that share little overlap with biological vision. More problematically, we show that DNNs account for almost no results from psychological research. This contradicts the common claim that DNNs are good, let alone the best, models of human object recognition. We argue that theorists interested in developing biologically plausible models of human vision need to direct their attention to explaining psychological findings. More generally, theorists need to build models that explain the results of experiments that manipulate independent variables designed to test hypotheses rather than compete on making the best predictions. We conclude by briefly summarizing various promising modelling approaches that focus on psychological data.

KW - Computational Neuroscience

KW - Object recognition

KW - Deep Neural Networks

KW - Brain-Score

KW - Human Vision

UR - https://www.cambridge.org/core/journals/behavioral-and-brain-sciences/article/deep-problems-with-neural-network-models-of-human-vision/ABCE483EE95E80315058BB262DCA26A9

UR - http://www.scopus.com/inward/record.url?scp=85144985461&partnerID=8YFLogxK

U2 - 10.1017/S0140525X22002813

DO - 10.1017/S0140525X22002813

M3 - Article

C2 - 36453586

SN - 0140-525X

JO - Behavioral and Brain Sciences

JF - Behavioral and Brain Sciences

ER -

Deep Problems with Neural Network Models of Human Vision

Abstract

Bibliographical note

Keywords

Access to Document

Other files and links

Fingerprint

Clarifying status of DNNs as models of human vision

Cite this