Comparing object recognition in humans and deep convolutional neural networks – An eye tracking study

07/30/2021
by   Leonard E. van Dyck, et al.
9

Deep convolutional neural networks (DCNNs) and the ventral visual pathway share vast architectural and functional similarities in visual challenges such as object recognition. Recent insights have demonstrated that both hierarchical cascades can be compared in terms of both exerted behavior and underlying activation. However, these approaches ignore key differences in spatial priorities of information processing. In this proof-of-concept study, we demonstrate a comparison of human observers (N = 45) and three feedforward DCNNs through eye tracking and saliency maps. The results reveal fundamentally different resolutions in both visualization methods that need to be considered for an insightful comparison. Moreover, we provide evidence that a DCNN with biologically plausible receptive field sizes called vNet reveals higher agreement with human viewing behavior as contrasted with a standard ResNet architecture. We find that image-specific factors such as category, animacy, arousal, and valence have a direct link to the agreement of spatial object recognition priorities in humans and DCNNs, while other measures such as difficulty and general image properties do not. With this approach, we try to open up new perspectives at the intersection of biological and computer vision research.

READ FULL TEXT

page 3

page 6

page 8

page 9

page 10

page 15

research
06/21/2022

Guiding Visual Attention in Deep Convolutional Neural Networks Based on Human Eye Movements

Deep Convolutional Neural Networks (DCNNs) were originally inspired by p...
research
01/10/2017

What are the visual features underlying human versus machine vision?

Although Deep Convolutional Networks (DCNs) are approaching the accuracy...
research
08/10/2021

Understanding Character Recognition using Visual Explanations Derived from the Human Visual System and Deep Networks

Human observers engage in selective information uptake when classifying ...
research
07/31/2017

Capacity limitations of visual search in deep convolutional neural network

Deep convolutional neural networks follow roughly the architecture of bi...
research
08/18/2023

End-to-end topographic networks as models of cortical map formation and human visual behaviour: moving beyond convolutions

Computational models are an essential tool for understanding the origin ...
research
06/26/2017

Do Deep Neural Networks Suffer from Crowding?

Crowding is a visual effect suffered by humans, in which an object that ...

Please sign up or login with your details

Forgot password? Click here to reset