Do humans and Convolutional Neural Networks attend to similar areas during scene classification: Effects of task and image type

07/25/2023
by   Romy Müller, et al.
0

Deep Learning models like Convolutional Neural Networks (CNN) are powerful image classifiers, but what factors determine whether they attend to similar image areas as humans do? While previous studies have focused on technological factors, little is known about the role of factors that affect human attention. In the present study, we investigated how the tasks used to elicit human attention maps interact with image characteristics in modulating the similarity between humans and CNN. We varied the intentionality of human tasks, ranging from spontaneous gaze during categorization over intentional gaze-pointing up to manual area selection. Moreover, we varied the type of image to be categorized, using either singular, salient objects, indoor scenes consisting of object arrangements, or landscapes without distinct objects defining the category. The human attention maps generated in this way were compared to the CNN attention maps revealed by explainable artificial intelligence (Grad-CAM). The influence of human tasks strongly depended on image type: For objects, human manual selection produced maps that were most similar to CNN, while the specific eye movement task has little impact. For indoor scenes, spontaneous gaze produced the least similarity, while for landscapes, similarity was equally low across all human tasks. To better understand these results, we also compared the different human attention maps to each other. Our results highlight the importance of taking human factors into account when comparing the attention of humans and CNN.

READ FULL TEXT

page 9

page 14

page 15

page 20

page 24

research
04/17/2021

Gaze Perception in Humans and CNN-Based Model

Making accurate inferences about other individuals' locus of attention i...
research
03/31/2020

Deep semantic gaze embedding and scanpath comparison for expertise classification during OPT viewing

Modeling eye movement indicative of expertise behavior is decisive in us...
research
12/02/2020

Attention-gating for improved radio galaxy classification

In this work we introduce attention as a state of the art mechanism for ...
research
01/30/2021

Matching Representations of Explainable Artificial Intelligence and Eye Gaze for Human-Machine Interaction

Rapid non-verbal communication of task-based stimuli is a challenge in h...
research
12/18/2017

Guiding human gaze with convolutional neural networks

The eye fixation patterns of human observers are a fundamental indicator...
research
05/07/2022

Ultra-fast image categorization in vivo and in silico

Humans are able to robustly categorize images and can, for instance, det...
research
03/05/2018

Totally Looks Like - How Humans Compare, Compared to Machines

Perceptual judgment of image similarity by humans relies on a rich inter...

Please sign up or login with your details

Forgot password? Click here to reset