Gaze Perception in Humans and CNN-Based Model

04/17/2021
by   Nicole X. Han, et al.
0

Making accurate inferences about other individuals' locus of attention is essential for human social interactions and will be important for AI to effectively interact with humans. In this study, we compare how a CNN (convolutional neural network) based model of gaze and humans infer the locus of attention in images of real-world scenes with a number of individuals looking at a common location. We show that compared to the model, humans' estimates of the locus of attention are more influenced by the context of the scene, such as the presence of the attended target and the number of individuals in the image.

READ FULL TEXT

page 2

page 4

page 5

page 7

research
04/10/2018

Discovery and usage of joint attention in images

Joint visual attention is characterized by two or more individuals looki...
research
07/25/2023

Do humans and Convolutional Neural Networks attend to similar areas during scene classification: Effects of task and image type

Deep Learning models like Convolutional Neural Networks (CNN) are powerf...
research
11/29/2016

Measuring and modeling the perception of natural and unconstrained gaze in humans and machines

Humans are remarkably adept at interpreting the gaze direction of other ...
research
12/18/2017

Guiding human gaze with convolutional neural networks

The eye fixation patterns of human observers are a fundamental indicator...
research
11/02/2021

Human Attention in Fine-grained Classification

The way humans attend to, process and classify a given image has the pot...
research
03/05/2020

Detecting Attended Visual Targets in Video

We address the problem of detecting attention targets in video. Specific...
research
08/02/2022

Can Gaze Beat Touch? A Fitts' Law Evaluation of Gaze, Touch, and Mouse Inputs

Gaze input has been a promising substitute for mouse input for point and...

Please sign up or login with your details

Forgot password? Click here to reset