Target-absent Human Attention

07/04/2022
by   Zhibo Yang, et al.
0

The prediction of human gaze behavior is important for building human-computer interactive systems that can anticipate a user's attention. Computer vision models have been developed to predict the fixations made by people as they search for target objects. But what about when the image has no target? Equally important is to know how people search when they cannot find a target, and when they would stop searching. In this paper, we propose the first data-driven computational model that addresses the search-termination problem and predicts the scanpath of search fixations made by people searching for targets that do not appear in images. We model visual search as an imitation learning problem and represent the internal knowledge that the viewer acquires through fixations using a novel state representation that we call Foveated Feature Maps (FFMs). FFMs integrate a simulated foveated retina into a pretrained ConvNet that produces an in-network feature pyramid, all with minimal computational overhead. Our method integrates FFMs as the state representation in inverse reinforcement learning. Experimentally, we improve the state of the art in predicting human target-absent search behavior on the COCO-Search18 dataset

READ FULL TEXT
research
05/28/2020

Predicting Goal-directed Human Attention Using Inverse Reinforcement Learning

Being able to predict human gaze behavior has obvious importance for beh...
research
02/18/2015

Prediction of Search Targets From Fixations in Open-World Settings

Previous work on predicting the target of visual search from human fixat...
research
03/16/2023

Predicting Human Attention using Computational Attention

Most models of visual attention are aimed at predicting either top-down ...
research
01/31/2020

Predicting Goal-directed Attention Control Using Inverse-Reinforcement Learning

Understanding how goal states control behavior is a question ripe for in...
research
10/27/2022

Predicting Visual Attention and Distraction During Visual Search Using Convolutional Neural Networks

Most studies in computational modeling of visual attention encompass tas...
research
12/08/2014

When Computer Vision Gazes at Cognition

Joint attention is a core, early-developing form of social interaction. ...
research
11/23/2018

Learning to attend in a brain-inspired deep neural network

Recent machine learning models have shown that including attention as a ...

Please sign up or login with your details

Forgot password? Click here to reset