A Self Validation Network for Object-Level Human Attention Estimation

10/31/2019
by   Zehua Zhang, et al.

Due to the foveated nature of the human visual system, people can focus their visual attention on only a small region of their visual field at a time, and that region usually contains a single object. Estimating this object of attention in first-person (egocentric) videos is useful for many human-centered real-world applications, such as augmented reality and driver assistance systems. A straightforward solution is to pick the object whose bounding box is hit by the gaze, where the gaze point comes from a traditional eye gaze estimator and the object candidates come from an off-the-shelf object detector. However, such an approach can fail because it addresses the where and the what problems separately, even though they are closely related chicken-and-egg problems. In this paper, we propose a novel unified model that incorporates both spatial and temporal evidence to identify and locate the attended object in first-person videos. It introduces a novel Self Validation Module that enforces and leverages consistency between the where and the what concepts. We evaluate on two public datasets, demonstrating that the Self Validation Module significantly benefits both training and testing and that our model outperforms the state of the art.
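
The straightforward baseline mentioned in the abstract (pick the object whose bounding box is hit by the gaze) can be sketched roughly as follows. This is a minimal illustration only, not the proposed model or the authors' code: the `Detection` type, the `pick_attended_object` function, the pixel-coordinate box format, and the smallest-box tie-break for overlapping hits are all assumptions made for the example. The gaze point and the detections are assumed to come from separate, pre-existing estimators.

```python
from dataclasses import dataclass
from typing import Optional, Sequence, Tuple


@dataclass(frozen=True)
class Detection:
    """Hypothetical object candidate: a label and an axis-aligned box in pixels."""
    label: str
    x_min: float
    y_min: float
    x_max: float
    y_max: float

    def contains(self, point: Tuple[float, float]) -> bool:
        """Return True if the point lies inside the box (edges included)."""
        x, y = point
        return self.x_min <= x <= self.x_max and self.y_min <= y <= self.y_max

    def area(self) -> float:
        """Box area; degenerate boxes get area 0."""
        return max(0.0, self.x_max - self.x_min) * max(0.0, self.y_max - self.y_min)


def pick_attended_object(
    gaze: Tuple[float, float],
    detections: Sequence[Detection],
) -> Optional[Detection]:
    """Baseline: return a detection whose box is hit by the gaze point.

    Assumption for this sketch: when several boxes contain the gaze point,
    the smallest one is chosen. Returns None when no box is hit.
    """
    hits = [d for d in detections if d.contains(gaze)]
    if not hits:
        return None
    return min(hits, key=Detection.area)
```

Because the gaze estimate and the detections are computed independently here, a small gaze error near a box boundary, or a missed detection, changes the answer outright. That is the separation of the where and the what problems that the abstract identifies as the weakness of this approach.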


research
05/03/2022

Retail Gaze: A Dataset for Gaze Estimation in Retail Environments

The concept of gaze object estimation predicts a bounding box that a per...
research
08/04/2022

RAZE: Region Guided Self-Supervised Gaze Representation Learning

Automatic eye gaze estimation is an important problem in vision based as...
research
01/04/2018

Object Referring in Videos with Language and Human Gaze

We investigate the problem of object referring (OR) i.e. to localize a t...
research
08/08/2022

Gaze Estimation Approach Using Deep Differential Residual Network

Gaze estimation, which is a method to determine where a person is lookin...
research
07/16/2015

Driver Gaze Region Estimation Without Using Eye Movement

Automated estimation of the allocation of a driver's visual attention ma...
research
08/25/2020

On estimating gaze by self-attention augmented convolutions

Estimation of 3D gaze is highly relevant to multiple fields, including b...
research
03/24/2019

Periphery-Fovea Multi-Resolution Driving Model guided by Human Attention

Inspired by human vision, we propose a new periphery-fovea multi-resolut...
