Detecting Attended Visual Targets in Video

03/05/2020
by   Eunji Chong, et al.
0

We address the problem of detecting attention targets in video. Specifically, our goal is to identify where each person in each frame of a video is looking, and correctly handle the out-of-frame case. Our novel architecture effectively models the dynamic interaction between the scene and head features in order to infer time-varying attention targets. We introduce a new dataset, VideoAttentionTarget, consisting of fully-annotated video clips containing complex and dynamic patterns of real-world gaze behavior. Experiments on this dataset show that our model can effectively infer attention in videos. To further demonstrate the utility of our approach, we apply our predicted attention maps to two social gaze behavior recognition tasks, and show that the resulting classifiers significantly outperform existing methods. We achieve state-of-the-art performance on three datasets: GazeFollow (static images), VideoAttentionTarget (videos), and VideoCoAtt (videos), and obtain the first results for automatically classifying clinically-relevant gaze behavior without wearable cameras or eye trackers.

READ FULL TEXT

page 1

page 3

page 4

page 6

page 7

page 8

research
06/10/2020

A gaze driven fast-forward method for first-person videos

The growing data sharing and life-logging cultures are driving an unprec...
research
10/05/2022

Learning Video-independent Eye Contact Segmentation from In-the-Wild Videos

Human eye contact is a form of non-verbal communication and can have a g...
research
06/12/2019

LAEO-Net: revisiting people Looking At Each Other in videos

Capturing the `mutual gaze' of people is essential for understanding and...
research
08/08/2022

Where Are You Looking?: A Large-Scale Dataset of Head and Gaze Behavior for 360-Degree Videos and a Pilot Study

360 videos in recent years have experienced booming development. Compare...
research
11/29/2016

Social Behavior Prediction from First Person Videos

This paper presents a method to predict the future movements (location a...
research
11/21/2018

Learning to Attend Relevant Regions in Videos from Eye Fixations

Attentively important objects in videos account for a majority part of s...
research
04/17/2021

Gaze Perception in Humans and CNN-Based Model

Making accurate inferences about other individuals' locus of attention i...

Please sign up or login with your details

Forgot password? Click here to reset