Discovery and usage of joint attention in images

04/10/2018
by   Daniel Harari, et al.
0

Joint visual attention is characterized by two or more individuals looking at a common target at the same time. The ability to identify joint attention in scenes, the people involved, and their common target, is fundamental to the understanding of social interactions, including others' intentions and goals. In this work we deal with the extraction of joint attention events, and the use of such events for image descriptions. The work makes two novel contributions. First, our extraction algorithm is the first which identifies joint visual attention in single static images. It computes 3D gaze direction, identifies the gaze target by combining gaze direction with a 3D depth map computed for the image, and identifies the common gaze target. Second, we use a human study to demonstrate the sensitivity of humans to joint attention, suggesting that the detection of such a configuration in an image can be useful for understanding the image, including the goals of the agents and their joint activity, and therefore can contribute to image captioning and related tasks.

READ FULL TEXT

page 1

page 2

page 3

research
08/18/2016

Seeing with Humans: Gaze-Assisted Neural Image Captioning

Gaze reflects how humans process visual scenes and is therefore increasi...
research
04/17/2021

Gaze Perception in Humans and CNN-Based Model

Making accurate inferences about other individuals' locus of attention i...
research
12/08/2014

When Computer Vision Gazes at Cognition

Joint attention is a core, early-developing form of social interaction. ...
research
09/01/2021

From simple innate biases to complex visual concepts

Early in development, infants learn to solve visual problems that are hi...
research
11/29/2016

Measuring and modeling the perception of natural and unconstrained gaze in humans and machines

Humans are remarkably adept at interpreting the gaze direction of other ...
research
08/10/2023

Interaction-aware Joint Attention Estimation Using People Attributes

This paper proposes joint attention estimation in a single image. Differ...
research
01/12/2020

Attention Flow: End-to-End Joint Attention Estimation

This paper addresses the problem of understanding joint attention in thi...

Please sign up or login with your details

Forgot password? Click here to reset