A Modular Multimodal Architecture for Gaze Target Prediction: Application to Privacy-Sensitive Settings

07/11/2023
by   Anshul Gupta, et al.
0

Predicting where a person is looking is a complex task, requiring to understand not only the person's gaze and scene content, but also the 3D scene structure and the person's situation (are they manipulating? interacting or observing others? attentive?) to detect obstructions in the line of sight or apply attention priors that humans typically have when observing others. In this paper, we hypothesize that identifying and leveraging such priors can be better achieved through the exploitation of explicitly derived multimodal cues such as depth and pose. We thus propose a modular multimodal architecture allowing to combine these cues using an attention mechanism. The architecture can naturally be exploited in privacy-sensitive situations such as surveillance and health, where personally identifiable information cannot be released. We perform extensive experiments on the GazeFollow and VideoAttentionTarget public datasets, obtaining state-of-the-art performance and demonstrating very competitive results in the privacy setting case.

READ FULL TEXT
research
08/23/2022

Multimodal Across Domains Gaze Target Detection

This paper addresses the gaze target detection problem in single images ...
research
07/04/2023

ChildPlay: A New Benchmark for Understanding Children's Gaze Behaviour

Gaze behaviors such as eye-contact or shared attention are important mar...
research
10/10/2021

FLAME: Facial Landmark Heatmap Activated Multimodal Gaze Estimation

3D gaze estimation is about predicting the line of sight of a person in ...
research
12/09/2016

Following Gaze Across Views

Following the gaze of people inside videos is an important signal for un...
research
03/13/2023

Contextually-rich human affect perception using multimodal scene information

The process of human affect understanding involves the ability to infer ...
research
11/30/2020

Detecting expressions with multimodal transformers

Developing machine learning algorithms to understand person-to-person en...

Please sign up or login with your details

Forgot password? Click here to reset