Face-to-Face Contrastive Learning for Social Intelligence Question-Answering

07/29/2022
by   Alex Wilf, et al.
7

Creating artificial social intelligence - algorithms that can understand the nuances of multi-person interactions - is an exciting and emerging challenge in processing facial expressions and gestures from multimodal videos. Recent multimodal methods have set the state of the art on many tasks, but have difficulty modeling the complex face-to-face conversational dynamics across speaking turns in social interaction, particularly in a self-supervised setup. In this paper, we propose Face-to-Face Contrastive Learning (F2F-CL), a graph neural network designed to model social interactions using factorization nodes to contextualize the multimodal face-to-face interaction along the boundaries of the speaking turn. With the F2F-CL model, we propose to perform contrastive learning between the factorization nodes of different speaking turns within the same video. We experimentally evaluated the challenging Social-IQ dataset and show state-of-the-art results.

READ FULL TEXT
research
01/29/2019

Guidelines for creating man-machine multimodal interfaces

Understanding details of human multimodal interaction can elucidate many...
research
03/24/2022

Self-supervised Video-centralised Transformer for Video Face Clustering

This paper presents a novel method for face clustering in videos using a...
research
11/24/2022

On the Linguistic and Computational Requirements for Creating Face-to-Face Multimodal Human-Machine Interaction

In this study, conversations between humans and avatars are linguistical...
research
11/28/2018

The validity of RFID badges measuring face-to-face interactions

Face-to-face interactions are important for a variety of individual beha...
research
11/24/2017

Self-Supervised Vision-Based Detection of the Active Speaker as a Prerequisite for Socially-Aware Language Acquisition

This paper presents a self-supervised method for detecting the active sp...
research
06/10/2019

Towards Social Artificial Intelligence: Nonverbal Social Signal Prediction in A Triadic Interaction

We present a new research task and a dataset to understand human social ...
research
07/28/2023

When to generate hedges in peer-tutoring interactions

This paper explores the application of machine learning techniques to pr...

Please sign up or login with your details

Forgot password? Click here to reset