
DANTE: Deep Affinity Network for Clustering Conversational Interactants

by Mason Swofford, et al.
Yale University
Stanford University

We propose a data-driven approach to visually detect conversational groups by identifying spatial arrangements typical of these focused social encounters. Our approach uses a novel Deep Affinity Network (DANTE) to predict the likelihood that two individuals in a scene are part of the same conversational group, considering contextual information such as the position and orientation of other nearby individuals. The predicted pairwise affinities are then used in a graph clustering framework to identify both small groups (e.g., dyads) and larger ones. The results of our evaluation on two standard benchmarks suggest that combining powerful deep learning methods with classical clustering techniques can improve the detection of conversational groups in comparison to prior approaches. Our technique has a wide range of applications, from visual scene understanding (e.g., for surveillance) to social robotics.
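
To make the two-stage idea concrete, here is a minimal sketch of how pairwise affinities could be turned into groups. It is not the paper's clustering method: the abstract only says "a graph clustering framework", so this sketch swaps in a much simpler stand-in (threshold the affinities, then take connected components). The function name `cluster_by_affinity`, the `threshold` value, and the affinity numbers are all hypothetical and chosen only for illustration; in the actual approach the affinities would come from the trained network.

```python
def cluster_by_affinity(affinity, threshold=0.5):
    """Group individuals from a square matrix of pairwise affinities.

    Assumption (not from the paper): two people are linked when their
    symmetrized affinity is >= threshold, and groups are the connected
    components of the resulting graph.
    """
    n = len(affinity)
    labels = [-1] * n          # -1 means "not yet assigned to a group"
    groups = []
    for start in range(n):
        if labels[start] != -1:
            continue
        group_id = len(groups)
        labels[start] = group_id
        stack = [start]
        members = []
        # Depth-first traversal over sufficiently strong links.
        while stack:
            i = stack.pop()
            members.append(i)
            for j in range(n):
                if j == i or labels[j] != -1:
                    continue
                # Average both directions, since predicted affinities
                # need not be symmetric.
                score = (affinity[i][j] + affinity[j][i]) / 2.0
                if score >= threshold:
                    labels[j] = group_id
                    stack.append(j)
        groups.append(sorted(members))
    return groups


# Hypothetical affinities for five people: a dyad (0, 1) and a triad (2, 3, 4).
affinity = [
    [1.0, 0.9, 0.1, 0.2, 0.1],
    [0.8, 1.0, 0.2, 0.1, 0.1],
    [0.1, 0.2, 1.0, 0.7, 0.9],
    [0.2, 0.1, 0.8, 1.0, 0.6],
    [0.1, 0.1, 0.9, 0.7, 1.0],
]
groups = cluster_by_affinity(affinity)
print(groups)
```

Connected components are easy to reason about but will merge two groups if even one cross-group pair scores above the threshold, which is one reason a more robust graph clustering step is attractive in practice.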



