BOSS: A Benchmark for Human Belief Prediction in Object-context Scenarios

06/21/2022
by   Jiafei Duan, et al.
0

Humans with an average level of social cognition can infer the beliefs of others based solely on the nonverbal communication signals (e.g. gaze, gesture, pose and contextual information) exhibited during social interactions. This social cognitive ability to predict human beliefs and intentions is more important than ever for ensuring safe human-robot interaction and collaboration. This paper uses the combined knowledge of Theory of Mind (ToM) and Object-Context Relations to investigate methods for enhancing collaboration between humans and autonomous systems in environments where verbal communication is prohibited. We propose a novel and challenging multimodal video dataset for assessing the capability of artificial intelligence (AI) systems in predicting human belief states in an object-context scenario. The proposed dataset consists of precise labelling of human belief state ground-truth and multimodal inputs replicating all nonverbal communication inputs captured by human perception. We further evaluate our dataset with existing deep learning models and provide new insights into the effects of the various input modalities and object-context relations on the performance of the baseline models.

READ FULL TEXT

page 3

page 5

research
04/07/2021

Learning Triadic Belief Dynamics in Nonverbal Communication from Videos

Humans possess a unique social cognition capability; nonverbal communica...
research
09/13/2021

MindCraft: Theory of Mind Modeling for Situated Dialogue in Collaborative Tasks

An ideal integration of autonomous agents in a human world implies that ...
research
04/25/2020

Joint Inference of States, Robot Knowledge, and Human (False-)Beliefs

Aiming to understand how human (false-)belief–a core socio-cognitive abi...
research
06/10/2019

Towards Social Artificial Intelligence: Nonverbal Social Signal Prediction in A Triadic Interaction

We present a new research task and a dataset to understand human social ...
research
03/01/2023

Multiperspective Teaching of Unknown Objects via Shared-gaze-based Multimodal Human-Robot Interaction

For successful deployment of robots in multifaceted situations, an under...
research
01/07/2018

Perceptual Context in Cognitive Hierarchies

Cognition does not only depend on bottom-up sensor feature abstraction, ...
research
06/27/2023

MindDial: Belief Dynamics Tracking with Theory-of-Mind Modeling for Situated Neural Dialogue Generation

Humans talk in free-form while negotiating the expressed meanings or com...

Please sign up or login with your details

Forgot password? Click here to reset