Baby Intuitions Benchmark (BIB): Discerning the goals, preferences, and actions of others

02/23/2021
by   Kanishk Gandhi, et al.
41

To achieve human-like common sense about everyday life, machine learning systems must understand and reason about the goals, preferences, and actions of others. Human infants intuitively achieve such common sense by making inferences about the underlying causes of other agents' actions. Directly informed by research on infant cognition, our benchmark BIB challenges machines to achieve generalizable, common-sense reasoning about other agents like human infants do. As in studies on infant cognition, moreover, we use a violation of expectation paradigm in which machines must predict the plausibility of an agent's behavior given a video sequence, making this benchmark appropriate for direct validation with human infants in future studies. We show that recently proposed, deep-learning-based agency reasoning models fail to show infant-like reasoning, leaving BIB an open challenge.

READ FULL TEXT

page 4

page 5

page 6

page 8

page 14

page 15

page 16

page 18

research
08/04/2022

Solving the Baby Intuitions Benchmark with a Hierarchically Bayesian Theory of Mind

To facilitate the development of new models to bridge the gap between ma...
research
07/15/2022

Reasoning about Actions over Visual and Linguistic Modalities: A Survey

'Actions' play a vital role in how humans interact with the world and en...
research
02/24/2021

AGENT: A Benchmark for Core Psychological Reasoning

For machine agents to successfully interact with humans in real-world se...
research
06/19/2020

Contextual and Possibilistic Reasoning for Coalition Formation

In multiagent systems, agents often have to rely on other agents to reac...
research
06/15/2020

Machine Common Sense

Machine common sense remains a broad, potentially unbounded problem in a...
research
09/01/2010

Not only a lack of right definitions: Arguments for a shift in information-processing paradigm

Machine Consciousness and Machine Intelligence are not simply new buzzwo...
research
11/05/2018

On the Evaluation of Common-Sense Reasoning in Natural Language Understanding

The NLP and ML communities have long been interested in developing model...

Please sign up or login with your details

Forgot password? Click here to reset