PACS: A Dataset for Physical Audiovisual CommonSense Reasoning

03/21/2022
by   Samuel Yu, et al.

For AI to be safely deployed in real-world scenarios such as hospitals, schools, and the workplace, it must be able to reason about the physical world by understanding the physical properties and affordances of available objects, how they can be manipulated, and how they interact with other physical objects. This research field of physical commonsense reasoning is fundamentally a multi-sensory task, since physical properties are manifested through multiple modalities, two of them being vision and acoustics. Our paper takes a step towards real-world physical commonsense reasoning by contributing PACS: the first audiovisual benchmark annotated for physical commonsense attributes. PACS contains a total of 13,400 question-answer pairs, involving 1,377 unique physical commonsense questions and 1,526 videos. Our dataset provides new opportunities to advance the research field of physical reasoning by bringing audio as a core component of this multimodal problem. Using PACS, we evaluate multiple state-of-the-art models on this new challenging task. While some models show promising results (70% accuracy), they all fall short of human performance (95% accuracy). We conclude the paper by demonstrating the importance of multimodal reasoning and providing possible avenues for future research.


research
11/26/2019

PIQA: Reasoning about Physical Commonsense in Natural Language

To apply eyeshadow without a brush, should I use a cotton swab or a toot...
research
06/07/2021

PROST: Physical Reasoning of Objects through Space and Time

We present a new probing dataset named PROST: Physical Reasoning about O...
research
10/27/2021

How Much Coffee Was Consumed During EMNLP 2019? Fermi Problems: A New Reasoning Challenge for AI

Many real-world problems require the combined application of multiple re...
research
06/28/2021

Modeling and Reasoning in Event Calculus using Goal-Directed Constraint Answer Set Programming

Automated commonsense reasoning is essential for building human-like AI ...
research
07/16/2023

Recognition of Mental Adjectives in An Efficient and Automatic Style

In recent years, commonsense reasoning has received more and more attent...
research
06/04/2021

MERLOT: Multimodal Neural Script Knowledge Models

As humans, we understand events in the visual world contextually, perfor...
research
07/30/2011

CBR with Commonsense Reasoning and Structure Mapping: An Application to Mediation

Mediation is an important method in dispute resolution. We implement a c...
