iReason: Multimodal Commonsense Reasoning using Videos and Natural Language with Interpretability

06/25/2021
by   Aman Chadha, et al.
0

Causality knowledge is vital to building robust AI systems. Deep learning models often perform poorly on tasks that require causal reasoning, which is often derived using some form of commonsense knowledge not immediately available in the input but implicitly inferred by humans. Prior work has unraveled spurious observational biases that models fall prey to in the absence of causality. While language representation models preserve contextual knowledge within learned embeddings, they do not factor in causal relationships during training. By blending causal relationships with the input features to an existing model that performs visual cognition tasks (such as scene understanding, video captioning, video question-answering, etc.), better performance can be achieved owing to the insight causal relationships bring about. Recently, several models have been proposed that have tackled the task of mining causal data from either the visual or textual modality. However, there does not exist widespread research that mines causal relationships by juxtaposing the visual and language modalities. While images offer a rich and easy-to-process resource for us to mine causality knowledge from, videos are denser and consist of naturally time-ordered events. Also, textual information offers details that could be implicit in videos. We propose iReason, a framework that infers visual-semantic commonsense knowledge using both videos and natural language captions. Furthermore, iReason's architecture integrates a causal rationalization module to aid the process of interpretability, error analysis and bias detection. We demonstrate the effectiveness of iReason using a two-pronged comparative analysis with language representation learning models (BERT, GPT-2) as well as current state-of-the-art multimodal causality models.

READ FULL TEXT
research
12/13/2020

Learning Contextual Causality from Time-consecutive Images

Causality knowledge is crucial for many artificial intelligence systems....
research
08/29/2020

iPerceive: Applying Common-Sense Reasoning to Multi-Modal Dense Video Captioning and Video Question Answering

Most prior art in visual understanding relies solely on analyzing the "w...
research
12/10/2020

Causal-BERT : Language models for causality detection between events expressed in text

Causality understanding between events is a critical natural language pr...
research
12/10/2020

A Practical Approach towards Causality Mining in Clinical Text using Active Transfer Learning

Objective: Causality mining is an active research area, which requires t...
research
06/02/2021

John praised Mary because he? Implicit Causality Bias and Its Interaction with Explicit Cues in LMs

Some interpersonal verbs can implicitly attribute causality to either th...
research
01/31/2022

Causal Inference Principles for Reasoning about Commonsense Causality

Commonsense causality reasoning (CCR) aims at identifying plausible caus...
research
07/27/2017

Detecting and Explaining Causes From Text For a Time Series Event

Explaining underlying causes or effects about events is a challenging bu...

Please sign up or login with your details

Forgot password? Click here to reset