IQ-VQA: Intelligent Visual Question Answering

07/08/2020
by   Vatsal Goel, et al.

Even though there has been tremendous progress in the field of Visual Question Answering, models today still tend to be inconsistent and brittle. To this end, we propose a model-independent cyclic framework which increases the consistency and robustness of any VQA architecture. We train our models to answer the original question, generate an implication based on the answer, and then also learn to answer the generated implication correctly. As part of the cyclic framework, we propose a novel implication generator which can generate implied questions from any question-answer pair. As a baseline for future work on consistency, we provide a new human-annotated VQA-Implications dataset. The dataset consists of about 30k questions containing implications of 3 types (Logical Equivalence, Necessary Condition and Mutual Exclusion) made from the VQA v2.0 validation dataset. We show that our framework improves the consistency of VQA models by about 15% and robustness by about 2%. We also quantitatively show improvement in attention maps, which highlights better multi-modal understanding of vision and language.
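The answer, imply, re-answer cycle described in the abstract can be illustrated with a minimal sketch. Everything here is an assumption made for illustration and not the authors' implementation: `cyclic_step`, `consistency`, `toy_implication` and `make_toy_vqa` are hypothetical names, the VQA model is a lookup table, the implication generator is a single fixed template loosely resembling the Logical Equivalence type, and consistency is taken to be the fraction of generated implications answered as expected. The sketch covers inference only and omits the training losses.

```python
from typing import Callable, Dict, List, Tuple

# Hypothetical signatures: an "image" is just an identifier string here.
AnswerFn = Callable[[str, str], str]
ImplicationFn = Callable[[str, str], Tuple[str, str]]


def cyclic_step(image: str, question: str,
                answer_fn: AnswerFn,
                implication_fn: ImplicationFn) -> Dict[str, str]:
    """Answer the question, derive an implied question, then answer that too."""
    answer = answer_fn(image, question)
    implied_question, expected = implication_fn(question, answer)
    implied_answer = answer_fn(image, implied_question)
    return {
        "answer": answer,
        "implied_question": implied_question,
        "expected": expected,
        "implied_answer": implied_answer,
    }


def consistency(results: List[Dict[str, str]]) -> float:
    """Fraction of implications answered as expected (assumed definition)."""
    if not results:
        return 0.0
    hits = sum(r["implied_answer"] == r["expected"] for r in results)
    return hits / len(results)


def toy_implication(question: str, answer: str) -> Tuple[str, str]:
    """Toy stand-in for the learned generator: one yes/no template."""
    return f"Is the answer to '{question}' {answer}?", "yes"


def make_toy_vqa(table: Dict[Tuple[str, str], str]) -> AnswerFn:
    """Toy stand-in for a VQA model: table lookup, defaulting to 'no'."""
    def answer_fn(image: str, question: str) -> str:
        return table.get((image, question), "no")
    return answer_fn


# The toy model is consistent on img1 but not on img2.
table = {
    ("img1", "How many birds are there?"): "2",
    ("img1", "Is the answer to 'How many birds are there?' 2?"): "yes",
    ("img2", "What color is the car?"): "red",
}
vqa = make_toy_vqa(table)
results = [
    cyclic_step("img1", "How many birds are there?", vqa, toy_implication),
    cyclic_step("img2", "What color is the car?", vqa, toy_implication),
]
score = consistency(results)
```

In this toy setup the model answers the implied question correctly for `img1` and falls back to "no" for `img2`, so one of the two implications is consistent. A real system would replace both stand-ins with learned models.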


research
02/15/2019

Cycle-Consistency for Robust Visual Question Answering

Despite significant progress in Visual Question Answering over the years...
research
02/19/2020

VQA-LOL: Visual Question Answering under the Lens of Logic

Logical connectives and their implications on the meaning of a natural l...
research
12/15/2021

3D Question Answering

Visual Question Answering (VQA) has witnessed tremendous progress in rec...
research
07/18/2023

Generative Visual Question Answering

Multi-modal tasks involving vision and language in deep learning continu...
research
03/15/2022

CARETS: A Consistency And Robustness Evaluative Test Suite for VQA

We introduce CARETS, a systematic test suite to measure consistency and ...
research
03/16/2023

Logical Implications for Visual Question Answering Consistency

Despite considerable recent progress in Visual Question Answering (VQA) ...
research
10/10/2017

iVQA: Inverse Visual Question Answering

In recent years, visual question answering (VQA) has become topical as a...
