Incorporating External Knowledge to Answer Open-Domain Visual Questions with Dynamic Memory Networks

12/03/2017
by   Guohao Li, et al.
0

Visual Question Answering (VQA) has attracted much attention since it offers insight into the relationships between the multi-modal analysis of images and natural language. Most of the current algorithms are incapable of answering open-domain questions that require to perform reasoning beyond the image contents. To address this issue, we propose a novel framework which endows the model capabilities in answering more complex questions by leveraging massive external knowledge with dynamic memory networks. Specifically, the questions along with the corresponding images trigger a process to retrieve the relevant information in external knowledge bases, which are embedded into a continuous vector space by preserving the entity-relation structures. Afterwards, we employ dynamic memory networks to attend to the large body of facts in the knowledge graph and images, and then perform reasoning over these facts to generate corresponding answers. Extensive experiments demonstrate that our model not only achieves the state-of-the-art performance in the visual question answering task, but can also answer open-domain questions effectively by leveraging the external knowledge.

READ FULL TEXT

page 7

page 8

page 11

research
06/13/2018

Learning Visual Knowledge Memory Networks for Visual Question Answering

Visual question answering (VQA) requires joint comprehension of images a...
research
06/13/2023

AVIS: Autonomous Visual Information Seeking with Large Language Models

In this paper, we propose an autonomous information seeking visual quest...
research
03/21/2022

Targeted Extraction of Temporal Facts from Textual Resources for Improved Temporal Question Answering over Knowledge Bases

Knowledge Base Question Answering (KBQA) systems have the goal of answer...
research
03/23/2021

Multi-Modal Answer Validation for Knowledge-Based VQA

The problem of knowledge-based visual question answering involves answer...
research
09/15/2018

Improving Natural Language Inference Using External Knowledge in the Science Questions Domain

Natural Language Inference (NLI) is fundamental to many Natural Language...
research
01/29/2020

MEMO: A Deep Network for Flexible Combination of Episodic Memories

Recent research developing neural network architectures with external me...
research
10/29/2019

Generating Questions for Knowledge Bases via Incorporating Diversified Contexts and Answer-Aware Loss

We tackle the task of question generation over knowledge bases. Conventi...

Please sign up or login with your details

Forgot password? Click here to reset