Ask Your Neurons: A Neural-based Approach to Answering Questions about Images

05/05/2015
by Mateusz Malinowski, et al.

We address a question answering task on real-world images that is set up as a Visual Turing Test. By combining the latest advances in image representation and natural language processing, we propose Neural-Image-QA, an end-to-end formulation of this problem in which all parts are trained jointly. In contrast to previous efforts, we face a multi-modal problem where the language output (the answer) is conditioned on both visual and natural language input (the image and the question). Our approach, Neural-Image-QA, doubles the performance of the previous best approach on this problem. We provide additional insights into the problem by analyzing how much information is contained in the language part alone, for which we provide a new human baseline. To study human consensus, which is related to the ambiguities inherent in this challenging task, we propose two novel metrics and collect additional answers that extend the original DAQUAR dataset to DAQUAR-Consensus.

research
05/09/2016

Ask Your Neurons: A Deep Learning Approach to Visual Question Answering

We address a question answering task on real-world images that is set up...
research
10/01/2014

A Multi-World Approach to Question Answering about Real-World Scenes based on Uncertain Input

We propose a method for automatically answering questions about images b...
research
04/04/2020

Talk to Papers: Bringing Neural Question Answering to Academic Search

We introduce Talk to Papers, which exploits the recent open-domain quest...
research
07/19/2019

A Comparative Evaluation of Visual and Natural Language Question Answering Over Linked Data

With the growing number and size of Linked Data datasets, it is crucial ...
research
10/29/2014

Towards a Visual Turing Challenge

As language and visual understanding by machines progresses rapidly, we ...
research
04/19/2016

M^2S-Net: Multi-Modal Similarity Metric Learning based Deep Convolutional Network for Answer Selection

Recent works using artificial neural networks based on distributed word ...
research
04/20/2018

Right Answer for the Wrong Reason: Discovery and Mitigation

Exposing the weaknesses of neural models is crucial for improving their ...
