Think Visually: Question Answering through Virtual Imagery

05/25/2018
by Ankit Goyal, et al.

In this paper, we study the problem of geometric reasoning in the context of question answering. We introduce the Dynamic Spatial Memory Network (DSMN), a new deep network architecture designed for answering questions that admit latent visual representations. DSMN learns to generate and reason over such representations. Further, we propose two synthetic benchmarks, FloorPlanQA and ShapeIntersection, to evaluate the geometric reasoning capability of QA systems. Experimental results validate the effectiveness of our proposed DSMN for visual thinking tasks.

