StepGame: A New Benchmark for Robust Multi-Hop Spatial Reasoning in Texts

04/18/2022
by   Zhengxiang Shi, et al.
0

Inferring spatial relations in natural language is a crucial ability an intelligent system should possess. The bAbI dataset tries to capture tasks relevant to this domain (task 17 and 19). However, these tasks have several limitations. Most importantly, they are limited to fixed expressions, they are limited in the number of reasoning steps required to solve them, and they fail to test the robustness of models to input that contains irrelevant or redundant information. In this paper, we present a new Question-Answering dataset called StepGame for robust multi-hop spatial reasoning in texts. Our experiments demonstrate that state-of-the-art models on the bAbI dataset struggle on the StepGame dataset. Moreover, we propose a Tensor-Product based Memory-Augmented Neural Network (TP-MANN) specialized for spatial reasoning tasks. Experimental results on both datasets show that our model outperforms all the baselines with superior generalization and robustness performance.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/02/2020

Constructing A Multi-hop QA Dataset for Comprehensive Evaluation of Reasoning Steps

A multi-hop question answering (QA) dataset aims to test reasoning and i...
research
01/15/2018

An Interpretable Reasoning Network for Multi-Relation Question Answering

Multi-relation Question Answering is a challenging task, due to the requ...
research
10/11/2022

How Well Do Multi-hop Reading Comprehension Models Understand Date Information?

Several multi-hop reading comprehension datasets have been proposed to r...
research
04/18/2021

Generative Context Pair Selection for Multi-hop Question Answering

Compositional reasoning tasks like multi-hop question answering, require...
research
10/27/2021

SQALER: Scaling Question Answering by Decoupling Multi-Hop and Logical Reasoning

State-of-the-art approaches to reasoning and question answering over kno...
research
09/10/2019

Neural Belief Reasoner

This paper proposes a new generative model called neural belief reasoner...
research
03/16/2022

E-KAR: A Benchmark for Rationalizing Natural Language Analogical Reasoning

The ability to recognize analogies is fundamental to human cognition. Ex...

Please sign up or login with your details

Forgot password? Click here to reset