SpartQA: A Textual Question Answering Benchmark for Spatial Reasoning

04/12/2021
by Roshanak Mirzaee, et al.

This paper proposes a question-answering (QA) benchmark for spatial reasoning on natural language text that contains more realistic spatial phenomena not covered by prior work and is challenging for state-of-the-art language models (LMs). We propose a distant supervision method to improve on this task. Specifically, we design grammar and reasoning rules to automatically generate spatial descriptions of visual scenes and corresponding QA pairs. Experiments show that further pretraining LMs on this automatically generated data significantly improves their spatial understanding, which in turn helps them better solve two external datasets, bAbI and boolQ. We hope that this work can foster investigations into more sophisticated models for spatial reasoning over text.
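As a rough illustration of the rule-based generation idea described in the abstract, the sketch below builds a short textual description and yes/no QA pairs from a toy scene. It is not the authors' pipeline: the scene layout, the single "left of" relation, the sentence templates, and the names `SCENE`, `left_of`, `describe`, and `generate_qa` are all assumptions made for this example.

```python
from itertools import permutations

# Hypothetical toy scene: object name -> (x, y) position. Not taken from the paper.
SCENE = {"circle": (0, 0), "square": (2, 0), "triangle": (5, 0)}


def left_of(scene, a, b):
    """Return True if object a has a smaller x coordinate than object b."""
    return scene[a][0] < scene[b][0]


def describe(scene):
    """Template rule: state a left-of fact only for horizontally adjacent objects."""
    ordered = sorted(scene, key=lambda name: scene[name][0])
    return [
        f"The {a} is to the left of the {b}."
        for a, b in zip(ordered, ordered[1:])
    ]


def generate_qa(scene):
    """Build a yes/no question for every ordered pair, answered from the scene."""
    qa = []
    for a, b in permutations(scene, 2):
        question = f"Is the {a} to the left of the {b}?"
        answer = "Yes" if left_of(scene, a, b) else "No"
        qa.append((question, answer))
    return qa


if __name__ == "__main__":
    print(" ".join(describe(SCENE)))
    for question, answer in generate_qa(SCENE):
        print(question, answer)
```

Because the description states only adjacent relations, a question about the circle and the triangle can be answered from the text only by chaining two stated facts, which is the kind of multi-step spatial inference such generated data is meant to exercise.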

research
04/28/2022

Inferring Implicit Relations with Language Models

A prominent challenge for modern language understanding systems is the a...
research
10/30/2022

Transfer Learning with Synthetic Corpora for Spatial Role Labeling and Reasoning

Recent research shows synthetic data as a source of supervision helps pr...
research
08/18/2023

Towards Grounded Visual Spatial Reasoning in Multi-Modal Vision Language Models

With the advances in large scale vision-and-language models (VLMs) it is...
research
01/05/2023

SPRING: Situated Conversation Agent Pretrained with Multimodal Questions from Incremental Layout Graph

Existing multimodal conversation agents have shown impressive abilities ...
research
04/11/2020

Exploring The Spatial Reasoning Ability of Neural Models in Human IQ Tests

Although neural models have performed impressively well on various tasks...
research
07/05/2023

SpaceNLI: Evaluating the Consistency of Predicting Inferences in Space

While many natural language inference (NLI) datasets target certain sema...
research
05/17/2021

TAT-QA: A Question Answering Benchmark on a Hybrid of Tabular and Textual Content in Finance

Hybrid data combining both tabular and textual content (e.g., financial ...
