GeoQA: A Geometric Question Answering Benchmark Towards Multimodal Numerical Reasoning

05/30/2021
by   Jiaqi Chen, et al.
25

Automatic math problem solving has recently attracted increasing attention as a long-standing AI benchmark. In this paper, we focus on solving geometric problems, which requires a comprehensive understanding of textual descriptions, visual diagrams, and theorem knowledge. However, the existing methods were highly dependent on handcraft rules and were merely evaluated on small-scale datasets. Therefore, we propose a Geometric Question Answering dataset GeoQA, containing 5,010 geometric problems with corresponding annotated programs, which illustrate the solving process of the given problems. Compared with another publicly available dataset GeoS, GeoQA is 25 times larger, in which the program annotations can provide a practical testbed for future research on explicit and explainable numerical reasoning. Moreover, we introduce a Neural Geometric Solver (NGS) to address geometric problems by comprehensively parsing multimodal information and generating interpretable programs. We further add multiple self-supervised auxiliary tasks on NGS to enhance cross-modal semantic representation. Extensive experiments on GeoQA validate the effectiveness of our proposed NGS and auxiliary tasks. However, the results are still significantly lower than human performance, which leaves large room for future research. Our benchmark and code are released at https://github.com/chen-judge/GeoQA .

READ FULL TEXT

page 1

page 2

page 3

page 4

page 5

page 7

page 9

page 10

research
01/15/2022

A Benchmark for Generalizable and Interpretable Temporal Question Answering over Knowledge Bases

Knowledge Base Question Answering (KBQA) tasks that involve complex reas...
research
08/20/2019

GeoSQA: A Benchmark for Scenario-based Question Answering in the Geography Domain at High School Level

Scenario-based question answering (SQA) has attracted increasing researc...
research
05/21/2023

TheoremQA: A Theorem-driven Question Answering dataset

The recent LLMs like GPT-4 and PaLM-2 have made tremendous progress in s...
research
05/30/2019

MathQA: Towards Interpretable Math Word Problem Solving with Operation-Based Formalisms

We introduce a large-scale dataset of math word problems and an interpre...
research
05/10/2021

Inter-GPS: Interpretable Geometry Problem Solving with Formal Language and Symbolic Reasoning

Geometry problem solving has attracted much attention in the NLP communi...
research
10/25/2021

IconQA: A New Benchmark for Abstract Diagram Understanding and Visual Language Reasoning

Current visual question answering (VQA) tasks mainly consider answering ...
research
02/22/2023

A Multi-Modal Neural Geometric Solver with Textual Clauses Parsed from Diagram

Geometry problem solving (GPS) is a high-level mathematical reasoning re...

Please sign up or login with your details

Forgot password? Click here to reset