TAT-QA: A Question Answering Benchmark on a Hybrid of Tabular and Textual Content in Finance

05/17/2021
by   Fengbin Zhu, et al.
7

Hybrid data combining both tabular and textual content (e.g., financial reports) are quite pervasive in the real world. However, Question Answering (QA) over such hybrid data is largely neglected in existing research. In this work, we extract samples from real financial reports to build a new large-scale QA dataset containing both Tabular And Textual data, named TAT-QA, where numerical reasoning is usually required to infer the answer, such as addition, subtraction, multiplication, division, counting, comparison/sorting, and the compositions. We further propose a novel QA model termed TAGOP, which is capable of reasoning over both tables and text. It adopts sequence tagging to extract relevant cells from the table along with relevant spans from the text to infer their semantics, and then applies symbolic reasoning over them with a set of aggregation operators to arrive at the final answer. TAGOPachieves 58.0 inF1, which is an 11.1 model, according to our experiments on TAT-QA. But this result still lags far behind performance of expert human, i.e.90.8 our TAT-QA is very challenging and can serve as a benchmark for training and testing powerful QA models that address hybrid form data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/20/2020

Open Question Answering over Tables and Text

In open question answering (QA), the answer to a question is produced by...
research
05/05/2023

Multi-View Graph Representation Learning for Answering Hybrid Numerical Reasoning Question

Hybrid question answering (HybridQA) over the financial report contains ...
research
05/24/2023

TACR: A Table-alignment-based Cell-selection and Reasoning Model for Hybrid Question-Answering

Hybrid Question-Answering (HQA), which targets reasoning over tables and...
research
11/07/2022

NAPG: Non-Autoregressive Program Generation for Hybrid Tabular-Textual Question Answering

Hybrid tabular-textual question answering (QA) requires reasoning from h...
research
04/12/2021

SpartQA: : A Textual Question Answering Benchmark for Spatial Reasoning

This paper proposes a question-answering (QA) benchmark for spatial reas...
research
08/15/2021

HiTab: A Hierarchical Table Dataset for Question Answering and Natural Language Generation

Tables are often created with hierarchies, but existing works on table r...
research
08/03/2023

RealCQA: Scientific Chart Question Answering as a Test-bed for First-Order Logic

We present a comprehensive study of chart visual question-answering(QA) ...

Please sign up or login with your details

Forgot password? Click here to reset