FinQA: A Dataset of Numerical Reasoning over Financial Data

09/01/2021
by   Zhiyu Chen, et al.

The sheer volume of financial statements makes it difficult for humans to access and analyze a business's financials. Robust numerical reasoning likewise faces unique challenges in this domain. In this work, we focus on answering deep questions over financial data, aiming to automate the analysis of a large corpus of financial documents. In contrast to existing tasks in the general domain, the finance domain involves complex numerical reasoning and an understanding of heterogeneous representations. To facilitate analytical progress, we propose a new large-scale dataset, FinQA, with question-answering pairs over financial reports, written by financial experts. We also annotate the gold reasoning programs to ensure full explainability. We further introduce baselines and conduct comprehensive experiments on our dataset. The results demonstrate that popular, large, pre-trained models fall far short of expert humans in acquiring finance knowledge and in performing complex multi-step numerical reasoning on that knowledge. Our dataset, the first of its kind, should therefore enable significant new community research into complex application domains. The dataset and code are publicly available at https://github.com/czyssrs/FinQA.
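To make the idea of a "reasoning program" concrete, here is a minimal sketch of how a multi-step numerical program could be executed so that every intermediate result stays inspectable. The step syntax (`op(arg, arg)` with `#k` referring to the result of step k), the operation table `OPS`, the function name `run_program`, and the numbers are all assumptions made for illustration; the abstract does not specify the program format, and none of this is taken from the released code.

```python
import re
from typing import List

# Hypothetical operation table; the abstract does not list the real operation set.
OPS = {
    "add": lambda a, b: a + b,
    "subtract": lambda a, b: a - b,
    "multiply": lambda a, b: a * b,
    "divide": lambda a, b: a / b,
}

# One step looks like `name(left, right)`; this syntax is an assumption.
STEP = re.compile(r"(\w+)\(([^,()]+),([^,()]+)\)")


def run_program(program: str) -> List[float]:
    """Execute steps left to right and return every intermediate result."""
    results: List[float] = []
    for name, left, right in STEP.findall(program):
        args = []
        for token in (left.strip(), right.strip()):
            if token.startswith("#"):
                # `#k` refers back to the result of step k (0-based).
                args.append(results[int(token[1:])])
            else:
                args.append(float(token))
        results.append(OPS[name](*args))
    return results


# Illustrative numbers only: a relative change from 100 to 120.
steps = run_program("subtract(120, 100), divide(#0, 100)")
print(steps)  # [20.0, 0.2]
```

Because each step's result is kept, a reader can check the chain of arithmetic behind the final answer instead of seeing only the final number, which is the kind of explainability the annotated programs are meant to provide.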


