FinQA: A Dataset of Numerical Reasoning over Financial Data

Zhiyu Chen, Wenhu Chen, Charese Smiley, Sameena Shah, Iana Borova, Dylan Langdon, Reema Moussa, Matt Beane, Ting-Hao Huang, Bryan Routledge, William Yang Wang

2021-09-01EMNLP 2021 11Question Answering

Paper PDF Code(official)

Abstract

The sheer volume of financial statements makes it difficult for humans to access and analyze a business's financials. Robust numerical reasoning likewise faces unique challenges in this domain. In this work, we focus on answering deep questions over financial data, aiming to automate the analysis of a large corpus of financial documents. In contrast to existing tasks on general domain, the finance domain includes complex numerical reasoning and understanding of heterogeneous representations. To facilitate analytical progress, we propose a new large-scale dataset, FinQA, with Question-Answering pairs over Financial reports, written by financial experts. We also annotate the gold reasoning programs to ensure full explainability. We further introduce baselines and conduct comprehensive experiments in our dataset. The results demonstrate that popular, large, pre-trained models fall far short of expert humans in acquiring finance knowledge and in complex multi-step numerical reasoning on that knowledge. Our dataset -- the first of its kind -- should therefore enable significant, new community research into complex application domains. The dataset and code are publicly available\url{https://github.com/czyssrs/FinQA}.

Results

Task	Dataset	Metric	Value	Model
Question Answering	FinQA	Execution Accuracy	65.05	FinQANet (RoBERTa-large)
Question Answering	FinQA	Program Accuracy	63.52	FinQANet (RoBERTa-large)
Question Answering	FinQA	Execution Accuracy	57.43	FinQANet (BERT-large)
Question Answering	FinQA	Program Accuracy	55.52	FinQANet (BERT-large)
Question Answering	FinQA	Execution Accuracy	53.71	FinQANet (FinBert )
Question Answering	FinQA	Program Accuracy	51.71	FinQANet (FinBert )

FinQA: A Dataset of Numerical Reasoning over Financial Data

Abstract

Results

Related Papers

FinQA: A Dataset of Numerical Reasoning over Financial Data

Abstract

Results

Related Papers