DocFinQA: A Long-Context Financial Reasoning Dataset (2401.06915v2)
Abstract: For LLMs to be effective in the financial domain -- where each decision can have a significant impact -- it is necessary to investigate realistic tasks and data. Financial professionals often work with documents that are hundreds of pages long, but most financial research datasets cover only short excerpts of these documents. To address this, we introduce a long-document financial QA task. We augment 7,437 questions from the existing FinQA dataset with full-document context, extending the average context length from under 700 words in FinQA to 123k words in DocFinQA. We conduct extensive experiments over retrieval-based QA pipelines and long-context LLMs. DocFinQA proves to be a significant challenge even for state-of-the-art systems. We also provide a case study on the longest documents in DocFinQA and find that models struggle particularly on these documents. Addressing these challenges may have a wide-reaching impact across applications where specificity and long-range context are critical, such as gene sequences and legal contract analysis.
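The abstract mentions retrieval-based QA pipelines over full filings. As a minimal sketch (not the paper's implementation), such a pipeline typically splits a long document into overlapping chunks, scores each chunk against the question with a lexical ranker such as BM25, and passes only the top-scoring chunks to the model. The chunk sizes and BM25 parameters below are illustrative assumptions:

```python
import math
import re
from collections import Counter


def chunk_words(text, size=200, overlap=50):
    """Split text into overlapping word-window chunks."""
    words = text.split()
    step = size - overlap
    return [" ".join(words[i:i + size])
            for i in range(0, max(len(words) - overlap, 1), step)]


def tokenize(text):
    return re.findall(r"[a-z0-9]+", text.lower())


def bm25_scores(query, chunks, k1=1.5, b=0.75):
    """Score each chunk against the query with standard BM25."""
    docs = [tokenize(c) for c in chunks]
    avgdl = sum(len(d) for d in docs) / len(docs)
    n = len(docs)
    df = Counter()                      # document frequency per term
    for d in docs:
        for term in set(d):
            df[term] += 1
    scores = []
    for d in docs:
        tf = Counter(d)
        score = 0.0
        for term in tokenize(query):
            if term not in tf:
                continue
            idf = math.log(1 + (n - df[term] + 0.5) / (df[term] + 0.5))
            norm = k1 * (1 - b + b * len(d) / avgdl)
            score += idf * tf[term] * (k1 + 1) / (tf[term] + norm)
        scores.append(score)
    return scores


def retrieve(query, document, top_k=3, chunk_size=200, overlap=50):
    """Return the top_k chunks most relevant to the query."""
    chunks = chunk_words(document, size=chunk_size, overlap=overlap)
    scores = bm25_scores(query, chunks)
    ranked = sorted(range(len(chunks)), key=lambda i: -scores[i])
    return [chunks[i] for i in ranked[:top_k]]
```

The retrieved chunks would then be concatenated into the prompt of a QA model in place of the full 123k-word document; a dense retriever (e.g. sentence embeddings) could be swapped in for BM25 without changing the pipeline's shape.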
Authors: Varshini Reddy, Rik Koncel-Kedziorski, Viet Dac Lai, Chris Tanner, Michael Krumdick, Charles Lovering