TAT-LLM: A Specialized Language Model for Discrete Reasoning over Tabular and Textual Data (2401.13223v3)
Abstract: In this work, we address question answering (QA) over a hybrid of tabular and textual data, a very common form of content on the Web (e.g., SEC filings), where discrete reasoning capabilities are often required. Large language models (LLMs) such as GPT-4 have recently demonstrated strong multi-step reasoning capabilities, so we consider harnessing the power of LLMs to solve our task. We abstract a Step-wise Pipeline for tabular and textual QA, consisting of three key steps: Extractor, Reasoner, and Executor. We initially design an instruction to instantiate this pipeline and validate that GPT-4 outperforms all existing methods. However, relying on an online LLM such as GPT-4 poses challenges in terms of cost, latency, and data security risk, which motivates us to specialize smaller LLMs for this task. We develop TAT-LLM, a specialized language model, by fine-tuning LLaMA 2 on training data generated automatically from existing expert-annotated datasets following the Step-wise Pipeline. Experimental results verify that TAT-LLM outperforms all baseline models, including both the previous best fine-tuned models and very large-scale LLMs such as GPT-4, on the FinQA, TAT-QA, and TAT-DQA benchmarks.
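The Step-wise Pipeline described above can be sketched as three composed functions. This is only an illustrative toy, not the paper's implementation: in TAT-LLM each step is produced by the fine-tuned LLM itself, whereas here the extraction and equation-building are hypothetical rule-based stand-ins on a tiny table.

```python
# Illustrative sketch of the three-step pipeline (Extractor -> Reasoner ->
# Executor). The helper names and the rule-based logic are hypothetical
# stand-ins for the LLM-generated steps described in the paper.
from typing import Dict


def extractor(question: str, table: Dict[str, float]) -> Dict[str, float]:
    """Step 1: pull the evidence (table cells / text spans) relevant to the question."""
    return {k: v for k, v in table.items() if k in question}


def reasoner(question: str, evidence: Dict[str, float]) -> str:
    """Step 2: turn the evidence into a symbolic expression (an 'equation')."""
    vals = list(evidence.values())
    if "change" in question and len(vals) == 2:
        return f"{vals[1]} - {vals[0]}"
    raise ValueError("question pattern not handled in this sketch")


def executor(expression: str) -> float:
    """Step 3: evaluate the expression to obtain the final answer."""
    return eval(expression)  # acceptable here; expressions come from our own reasoner


table = {"revenue 2019": 120.0, "revenue 2020": 150.0}
question = "What is the change from revenue 2019 to revenue 2020?"
evidence = extractor(question, table)
answer = executor(reasoner(question, evidence))
print(answer)  # 30.0
```

Separating the symbolic expression (Reasoner) from its evaluation (Executor) is what makes the pipeline's discrete reasoning auditable: the intermediate equation can be checked independently of the final number.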
- Fengbin Zhu
- Ziyang Liu
- Fuli Feng
- Chao Wang
- Moxin Li
- Tat-Seng Chua