Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
38 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

NormTab: Improving Symbolic Reasoning in LLMs Through Tabular Data Normalization (2406.17961v1)

Published 25 Jun 2024 in cs.CL, cs.AI, cs.DB, and cs.IR

Abstract: In recent years, LLMs have demonstrated remarkable capabilities in parsing textual data and generating code. However, their performance in tasks involving tabular data, especially those requiring symbolic reasoning, faces challenges due to the structural variance and inconsistency in table cell values often found in web tables. In this paper, we introduce NormTab, a novel framework aimed at enhancing the symbolic reasoning performance of LLMs by normalizing web tables. We study table normalization as a stand-alone, one-time preprocessing step using LLMs to support symbolic reasoning on tabular data. Our experimental evaluation, conducted on challenging web table datasets such as WikiTableQuestion and TabFact, demonstrates that leveraging NormTab significantly improves symbolic reasoning performance, showcasing the importance and effectiveness of web table normalization for enhancing LLM-based symbolic reasoning tasks.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (2)
  1. Md Mahadi Hasan Nahid (4 papers)
  2. Davood Rafiei (26 papers)
Citations (1)
X Twitter Logo Streamline Icon: https://streamlinehq.com