
ProTrix: Building Models for Planning and Reasoning over Tables with Sentence Context (2403.02177v2)

Published 4 Mar 2024 in cs.CL

Abstract: Tables play a crucial role in conveying information in various domains. We propose a Plan-then-Reason framework to answer different types of user queries over tables with sentence context. The framework first plans the reasoning paths over the context, then assigns each step to program-based or textual reasoning to reach the final answer. This framework enhances the table reasoning abilities for both in-context learning and fine-tuning methods. GPT-3.5-Turbo following the Plan-then-Reason framework surpasses other prompting baselines without self-consistency while using fewer API calls and in-context demonstrations. We also construct an instruction tuning set, TrixInstruct, to evaluate the effectiveness of fine-tuning with this framework. We present the ProTrix model family by fine-tuning models on TrixInstruct. Our experiments show that the ProTrix family generalizes to diverse unseen tabular tasks with only 6k training instances. We further demonstrate that ProTrix can generate accurate and faithful explanations to answer complex free-form questions. Our work underscores the importance of planning and reasoning abilities for models on tabular tasks, with both generalizability and interpretability. We open-source our dataset and models at https://github.com/WilliamZR/ProTrix.

ProTrix: Enhancing Planning and Reasoning Abilities for Querying Over Tables with Contextual Sentence Information

Introduction to the Challenge

Tables are foundational structures for data representation, playing a pivotal role across numerous domains by efficiently encapsulating complex information. However, unlocking the insights they hold, especially when contextual sentences accompany them, can be a daunting task. This necessitates leveraging advanced LLMs capable of not just understanding but also reasoning over such mixed data formats to answer user queries effectively.

The Plan-then-Reason Framework

The core proposal of the paper is a novel Plan-then-Reason framework. This two-stage approach first outlines a reasoning pathway, dividing the task into components that can be addressed through either program-based reasoning or textual reasoning. Notably, the framework accommodates scenarios where a direct programmatic solution is infeasible, blending in contextual sentence information to bridge data gaps.

Planning: Here, the model embarks on a preliminary analysis to identify gaps between the user's query and available context, employing common knowledge or specific insights to map out an actionable plan. It delineates when and how to pivot between programmatic queries and textual analysis to gather the necessary information.
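The plan produced in this stage can be pictured as an ordered list of steps, each tagged with the reasoning mode that will handle it. The sketch below is illustrative only: the step types and field names are assumptions for exposition, not the paper's actual schema.

```python
from dataclasses import dataclass

@dataclass
class PlanStep:
    description: str  # what information this step should obtain
    mode: str         # "program" (SQL over the table) or "textual" (sentence context)

def build_plan() -> list[PlanStep]:
    # Hypothetical query: "Did the team win more home games than the article claims?"
    return [
        PlanStep("Count home wins from the results table", mode="program"),
        PlanStep("Extract the claimed number from the sentence context", mode="textual"),
        PlanStep("Compare the two counts to verify the claim", mode="textual"),
    ]

plan = build_plan()
modes = [step.mode for step in plan]
```

The key property is that each step carries an explicit mode, so the reasoning stage knows whether to generate SQL or to reason over the accompanying sentences.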

Reasoning: Following the blueprint laid out in the planning stage, the model executes the plan. It takes a dual approach, leveraging SQL queries for direct table manipulation and natural language processing for extracting and integrating nuanced information from sentences.
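A program-based step of this kind can be sketched by loading the table into an in-memory SQL engine and executing the model-generated query. The table contents and the SQL below are invented for illustration; they are not from the paper.

```python
import sqlite3

# Load a toy table into an in-memory SQLite database.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE results (opponent TEXT, venue TEXT, outcome TEXT)")
conn.executemany(
    "INSERT INTO results VALUES (?, ?, ?)",
    [("Lions", "home", "win"), ("Bears", "away", "loss"), ("Hawks", "home", "win")],
)

# SQL a model might generate for the plan step "count home wins".
sql = "SELECT COUNT(*) FROM results WHERE venue = 'home' AND outcome = 'win'"
home_wins = conn.execute(sql).fetchone()[0]

# A subsequent textual-reasoning step would combine this number with facts
# extracted from the sentence context to reach the final answer.
print(home_wins)  # 2
```

Executing the SQL outside the model keeps the numerical computation exact, while textual steps absorb the information that the table alone cannot provide.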

TrixInstruct: A Dataset for Instruction Tuning

To implement and fine-tune models within this framework, the authors construct an instruction tuning set dubbed TrixInstruct. Distinctive for incorporating queries that defy purely programmatic solutions and require combining table and sentence data, this dataset is instrumental in teaching models the requisite planning and reasoning capabilities. The paper benchmarks these abilities across several datasets, demonstrating the adaptability and interpretability of models trained on this instruction set.
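A single TrixInstruct instance pairs a table and its sentence context with a query and a plan-annotated target output. The record below is a speculative sketch; the field names and content are assumptions, not the released schema.

```python
import json

# Hypothetical training instance: table + sentence context + query + target
# that interleaves a plan with the final answer.
instance = {
    "table": [["Year", "Champion"], ["2020", "Lakers"], ["2021", "Bucks"]],
    "sentences": ["The 2021 finals went to six games."],
    "query": "Which team won the championship in 2021?",
    "output": (
        "Plan: Step 1 (program) - select Champion where Year = 2021. "
        "Step 2 (textual) - state the answer. Answer: Bucks"
    ),
}

# Instances of this shape serialize cleanly to JSON lines for fine-tuning.
record = json.loads(json.dumps(instance))
```

Because the target output contains the plan as well as the answer, fine-tuning on such records teaches the model to emit its reasoning path, which is what gives ProTrix its interpretability.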

ProTrix: The Model

The embodiment of this framework, ProTrix, shows strong generalization across a spectrum of tabular tasks, outperforming or matching much larger models such as GPT-3.5-turbo while using substantially less training data. Its performance not only underlines the model's adeptness at generating accurate and faithful answers to complex queries but also underscores the role of the Plan-then-Reason framework in enhancing model interpretability.

Empirical Validation and Insights

Experimental evaluations paint a comprehensive picture of ProTrix's capabilities. With an array of benchmarks spanning short-form question answering, fact verification, and free-form question answering tasks, the model's performance is rigorously tested. Notably, ProTrix demonstrates a significant edge in scenarios demanding a blend of tabular and textual data processing, numerical reasoning, and multi-hop reasoning. Such results solidify the framework's effectiveness in fostering models that can navigate the intricacies of mixed data formats seamlessly.

An ablation study further clarifies the contribution of both planning and reasoning components to the model's success, emphasizing their synergistic value. Additionally, specific analysis of program-unsolvable queries showcases ProTrix's strength in leveraging common sense and conceptual understanding to fill informational gaps left by tables.

Future Projections and Considerations

Despite its strengths, ProTrix currently faces limitations, particularly around handling complex tables with hierarchical headers and queries spanning multiple tables. Addressing these challenges will be pivotal in broadening the model's applicability and enhancing its real-world utility. Moreover, refining the evaluation metrics to better align with the nuanced outputs generated by such advanced models remains an area ripe for exploration.

Concluding Remarks

ProTrix and the underlying Plan-then-Reason framework mark a significant stride toward equipping LLMs with the nuanced capabilities needed to tackle the multifaceted challenge of reasoning over tables with accompanying sentence context. By marrying programmatic precision with the depth of textual analysis, this approach not only elevates the model's performance across diverse querying tasks but also advances the interpretability and adaptability of LLMs in handling complex, real-world data structures.

Authors (2)
  1. Zirui Wu (13 papers)
  2. Yansong Feng (81 papers)