
Making Long-Context Language Models Better Multi-Hop Reasoners (2408.03246v1)

Published 6 Aug 2024 in cs.CL

Abstract: Recent advancements in long-context modeling have enhanced language models (LMs) for complex tasks across multiple NLP applications. Despite this progress, we find that these models struggle with multi-hop reasoning and exhibit decreased performance in the presence of noisy contexts. In this paper, we introduce Reasoning with Attributions, a novel approach that prompts LMs to supply attributions for each assertion during their reasoning. We validate our approach through experiments on three multi-hop datasets, employing both proprietary and open-source models, and demonstrate its efficacy and resilience. Furthermore, we explore methods to augment reasoning capabilities via fine-tuning and offer an attribution-annotated dataset and a specialized training strategy. Our fine-tuned model achieves competitive performance on multi-hop reasoning benchmarks, closely paralleling proprietary LMs such as ChatGPT and Claude-instant.
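The core prompting idea described in the abstract, asking the model to pair each assertion in its reasoning chain with a citation to a supporting context passage, can be sketched as a minimal prompt template. The wording and citation format below are illustrative assumptions, not the authors' actual template:

```python
# Hypothetical sketch of an attribution-style multi-hop prompt.
# The template wording and "[n]" citation format are assumptions for
# illustration; the paper's exact prompt may differ.

def build_attribution_prompt(question: str, passages: list[str]) -> str:
    """Ask the model to support every assertion with a cited passage."""
    context = "\n".join(f"[{i + 1}] {p}" for i, p in enumerate(passages))
    return (
        "Answer the question using the numbered passages below.\n"
        "For each step of your reasoning, state the assertion and cite\n"
        "the passage that supports it, e.g. 'Jane Doe directed Film X [1]'.\n\n"
        f"Passages:\n{context}\n\n"
        f"Question: {question}\n"
        "Reasoning with attributions:"
    )

prompt = build_attribution_prompt(
    "Where was the director of Film X born?",
    ["Film X was directed by Jane Doe.", "Jane Doe was born in Oslo."],
)
print(prompt)
```

Tying each hop to an explicit citation gives the model (and the reader) a way to check every intermediate claim against the context, which is what makes the approach more robust to noisy passages.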

Authors (4)
  1. Yanyang Li (22 papers)
  2. Shuo Liang (3 papers)
  3. Michael R. Lyu (176 papers)
  4. Liwei Wang (239 papers)
Citations (3)