Eliciting Better Multilingual Structured Reasoning from LLMs through Code (2403.02567v2)

Published 5 Mar 2024 in cs.CL and cs.AI

Abstract: The development of large language models (LLMs) has shown progress on reasoning, though studies have largely considered either English or simple reasoning tasks. To address this, we introduce a multilingual structured reasoning and explanation dataset, termed xSTREET, that covers four tasks across six languages. xSTREET exposes a gap in base LLM performance between English and non-English reasoning tasks. We then propose two methods to remedy this gap, building on the insight that LLMs trained on code are better reasoners. First, at training time, we augment a code dataset with multilingual comments using machine translation while keeping program code as-is. Second, at inference time, we bridge the gap between training and inference by employing a prompt structure that incorporates step-by-step code primitives to derive new facts and find a solution. Our methods show improved multilingual performance on xSTREET, most notably on the scientific commonsense reasoning subtask. Furthermore, the models show no regression on non-reasoning tasks, thus demonstrating our techniques maintain general-purpose abilities.
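The training-time method described in the abstract can be pictured with a small sketch. The code below is a minimal illustration, not the paper's released pipeline: it assumes line-level '#' comments and a machine-translation callable named translate (a hypothetical stand-in for any MT system), and it localizes only the natural-language comments while leaving the program tokens untouched, which is the augmentation the abstract describes.

    # Sketch: translate only the natural-language comments in a code snippet,
    # keeping the program code as-is (the training-time augmentation idea).
    # NOTE: assumes line-level '#' comments; '#' inside string literals is not handled.
    def translate_comments(source: str, translate) -> str:
        out = []
        for line in source.splitlines():
            code, sep, comment = line.partition("#")
            if sep and comment.strip():
                # Replace the comment text with its translation; the code stays untouched.
                out.append(f"{code}{sep} {translate(comment.strip())}")
            else:
                out.append(line)
        return "\n".join(out)

    # Toy usage with a placeholder "translator"; a real pipeline would call an MT model.
    snippet = "total = sum(xs)  # add up all the values\nprint(total)"
    print(translate_comments(snippet, lambda s: "[es] " + s))

A real pipeline would swap the placeholder lambda for an actual MT model and would also need to handle block comments and docstrings; the key property preserved here is that only the comments change across languages.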

Authors (5)
  1. Bryan Li (17 papers)
  2. Tamer Alkhouli (7 papers)
  3. Daniele Bonadiman (10 papers)
  4. Nikolaos Pappas (188 papers)
  5. Saab Mansour (32 papers)
Citations (2)