Distilling Tool Knowledge into Language Models via Back-Translated Traces (2506.19171v1)

Published 23 Jun 2025 in cs.LG

Abstract: LLMs often struggle with mathematical problems that require exact computation or multi-step algebraic reasoning. Tool-integrated reasoning (TIR) offers a promising solution by leveraging external tools such as code interpreters to ensure correctness, but it introduces inference-time dependencies that hinder scalability and deployment. In this work, we propose a new paradigm for distilling tool knowledge into LLMs purely through natural language. We first construct a Solver Agent that solves math problems by interleaving planning, symbolic tool calls, and reflective reasoning. Then, using a back-translation pipeline powered by multiple LLM-based agents, we convert interleaved TIR traces into natural language reasoning traces. A Translator Agent generates explanations for individual tool calls, while a Rephrase Agent merges them into a fluent and globally coherent narrative. Empirically, we show that fine-tuning a small open-source model on these synthesized traces enables it to internalize both tool knowledge and structured reasoning patterns, yielding gains on competition-level math benchmarks without requiring tool access at inference.

Collections

Summary

We haven't generated a summary for this paper yet.

Summarize Now

Paper Prompts

Explore 10 Community Prompts

Follow-up Questions

We haven't generated follow-up questions for this paper yet.

Generate Now

Distilling Tool Knowledge into Language Models via Back-Translated Traces (2506.19171v1)

Collections

Summary

Paper Prompts

Follow-up Questions

Authors (12)

Don't miss out on important new AI/ML research

Distilling Tool Knowledge into Language Models via Back-Translated Traces (2506.19171v1)

Collections

Summary

Paper Prompts

Follow-up Questions

Related Papers

Authors (12)

Don't miss out on important new AI/ML research