Recursion of Thought: A Divide-and-Conquer Approach to Multi-Context Reasoning with Language Models (2306.06891v1)
Abstract: Generating intermediate steps, or Chain of Thought (CoT), is an effective way to significantly improve language models' (LMs) multi-step reasoning capability. However, CoT lengths can grow rapidly with problem complexity, easily exceeding the maximum context size. Instead of increasing the context limit, which has already been heavily investigated, we explore an orthogonal direction: making LMs divide a problem into multiple contexts. We propose a new inference framework, called Recursion of Thought (RoT), which introduces several special tokens that the models can output to trigger context-related operations. Extensive experiments with multiple architectures including GPT-3 show that RoT dramatically improves LMs' inference capability, enabling them to solve problems whose solutions consist of hundreds of thousands of tokens.
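The abstract describes the mechanism only at a high level: the model emits special tokens, and the inference framework reacts by opening and closing contexts. Below is a minimal illustrative sketch of how such a divide-and-conquer inference loop could work. The marker names (`GO`, `THINK`, `STOP`), the `solve` helper, and the `generate_one_token` callback are assumptions made for illustration, not the paper's exact interface.

```python
# Minimal sketch of a Recursion-of-Thought style inference loop (assumed
# interface, not the paper's exact implementation). Assumed markers:
#   GO    -- opens a (sub-)problem
#   THINK -- asks the framework to solve the most recent sub-problem
#            in a brand-new context
#   STOP  -- ends the current context
# `generate_one_token(context)` stands in for any LM call that returns the
# next token given the current context; it is hypothetical.

from typing import Callable, List

Token = str
GenerateFn = Callable[[List[Token]], Token]

def solve(problem: List[Token], generate_one_token: GenerateFn,
          max_tokens: int = 10_000) -> List[Token]:
    """Solve `problem` in its own context, recursing whenever THINK appears."""
    context: List[Token] = list(problem)
    for _ in range(max_tokens):
        token = generate_one_token(context)
        if token == "STOP":
            # Everything generated after the original problem is the answer.
            return context[len(problem):]
        if token == "THINK":
            # Treat the tokens after the most recent GO as a sub-problem
            # (we assume the model always writes GO before THINK), solve it
            # in a fresh context, and splice the answer back in its place.
            start = len(context) - 1 - context[::-1].index("GO")
            sub_problem = context[start + 1:]
            sub_answer = solve(sub_problem, generate_one_token, max_tokens)
            context = context[:start] + sub_answer
        else:
            context.append(token)
    raise RuntimeError("token budget exceeded")
```

The point of this structure is that each sub-problem is solved in its own freshly created context, so no single context ever has to hold the full reasoning trace, even when the complete solution spans hundreds of thousands of tokens.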
Authors: Soochan Lee, Gunhee Kim