- The paper presents a novel multi-agent framework, PlanGEN, that generates planning and reasoning trajectories using constraint, verification, and selection agents.
- It employs a dynamic algorithm selection mechanism based on a modified Upper Confidence Bound policy to adapt inference strategies according to instance complexity.
- Experimental results demonstrate state-of-the-art performance with superior Exact Match and accuracy scores on various benchmarks, validating its practical impact on complex problem solving.
PlanGEN: A Multi-Agent Framework for Generating Planning and Reasoning Trajectories for Complex Problem Solving
Introduction
The paper introduces PlanGEN, a model-agnostic multi-agent framework designed to generate effective planning and reasoning trajectories for complex problem solving. PlanGEN addresses limitations in current agent frameworks and inference-time algorithms that struggle with verifying generated plans and adapting to varying instance-level complexity. The proposed framework decomposes the planning task using three specialized agents: constraint, verification, and selection agents, each contributing to enhance the performance of inference-time algorithms within the framework. The experimental results demonstrate that PlanGEN achieves state-of-the-art (SOTA) results on several benchmarks, establishing its efficacy in handling complex planning problems (2502.16111).
Methodology
PlanGEN Framework
PlanGEN employs a multi-agent approach with three key components:
- Constraint Agent: Extracts instance-specific constraints such as task rules and environmental limits, laying the groundwork for verification and selection processes.
- Verification Agent: Uses a constraint-guided iterative verification process, assigning reward scores to plans based on their adherence to constraints. This agent is critical for improving the quality of inference-time decisions.
- Selection Agent: Dynamically selects the most suitable inference algorithm based on instance complexity using a modified Upper Confidence Bound (UCB) policy, enhancing adaptability to complex problems.
PlanGEN integrates these agents with three inference-time algorithms: Best of N, Tree-of-Thought (ToT), and REBASE, allowing it to achieve superior performance across various problem domains.
Figure 1: Schematic representation of PlanGEN (Mixture of Algorithms). An initial plan and constraints guide iterative plan refinement.
Experimental Results
PlanGEN was evaluated on multiple benchmarks, significantly outperforming existing baselines. The experimental setup included tasks from NATURAL PLAN, OlympiadBench, GPQA, and DocFinQA, assessed using metrics such as Exact Match (EM) and accuracy.
Analysis and Discussion
The analysis reinforces the importance of each component in PlanGEN:
- Verification Agent plays a crucial role in differentiating successful and unsuccessful plans by assigning precise reward values based on constraint adherence.
- Selection Agent effectively adapts to varying problem complexities by selecting appropriate algorithms, underscoring the significance of instance-specific evaluation in complex planning scenarios.
A notable finding is the superior performance of the Mixture of Algorithms framework, demonstrating how dynamic algorithm selection significantly enhances problem-solving efficiency.
Conclusion
PlanGEN establishes a novel approach to planning and reasoning by leveraging multi-agent collaboration and adaptive inference-time strategies. The framework's ability to achieve SOTA across diverse benchmarks validates its design and efficacy. Future developments may explore integrating reinforcement learning for dynamic strategy optimization and extending its applicability to multi-modal and multi-lingual contexts, broadening its impact in AI research (2502.16111).