- The paper introduces HiAgent, a hierarchical framework that uses subgoals to segment and summarize working memory for long-horizon tasks.
- It reports a twofold increase in success rate, while reducing the average number of task steps by 3.8 and cutting context length by 35.02%.
- The method integrates trajectory retrieval and proactive memory replacement, highlighting its potential for robotics and interactive systems.
Detailed Summary of "HiAgent: Hierarchical Working Memory Management for Solving Long-Horizon Agent Tasks with LLM"
Introduction
The paper "HiAgent: Hierarchical Working Memory Management for Solving Long-Horizon Agent Tasks with LLM" (2408.09559) presents HiAgent, a framework designed to improve the performance of LLM-based agents on long-horizon tasks. Such agents generate actions from environmental observations, and their effectiveness depends heavily on memory mechanisms that store past action-observation sequences. The paper identifies a gap in current research: in-trial memory (working memory) has received far less optimization than cross-trial memory. HiAgent addresses this gap by managing working memory hierarchically, using subgoals as memory chunks.
Hierarchical Working Memory Management
HiAgent's methodology departs from the standard paradigm by borrowing a human problem-solving strategy: using subgoals to keep cognitive load manageable. Existing systems typically feed the entire sequence of past actions and observations into the LLM at every step, which is inefficient and redundant, especially in long-horizon tasks. HiAgent instead prompts the LLM to formulate subgoals, each of which serves as a memory chunk. Once a subgoal is achieved, the agent summarizes the corresponding chunk, so that only the relevant action-observation pairs are retained for future decision-making.
Figure 1: Working memory management in LLM-based agents, showing the standard strategy (top) and the HiAgent strategy (bottom).
The paper reports that this leaner working memory yields a twofold increase in success rate across the evaluated tasks, demonstrating HiAgent's robustness and efficiency.
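To make the contrast between the two strategies concrete, here is a minimal sketch of how the prompt context could be assembled in each case. The `Chunk` class, the two rendering functions, and the sample household task are hypothetical and are not taken from the paper; the sketch only assumes that a finished subgoal can be represented by a short summary in place of its full action-observation pairs.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class Chunk:
    """One subgoal plus the action-observation pairs gathered while pursuing it."""
    subgoal: str
    pairs: list = field(default_factory=list)  # list of (action, observation)
    summary: Optional[str] = None              # set once the subgoal is finished


def flat_context(chunks):
    """Standard strategy: every action-observation pair stays in the prompt."""
    lines = []
    for chunk in chunks:
        for action, observation in chunk.pairs:
            lines.append(f"Action: {action}\nObservation: {observation}")
    return "\n".join(lines)


def hierarchical_context(chunks):
    """Subgoal-based strategy: finished chunks collapse to their summary."""
    lines = []
    for chunk in chunks:
        lines.append(f"Subgoal: {chunk.subgoal}")
        if chunk.summary is not None:
            lines.append(f"Summary: {chunk.summary}")
        else:
            for action, observation in chunk.pairs:
                lines.append(f"Action: {action}\nObservation: {observation}")
    return "\n".join(lines)


# Hypothetical trajectory: the first subgoal is finished, the second is ongoing.
chunks = [
    Chunk(
        subgoal="find the mug",
        pairs=[
            ("go to kitchen", "You are in the kitchen. You see a counter, a sink, and a cabinet."),
            ("open cabinet", "The cabinet is open. Inside you see a mug and two plates."),
            ("take mug", "You pick up the mug."),
        ],
        summary="The mug was in the kitchen cabinet and is now in hand.",
    ),
    Chunk(subgoal="fill the mug with water", pairs=[("go to sink", "You are at the sink.")]),
]

print(len(flat_context(chunks)), len(hierarchical_context(chunks)))
```

In this toy case the hierarchical context is shorter because three action-observation pairs are replaced by one summary line; the gap would be expected to widen as more subgoals are completed.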
Methodological Details
HiAgent interleaves subgoal generation with executable actions and uses summarization to trim working memory. The process is as follows:
- Subgoal Generation: LLMs are prompted to establish subgoals, each considered a memory chunk.
- Action Execution and Summarization: Upon achieving a subgoal, the associated action-observation pairs are summarized, compacting the working memory efficiently.
- Proactive Memory Management: The LLM proactively decides when to replace detailed action-observation pairs in working memory with summarized observations, based on the current subgoal.
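The three steps above can be sketched as a single control loop. This is an illustrative outline under stated assumptions, not the authors' implementation: `policy`, `summarize`, and `env_step` are hypothetical callables standing in for LLM prompts and an environment, and the sketch assumes that the appearance of a new subgoal signals that the previous one is finished.

```python
def run_episode(task, policy, summarize, env_step, max_steps=30):
    """Run one trial while keeping working memory organized by subgoal.

    policy(task, memory)      -> (subgoal, action)    # hypothetical LLM call
    summarize(subgoal, pairs) -> summary string       # hypothetical LLM call
    env_step(action)          -> (observation, done)  # hypothetical environment
    """
    memory = []  # list of chunks, one per subgoal
    for _ in range(max_steps):
        subgoal, action = policy(task, memory)

        if not memory or memory[-1]["subgoal"] != subgoal:
            # Assumption: a new subgoal means the previous one is finished.
            if memory:
                previous = memory[-1]
                previous["summary"] = summarize(previous["subgoal"], previous["pairs"])
                # Move the detailed pairs out of working memory but keep them
                # archived so they could still be looked up later.
                previous["archive"] = previous["pairs"]
                previous["pairs"] = []
            memory.append({"subgoal": subgoal, "pairs": [], "summary": None, "archive": []})

        observation, done = env_step(action)
        memory[-1]["pairs"].append((action, observation))
        if done:
            break
    return memory
```

With scripted stand-ins for the three callables, the returned `memory` holds a summary for each finished subgoal and detailed pairs only for the subgoal that was active when the episode ended.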
Figure 2: An overview of the HiAgent process, illustrating subgoal-based hierarchical memory management.
Further, HiAgent incorporates a trajectory retrieval module that retrieves detailed past trajectories on demand, giving the agent the flexibility to revisit specifics when they are needed for decision-making.
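One way to picture on-demand retrieval is a context renderer that shows summaries by default and expands selected finished subgoals back into their full trajectories. The function name, the dictionary layout, and the `expanded` parameter below are hypothetical; the summary above does not describe how the paper's retrieval module is triggered or formatted.

```python
def render_context(chunks: list, expanded: frozenset = frozenset()) -> str:
    """Render working memory, expanding the chunk indices listed in `expanded`."""
    lines = []
    for idx, chunk in enumerate(chunks):
        lines.append(f"[{idx}] Subgoal: {chunk['subgoal']}")
        finished = chunk["summary"] is not None
        if finished and idx not in expanded:
            # Default view of a finished subgoal: summary only.
            lines.append(f"    Summary: {chunk['summary']}")
            continue
        # Ongoing subgoal, or a finished one whose details were requested.
        for action, observation in chunk["pairs"]:
            lines.append(f"    Action: {action} -> Observation: {observation}")
    return "\n".join(lines)


# Hypothetical memory: subgoal 0 is finished, subgoal 1 is ongoing.
memory = [
    {
        "subgoal": "find the mug",
        "summary": "Mug taken from the kitchen cabinet.",
        "pairs": [("open cabinet", "You see a mug."), ("take mug", "You hold the mug.")],
    },
    {
        "subgoal": "fill the mug",
        "summary": None,
        "pairs": [("go to sink", "You are at the sink.")],
    },
]

print(render_context(memory))                  # compact view
print(render_context(memory, frozenset({0})))  # subgoal 0 expanded on demand
```

Keeping the detailed pairs in storage and expanding them only when requested lets the default context stay short without losing information permanently.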
Experimental Framework
The effectiveness of HiAgent is validated across five long-horizon tasks, with significant improvements in success rate, progress rate, and execution efficiency. The experiments show that HiAgent reduces the average number of steps to task completion by 3.8, cuts context length by 35.02%, and reduces run time by 19.42%.
Impact and Future Directions
HiAgent's results on long-horizon tasks suggest potential applicability in domains that require complex decision-making and memory management, especially robotics and interactive systems. Its hierarchical memory framework could inspire further work on autonomous systems that emulate human-like problem solving through efficient memory management.
Conclusion
HiAgent advances memory management for LLM-based agents by structuring working memory hierarchically around subgoals, improving both efficiency and effectiveness on long-horizon tasks. The paper contributes a new perspective on working memory for LLM-based agents and offers a promising approach to enhancing the cognitive capabilities of autonomous agents.

Figure 3: Progress rate at different steps, showing HiAgent's consistency and effectiveness relative to the standard strategy.