TaskGen: A Task-Based, Memory-Infused Agentic Framework using StrictJSON (2407.15734v1)

Published 22 Jul 2024 in cs.AI and cs.MA

Abstract: TaskGen is an open-sourced agentic framework which uses an Agent to solve an arbitrary task by breaking them down into subtasks. Each subtask is mapped to an Equipped Function or another Agent to execute. In order to reduce verbosity (and hence token usage), TaskGen uses StrictJSON that ensures JSON output from the LLM, along with additional features such as type checking and iterative error correction. Key to the philosophy of TaskGen is the management of information/memory on a need-to-know basis. We empirically evaluate TaskGen on various environments such as 40x40 dynamic maze navigation with changing obstacle locations (100% solve rate), TextWorld escape room solving with dense rewards and detailed goals (96% solve rate), web browsing (69% of actions successful), solving the MATH dataset (71% solve rate over 100 Level-5 problems), Retrieval Augmented Generation on NaturalQuestions dataset (F1 score of 47.03%)

Citations (1)

View on Semantic Scholar

Summary

The paper introduces TaskGen, a framework that decomposes complex tasks into manageable subtasks using Equipped Functions and hierarchical agents.
It integrates a memory management system with Shared Memory and Global Context to improve task accuracy and reduce computational redundancy.
The framework achieves high empirical performance, including a 100% solve rate in dynamic mazes and 96% in text-based puzzles through precise output parsing with StrictJSON.

Analysis of TaskGen: A Task-Based, Memory-Infused Agentic Framework

The paper presents "TaskGen," a novel open-sourced agentic framework designed to address the complexity of task decomposition and efficient execution in AI systems. The framework introduces a modular and structured approach to handling complex tasks using Equipped Functions and Inner Agents, reinforced by a memory management system to optimize performance.

Core Contributions

This paper positions TaskGen as a framework capable of decomposing complex tasks into efficient and actionable subtasks. The distinct contributions include:

Task Decomposition: TaskGen employs LLMs to break down complex tasks into smaller, manageable subtasks, each allocated to specific Equipped Functions or Inner Agents. This modular breakdown aids in improving task execution efficiency.
Memory Management: The framework integrates a memory system utilizing Shared Memory and Global Context to store critical task execution data. This aids in reducing redundant computing processes and improving the accuracy of future computations by allowing access to historical decision-making data.
StrictJSON: TaskGen employs StrictJSON for output parsing, which is more concise and structured than traditional JSON schemas. By reducing verbosity, TaskGen achieves lower token usage and increased processing speed, which is integral for real-time applications.
Hierarchical Agents: The use of Inner Agents under a Meta Agent is emphasized, promoting the distribution of workloads. This hierarchical structure ensures that tasks are executed by the most contextually appropriate agent, enhancing efficiency and accuracy.

Empirical Evaluations

The authors provide empirical assessments in diverse environments to demonstrate TaskGen’s robustness:

Dynamic Maze Navigation: The framework achieves a 100% solve rate in navigation tasks within a 40x40 dynamic maze, outperforming conventional learning algorithms and underscoring TaskGen's adaptability and planning prowess.
TextWorld Escape Room: TaskGen achieves a solve rate of 96% in goal-directed text-based puzzles, indicating its effective use of memory and context to inform decision-making better than simple LLM-based approaches.
Web Browsing Tasks: It successfully completes 69% of web browsing tasks, showcasing the framework's utility in real-world scenarios that require context extraction and interaction with digital environments.
MATH Dataset: TaskGen demonstrates substantial improvement in solving complex mathematical problems, achieving an accuracy of 71% on Level-5 problems, indicating the efficacy of its Equipped Functions in computational tasks.
Retrieval Augmented Generation (RAG): Applied to the NaturalQuestions dataset, TaskGen shows a significant F1 score improvement, recording 47.03%, demonstrating its effectiveness in dynamically refining context for more accurate question answering.

Theoretical and Practical Implications

TaskGen advances theoretical foundations in agentic AI by integrating structured task decomposition, efficient memory management, and adaptable agent hierarchies. Practically, TaskGen offers a framework that can be adopted for a broad spectrum of applications, from automated web assistant roles to intricate problem-solving scenarios in scientific computation and data retrieval systems.

This framework challenges existing paradigms by emphasizing a hybrid approach that nests both rule-based strategies and LLM-driven decision-making dynamics. Notably, the careful management of cognitive load through Shared Memory systems echoes principles from cognitive science concerning memory and task context.

Future Directions

The paper sets the ground for future explorations including enhanced planning algorithms integrating state-based graphs, advanced memory abstraction techniques, and implementation of multi-modal support. Additionally, they aim to explore collaborative agent environments, where multi-agent interaction could leverage diverse decision-making heuristics for enriched learning outcomes.

In conclusion, TaskGen offers a meaningful contribution to agentic frameworks by streamlining task execution through intelligent decomposition and strategic memory use. Its performance in varied empirical scenarios highlights its potential as a robust framework for future AI applications. TaskGen's development trajectory aims to further integrate AI as a versatile agent within computational ecosystems, presenting rich opportunities for research and application in intelligent systems design.

PDF Markdown

Related Papers

Tweets

https://twitter.com/aiagentsglobalc/status/1815672272864079909

YouTube

Show All Videos