
Experiential Co-Learning of Software-Developing Agents (2312.17025v3)

Published 28 Dec 2023 in cs.CL, cs.AI, cs.LG, and cs.SE

Abstract: Recent advancements in LLMs have brought significant changes to various domains, especially through LLM-driven autonomous agents. A representative scenario is in software development, where LLM agents demonstrate efficient collaboration, task division, and assurance of software quality, markedly reducing the need for manual involvement. However, these agents frequently perform a variety of tasks independently, without benefiting from past experiences, which leads to repeated mistakes and inefficient attempts in multi-step task execution. To this end, we introduce Experiential Co-Learning, a novel LLM-agent learning framework in which instructor and assistant agents gather shortcut-oriented experiences from their historical trajectories and use these past experiences for future task execution. The extensive experiments demonstrate that the framework enables agents to tackle unseen software-developing tasks more effectively. We anticipate that our insights will guide LLM agents towards enhanced autonomy and contribute to their evolutionary growth in cooperative learning. The code and data are available at https://github.com/OpenBMB/ChatDev.

Overview of "Experiential Co-Learning of Software-Developing Agents"

The paper "Experiential Co-Learning of Software-Developing Agents" explores an innovative framework designed to overcome one of the persistent limitations of LLMs in the domain of autonomous agents: the lack of integration of past experiences into current task-solving processes. The research introduces Experiential Co-Learning (ECL), a multi-agent paradigm that enhances collaborative task solving by leveraging experiential knowledge. This approach involves instructor and assistant agents that engage in mutual reasoning through accumulated experiences, thereby enhancing the efficiency and effectiveness of tackling complex tasks, particularly in software development.

Core Components of Experiential Co-Learning

The ECL framework is structured around three integral modules: co-tracking, co-memorizing, and co-reasoning. Each of these modules plays a crucial role in refining the task-solving capabilities of language agents by building upon previous experiences.

  1. Co-Tracking Module: This module focuses on creating procedural trajectories for various training tasks. By engaging in interactive rehearsals, instructor and assistant agents collaboratively explore and document their interactions, which form the backbone of their historical knowledge base.
  2. Co-Memorizing Module: This component is dedicated to extracting "shortcuts" from historical trajectories. These shortcuts represent efficient solutions derived from external feedback, which are then stored in the agents' experience pools. By interleaving these experiential insights, the module facilitates the development of improved reasoning strategies.
  3. Co-Reasoning Module: Utilizing the collective experience pools, this module enhances the collaborative interaction between agents when confronted with new tasks. By revisiting past experiences, the agents can provide more refined instructions and responses, thereby improving their overall problem-solving effectiveness.
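The interplay of the three modules can be sketched in a few lines of code. This is a conceptual illustration only: the class names, the reward threshold, and the toy word-overlap retriever are all assumptions for exposition, not the paper's actual implementation (which uses LLM agents and learned retrieval).

```python
# Conceptual sketch of the ECL pipeline: co-tracking produces trajectories,
# co-memorizing distills high-reward steps into "shortcuts", and co-reasoning
# retrieves them for new tasks. All names here are illustrative, not the
# paper's actual API.
from dataclasses import dataclass, field

@dataclass
class Step:
    instruction: str   # instructor's message
    response: str      # assistant's message (e.g., a code draft)
    reward: float      # external feedback score for this step

@dataclass
class ExperiencePool:
    shortcuts: list = field(default_factory=list)

    def memorize(self, trajectory, threshold=0.8):
        """Co-memorizing: keep only high-reward (instruction, response)
        pairs from a trajectory as reusable shortcuts."""
        for step in trajectory:
            if step.reward >= threshold:
                self.shortcuts.append((step.instruction, step.response))

    def retrieve(self, query):
        """Co-reasoning: return the shortcut whose instruction overlaps most
        with the new task (toy word-overlap similarity standing in for a
        real retriever)."""
        def overlap(text):
            return len(set(query.lower().split()) & set(text.lower().split()))
        if not self.shortcuts:
            return None
        return max(self.shortcuts, key=lambda s: overlap(s[0]))

# Co-tracking: a rehearsed trajectory recorded from a past training task.
trajectory = [
    Step("write a CLI todo app in Python", "import argparse ...", reward=0.9),
    Step("add unit tests", "def test_add(): ...", reward=0.4),
]

pool = ExperiencePool()
pool.memorize(trajectory)                              # keeps only the 0.9 step
hint = pool.retrieve("build a Python CLI calculator")  # reused on a new task
```

In this toy form, only the high-reward step survives into the pool, and the retrieved shortcut would then be injected into the agents' prompts when they face a related unseen task.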

Methodological Insights

The methodology of the paper is grounded in the application of LLMs within autonomous agents, where these agents operate in specific roles during task execution. The ECL framework capitalizes on the inherent strengths of LLMs, such as their contextual understanding and pattern recognition capabilities, while addressing their shortcomings in experience retention and application. Through the structured decomposition of tasks and interactive instruction-response dynamics, the ECL framework enhances the agents' autonomy, thereby minimizing reliance on human intervention.
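The instruction-response dynamic described above can be pictured as a simple loop in which each turn is conditioned on a retrieved past experience. The loop structure, stub agents, and `retrieve` callable below are hypothetical stand-ins for the paper's LLM-driven instructor and assistant, not its actual interface.

```python
# A minimal sketch of the co-reasoning interaction loop: instructor and
# assistant alternate turns, each conditioned on a retrieved shortcut when
# one is available. The agent and retriever interfaces are assumptions for
# illustration, not the paper's API.
def co_reason(task, instructor, assistant, retrieve, max_turns=5):
    context = []
    for _ in range(max_turns):
        hint = retrieve(task)                         # past shortcut, may be None
        instruction = instructor(task, context, hint)
        if instruction is None:                       # instructor ends the task
            break
        response = assistant(instruction, context, hint)
        context.append((instruction, response))
    return context

# Stub agents standing in for LLM calls.
def toy_instructor(task, context, hint):
    steps = ["outline the program", "write the code", "review the code"]
    return steps[len(context)] if len(context) < len(steps) else None

def toy_assistant(instruction, context, hint):
    suffix = f" (guided by: {hint})" if hint else ""
    return f"done: {instruction}{suffix}"

dialogue = co_reason("build a calculator", toy_instructor, toy_assistant,
                     retrieve=lambda task: "reuse argparse skeleton")
```

The key point the sketch conveys is that retrieved experience enters on every turn, shaping both the instructions issued and the responses produced, rather than being consulted once at the start.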

Evaluation and Results

The paper presents an empirical evaluation on the NLDD dataset, a curated collection of natural-language-to-software generation challenges. The experimental results underscore the substantial improvements in task-solving autonomy achieved by the ECL framework, which outperforms existing systems such as GPT-Engineer, MetaGPT, and ChatDev. Specifically, the ECL framework demonstrates notable gains on key dimensions including task completeness, executability, and consistency with the natural language requirements. The autonomy metric, a holistic measure combining these factors, highlights the framework's capacity to handle complex software development tasks with reduced manual oversight.

Implications and Future Prospects

The implications of the ECL framework are multifaceted, extending beyond its immediate application in software development. The integration of experiential co-learning offers a promising avenue for enhancing the generalization capabilities of autonomous agents across various domains, facilitating the development of more adaptive and resilient AI systems. From a theoretical standpoint, the paper contributes to the burgeoning discourse on the role of experiential learning in AI, presenting a paradigm that aligns with human-like learning processes.

Looking forward, the research points to several potential avenues for further development, including the refinement of heuristic reward design in co-memorizing modules, the exploration of more nuanced consistency metrics, and the expansion of experiential learning to more diverse and complex task environments. As LLMs and AI systems continue to evolve, the concepts and methodologies elucidated in this paper may serve as foundational building blocks for future advancements in the field of autonomous agents and beyond.

Authors (12)
  1. Chen Qian
  2. Yufan Dang
  3. Jiahao Li
  4. Wei Liu
  5. Weize Chen
  6. Cheng Yang
  7. Zhiyuan Liu
  8. Maosong Sun
  9. Zihao Xie
  10. Yifei Wang
  11. Xin Cong
  12. Xiaoyin Che