WESE: Weak Exploration to Strong Exploitation for LLM Agents (2404.07456v1)

Published 11 Apr 2024 in cs.AI and cs.MA

Abstract: Recently, LLMs have demonstrated remarkable potential as an intelligent agent. However, existing researches mainly focus on enhancing the agent's reasoning or decision-making abilities through well-designed prompt engineering or task-specific fine-tuning, ignoring the procedure of exploration and exploitation. When addressing complex tasks within open-world interactive environments, these methods exhibit limitations. Firstly, the lack of global information of environments leads to greedy decisions, resulting in sub-optimal solutions. On the other hand, irrelevant information acquired from the environment not only adversely introduces noise, but also incurs additional cost. This paper proposes a novel approach, Weak Exploration to Strong Exploitation (WESE), to enhance LLM agents in solving open-world interactive tasks. Concretely, WESE involves decoupling the exploration and exploitation process, employing a cost-effective weak agent to perform exploration tasks for global knowledge. A knowledge graph-based strategy is then introduced to store the acquired knowledge and extract task-relevant knowledge, enhancing the stronger agent in success rate and efficiency for the exploitation task. Our approach is flexible enough to incorporate diverse tasks, and obtains significant improvements in both success rates and efficiency across four interactive benchmarks.

PDF HTML Abstract

Summarize PDF Markdown Bookmark Chat (Pro)

References (30)

Authors (8)

Xu Huang (56 papers)
Weiwen Liu (59 papers)
Xiaolong Chen (86 papers)
Xingmei Wang (7 papers)
Defu Lian (142 papers)
Yasheng Wang (91 papers)
Ruiming Tang (171 papers)
Enhong Chen (242 papers)

Citations (1)

View on Semantic Scholar

Tweets

https://twitter.com/SolidReturnLda/status/1778819393876791394

https://twitter.com/realmofresearch/status/1779330285051715923

WESE: Weak Exploration to Strong Exploitation for LLM Agents (2404.07456v1)

Related Papers

Tweets