LifelongAgentBench: Evaluating LLM Agents as Lifelong Learners (2505.11942v3)
Abstract: Lifelong learning is essential for intelligent agents operating in dynamic environments. Current LLM-based agents, however, remain stateless and unable to accumulate or transfer knowledge over time. Existing benchmarks treat agents as static systems and fail to evaluate lifelong learning capabilities. We present LifelongAgentBench, the first unified benchmark designed to systematically assess the lifelong learning ability of LLM agents. It provides skill-grounded, interdependent tasks across three interactive environments (Database, Operating System, and Knowledge Graph), with automatic label verification, reproducibility, and modular extensibility. Extensive experiments reveal that conventional experience replay has limited effectiveness for LLM agents due to irrelevant information and context length constraints. We further introduce a group self-consistency mechanism that significantly improves lifelong learning performance. We hope LifelongAgentBench will advance the development of adaptive, memory-capable LLM agents.
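The abstract names a group self-consistency mechanism without detailing it. The sketch below shows one plausible reading under explicit assumptions: stored experiences are partitioned into groups, the model is prompted once per group so each prompt stays short, and a majority vote over the per-group proposals picks the final action. The function name `group_self_consistency`, the `llm` callable, and the prompt format are all hypothetical, not taken from the paper.

```python
import random
from collections import Counter

def group_self_consistency(llm, task_prompt, experiences, num_groups=4, seed=0):
    """Select an action by majority vote over per-group LLM queries.

    Hypothetical reading of the abstract's mechanism: partition past
    experiences into groups, query the model once per group, then vote
    across the groups' proposed actions.
    """
    pool = list(experiences)
    random.Random(seed).shuffle(pool)
    # Round-robin split of the experience pool into num_groups groups.
    groups = [pool[i::num_groups] for i in range(num_groups)]

    candidates = []
    for group in groups:
        if not group:  # skip empty groups when experiences are scarce
            continue
        # Only one group's experiences per prompt, sidestepping the
        # context-length constraint the abstract attributes to naive replay.
        context = "\n".join(f"Past experience: {e}" for e in group)
        candidates.append(llm(f"{context}\n\nTask: {task_prompt}\nAction:").strip())

    if not candidates:  # no stored experiences yet: fall back to a bare query
        return llm(f"Task: {task_prompt}\nAction:").strip()

    action, _ = Counter(candidates).most_common(1)[0]
    return action

if __name__ == "__main__":
    # Stub model for illustration: always proposes the same SQL action.
    stub_llm = lambda prompt: "SELECT name FROM sqlite_master WHERE type='table';"
    replay_buffer = ["solved task A with SELECT ...", "solved task B with SHOW ..."]
    print(group_self_consistency(stub_llm, "List all tables", replay_buffer))
```

Under these assumptions, bounding each prompt to one group's experiences keeps context short, while the vote dilutes the influence of any single irrelevant experience, the other failure mode the abstract attributes to plain replay.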