A Self-Improving Coding Agent (2504.15228v2)

Published 21 Apr 2025 in cs.AI

Abstract: Recent advancements in LLMs have spurred interest in deploying LLM agents to undertake tasks in the world. LLMs are often deployed in agent systems: code that orchestrates LLM calls and provides them with tools. We demonstrate that an agent system, equipped with basic coding tools, can autonomously edit itself, and thereby improve its performance on benchmark tasks. We find performance gains from 17% to 53% on a random subset of SWE Bench Verified, with additional performance gains on LiveCodeBench, as well as synthetically generated agent benchmarks. Our work represents an advancement in the automated and open-ended design of agentic systems, and demonstrates a data-efficient, non gradient-based learning mechanism driven by LLM reflection and code updates.

Summary

We haven't generated a summary for this paper yet.

Summarize Now

Tweets

https://twitter.com/laurence_ai/status/1929537966138876030

https://twitter.com/wild_reptilians/status/1914700136464236984

YouTube

Show All Videos

A Self-Improving Coding Agent (2504.15228v2)

Summary

Related Papers

Tweets

YouTube