CigaR: Cost-efficient Program Repair with LLMs (2402.06598v2)

Published 9 Feb 2024 in cs.SE and cs.LG

Abstract: Large language models (LLMs) have proven to be effective at automated program repair (APR). However, using LLMs can be costly, with companies invoicing users by the number of tokens. In this paper, we propose CigaR, the first LLM-based APR tool that focuses on minimizing the repair cost. CigaR works in two major steps: generating a first plausible patch and multiplying plausible patches. CigaR optimizes the prompts and the prompt setting to maximize the information given to LLMs using the smallest possible number of tokens. Our experiments on 429 bugs from the widely used Defects4J and HumanEval-Java datasets show that CigaR reduces the token cost by 73%. On average, CigaR spends 127k tokens per bug while the baseline uses 467k tokens per bug. On the subset of bugs that are fixed by both, CigaR spends 20k tokens per bug while the baseline uses 608k tokens, a cost saving of 96%. Our extensive experiments show that CigaR is a cost-effective LLM-based program repair tool that uses a low number of tokens to automatically generate patches.
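The abstract describes a two-step, budget-conscious control flow: search for a first plausible patch with terse prompts, then multiply plausible patches, stopping as early as possible to limit token spending. The sketch below illustrates that control flow in Python under stated assumptions; `query_llm` and `passes_tests` are hypothetical placeholders standing in for an LLM API call and a test runner, and none of this is the actual CigaR implementation.

```python
# Minimal sketch of a cost-aware, two-step LLM repair loop in the spirit of
# the abstract. The LLM call and test runner are hypothetical placeholders.
import itertools
from dataclasses import dataclass

_counter = itertools.count(1)  # makes the placeholder LLM return distinct patches


@dataclass
class RepairBudget:
    """Tracks token spending so the loop can stop as soon as possible."""
    max_tokens: int
    spent: int = 0

    def charge(self, tokens: int) -> None:
        self.spent += tokens

    def exhausted(self) -> bool:
        return self.spent >= self.max_tokens


def query_llm(prompt: str) -> tuple[str, int]:
    """Hypothetical placeholder for an LLM API call.

    Returns a candidate patch and the number of tokens consumed
    (prompt + completion); a real tool would call a provider API here.
    """
    patch = f"return a + b; // candidate fix #{next(_counter)}"
    return patch, len(prompt.split()) + len(patch.split())


def passes_tests(patch: str) -> bool:
    """Hypothetical placeholder: run the previously failing tests on the patch."""
    return "a + b" in patch


def repair(buggy_code: str, failing_test: str, budget: RepairBudget,
           target_plausible: int = 3) -> list[str]:
    """Phase 1: find a first plausible patch with a terse prompt.
    Phase 2: multiply plausible patches by asking for variations of it."""
    plausible: list[str] = []

    # Phase 1: stop as soon as one patch passes the failing test.
    while not plausible and not budget.exhausted():
        prompt = f"Fix the bug:\n{buggy_code}\nFailing test:\n{failing_test}"
        patch, tokens = query_llm(prompt)
        budget.charge(tokens)
        if passes_tests(patch):
            plausible.append(patch)

    # Phase 2: generate alternatives, stopping at the target or when the budget runs out.
    while plausible and len(plausible) < target_plausible and not budget.exhausted():
        prompt = f"Give an alternative fix equivalent to:\n{plausible[0]}"
        patch, tokens = query_llm(prompt)
        budget.charge(tokens)
        if passes_tests(patch) and patch not in plausible:
            plausible.append(patch)

    return plausible


if __name__ == "__main__":
    budget = RepairBudget(max_tokens=2_000)
    patches = repair("return a - b;", "assert add(1, 2) == 3", budget)
    print(f"{len(patches)} plausible patch(es), {budget.spent} tokens spent")
```

The early exits in both phases are what drive the cost behavior reported in the abstract: every call is charged against the budget, and the loop stops the moment a patch is plausible or enough candidates exist, rather than exhausting a fixed number of attempts.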

Authors (4)
  1. Dávid Hidvégi (1 paper)
  2. Khashayar Etemadi (12 papers)
  3. Sofia Bobadilla (6 papers)
  4. Martin Monperrus (155 papers)
Citations (12)
