Assured LLM-Based Software Engineering (2402.04380v1)

Published 6 Feb 2024 in cs.SE

Abstract: In this paper we address the following question: How can we use LLMs to improve code independently of a human, while ensuring that the improved code (1) does not regress the properties of the original code, and (2) improves the original in a verifiable and measurable way? To address this question, we advocate Assured LLM-Based Software Engineering (Assured LLMSE), a generate-and-test approach inspired by Genetic Improvement. Assured LLMSE applies a series of semantic filters that discard code that fails to meet these twin guarantees. This overcomes the potential problem of LLMs' propensity to hallucinate, and allows us to generate code using LLMs independently of any human. The human plays the role only of final code reviewer, as they would with code written by other human engineers. This paper is an outline of the content of the keynote by Mark Harman at the International Workshop on Interpretability, Robustness, and Benchmarking in Neural Software Engineering, Monday 15th April 2024, Lisbon, Portugal.
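
The generate-and-test idea in the abstract can be illustrated with a minimal sketch. This is not the paper's implementation: the LLM is replaced by a stub, and every name here (`propose_candidates`, `passes_regression_filter`, `runtime`, `assured_improve`) is hypothetical. Output equality on sample inputs stands in for the "does not regress" guarantee and wall-clock time stands in for the "measurable improvement" guarantee; both are assumptions chosen to keep the example small.

```python
import timeit
from typing import Callable, Iterable, List, Optional

Candidate = Callable[[int], int]


def original(n: int) -> int:
    """Original code to be improved: sums 0..n with a loop."""
    total = 0
    for i in range(n + 1):
        total += i
    return total


def propose_candidates() -> List[Candidate]:
    """Stand-in for an LLM (assumption): returns rewrites, one of them wrong."""
    def hallucinated(n: int) -> int:
        return n * n // 2          # plausible-looking but incorrect

    def closed_form(n: int) -> int:
        return n * (n + 1) // 2    # correct and faster

    return [hallucinated, closed_form]


def passes_regression_filter(candidate: Candidate, reference: Candidate,
                             inputs: Iterable[int]) -> bool:
    """Filter 1: the candidate must agree with the original on every sample input."""
    try:
        return all(candidate(x) == reference(x) for x in inputs)
    except Exception:
        return False


def runtime(fn: Candidate, arg: int = 20_000, repeats: int = 5) -> float:
    """Illustrative metric: best-of-N wall-clock time for a fixed workload."""
    return min(timeit.repeat(lambda: fn(arg), number=10, repeat=repeats))


def assured_improve(reference: Candidate, candidates: Iterable[Candidate],
                    inputs: Iterable[int],
                    measure: Callable[[Candidate], float] = runtime
                    ) -> Optional[Candidate]:
    """Return the best candidate that passes both filters, or None."""
    samples = list(inputs)
    best, best_cost = None, measure(reference)
    for cand in candidates:
        if not passes_regression_filter(cand, reference, samples):
            continue               # discard regressions and hallucinations
        cost = measure(cand)
        if cost < best_cost:       # Filter 2: must measurably beat the best so far
            best, best_cost = cand, cost
    return best


if __name__ == "__main__":
    winner = assured_improve(original, propose_candidates(), range(50))
    print(winner.__name__ if winner else "no assured improvement found")
```

Only candidates that survive both filters would be passed on to a human reviewer; when none survive, the sketch returns `None` and the original code is kept.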
