Test-case-free self-correction for LLM program debugging
Determine whether large language models can debug and self-correct programs using only intermediate runtime execution information without access to correctness-labeled test cases, i.e., establish the feasibility of test-case-free debugging for large language models.
References
It remains an open question in future study whether LLMs are able to do self-correct by simply looking at its intermediate execution without knowing whether the result is correct or not (a.k.a. test-case-free debugging).
— Debug like a Human: A Large Language Model Debugger via Verifying Runtime Execution Step-by-step
(2402.16906 - Zhong et al., 25 Feb 2024) in Section: Limitation