2000 character limit reached
Prover Agent: An Agent-based Framework for Formal Mathematical Proofs (2506.19923v1)
Published 24 Jun 2025 in cs.AI and cs.LG
Abstract: We present Prover Agent, a novel AI agent for automated theorem proving that integrates LLMs with a formal proof assistant, Lean. Prover Agent coordinates an informal reasoning LLM, a formal prover model, and feedback from Lean while also generating auxiliary lemmas to assist in discovering the overall proof strategy. It achieves an 86.1% success rate on the MiniF2F benchmark, establishing a new state-of-the-art among methods using small LLMs (SLMs) with a much lower sample budget than previous approaches. We also present case studies illustrating how these generated lemmas contribute to solving challenging problems.