Achieving continuous improvements in expert iteration for neural theorem proving
Develop training procedures for neural theorem provers that achieve continuous performance improvements under expert iteration by effectively leveraging the unlimited feedback provided by formal verifiers, rather than experiencing diminishing returns after a few iterations.
References
It is still an open problem to obtain continuous improvements, leveraging the potentially unlimited feedback that a formal verifier can provide.
— Formal Mathematical Reasoning: A New Frontier in AI
(2412.16075 - Yang et al., 20 Dec 2024) in Recent Progress — Neural Theorem Proving: Expert Iteration (Section 3.2)