Dice Question Streamline Icon: https://streamlinehq.com

Achieving continuous improvements in expert iteration for neural theorem proving

Develop training procedures for neural theorem provers that achieve continuous performance improvements under expert iteration by effectively leveraging the unlimited feedback provided by formal verifiers, rather than experiencing diminishing returns after a few iterations.

Information Square Streamline Icon: https://streamlinehq.com

Background

Expert iteration alternates between solving new theorems and fine-tuning on the resulting proofs, but current gains tend to plateau after several iterations.

The authors identify the unmet goal of sustained improvement using formal environments’ potentially unlimited feedback, framing a key algorithmic open problem for future provers.

References

It is still an open problem to obtain continuous improvements, leveraging the potentially unlimited feedback that a formal verifier can provide.

Formal Mathematical Reasoning: A New Frontier in AI (2412.16075 - Yang et al., 20 Dec 2024) in Recent Progress — Neural Theorem Proving: Expert Iteration (Section 3.2)