Automated Generation and Verification of Natural-Language Math Proofs
Establish reliable automated methodologies for generating and verifying natural-language mathematical proofs produced by large language models, ensuring that their correctness and completeness can be assessed with fidelity comparable to expert human grading.
References
Recent advances in LLMs for mathematical reasoning have largely focused on tasks with easily verifiable final answers; however, generating and verifying natural language math proofs remains an open challenge.
— Reliable Fine-Grained Evaluation of Natural Language Math Proofs
(2510.13888 - Ma et al., 14 Oct 2025) in Abstract (page 1)