Effectiveness of inference-time scaling with neural verifiers on advanced mathematics
Determine whether inference-time scaling of natural-language mathematical large language models by combining search with neural verifiers to mitigate hallucinated reasoning yields effective performance on advanced mathematical problems beyond the pre-college or AIME level.
References
While this approach has gained traction, its effectiveness on advanced mathematical problems is an open question.
— Formal Mathematical Reasoning: A New Frontier in AI
(2412.16075 - Yang et al., 20 Dec 2024) in Introduction (Section 1)