Wu's Method can Boost Symbolic AI to Rival Silver Medalists and AlphaGeometry to Outperform Gold Medalists at IMO Geometry (2404.06405v2)
Abstract: Proving geometric theorems constitutes a hallmark of visual reasoning combining both intuitive and logical skills. Therefore, automated theorem proving of Olympiad-level geometry problems is considered a notable milestone in human-level automated reasoning. The introduction of AlphaGeometry, a neuro-symbolic model trained with 100 million synthetic samples, marked a major breakthrough. It solved 25 of 30 International Mathematical Olympiad (IMO) problems, whereas the reported baseline based on Wu's method solved only ten. In this note, we revisit the IMO-AG-30 Challenge introduced with AlphaGeometry and find that Wu's method is surprisingly strong. Wu's method alone can solve 15 problems, some of which are not solved by any of the other methods. This leads to two key findings: (i) Combining Wu's method with the classic synthetic methods of deductive databases and angle, ratio, and distance chasing solves 21 out of 30 problems using only a CPU-only laptop with a time limit of 5 minutes per problem. Essentially, this classic method solves just 4 fewer problems than AlphaGeometry and establishes the first fully symbolic baseline strong enough to rival the performance of an IMO silver medalist. (ii) Wu's method even solves 2 of the 5 problems that AlphaGeometry failed to solve. Thus, by combining AlphaGeometry with Wu's method, we set a new state of the art for automated theorem proving on IMO-AG-30, solving 27 out of 30 problems, making this the first AI method to outperform an IMO gold medalist.
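The headline figure of 27 follows from treating a problem as solved when at least one prover solves it. The sketch below reproduces that counting in Python. Only the counts (30 problems, 25 solved by AlphaGeometry, 2 of its 5 failures solved by Wu's method) come from the abstract; the problem indices and the `combine` helper are hypothetical placeholders and do not identify the actual IMO-AG-30 problems.

```python
# Illustrative only: the indices are placeholders, not real IMO-AG-30 problem IDs.
ALL_PROBLEMS = set(range(1, 31))           # 30 benchmark problems

# Counts are from the abstract; WHICH indices are solved is made up.
alphageometry_solved = set(range(1, 26))   # 25 solved, leaving 26..30 unsolved
wu_extra = {26, 27}                        # 2 of AlphaGeometry's 5 failures


def combine(*solved_sets):
    """Return the problems solved by at least one of the given provers."""
    combined = set()
    for solved in solved_sets:
        combined |= solved
    return combined


combined = combine(alphageometry_solved, wu_extra)
unsolved = ALL_PROBLEMS - combined

print(len(combined))  # 27
print(len(unsolved))  # 3
```

Under these assumptions the union contains 27 problems and 3 remain unsolved, matching the combined result stated in the abstract.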