PatternBoost: Constructions in Mathematics with a Little Help from AI (2411.00566v1)
Abstract: We introduce PatternBoost, a flexible method for finding interesting constructions in mathematics. Our algorithm alternates between two phases. In the first local'' phase, a classical search algorithm is used to produce many desirable constructions. In the secondglobal'' phase, a transformer neural network is trained on the best such constructions. Samples from the trained transformer are then used as seeds for the first phase, and the process is repeated. We give a detailed introduction to this technique, and discuss the results of its application to several problems in extremal combinatorics. The performance of PatternBoost varies across different problems, but there are many situations where its performance is quite impressive. Using our technique, we find the best known solutions to several long-standing problems, including the construction of a counterexample to a conjecture that had remained open for 30 years.
- Sum of squares of sum of squares function r2(n)subscript𝑟2𝑛r_{2}(n)italic_r start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_n ). https://math.stackexchange.com/questions/322086/sum-of-squares-of-sum-of-squares-function-r-2n?rq=1. Accessed: 2024-09-27.
- Triangles in the integer grid [n]×[n]delimited-[]𝑛delimited-[]𝑛[n]\times[n][ italic_n ] × [ italic_n ]. https://mathweb.ucsd.edu/~asuk/solymosi.pdf. Accessed: 2024-09-27.
- Global Lyapunov functions: a long-standing open problem in mathematics, with symbolic transformers, 2024.
- On partitions of discrete boxes. Discrete mathematics, 257(2-3):255–258, 2002.
- Brandon Amos. Tutorial on amortized optimization, 2023.
- OptNet: Differentiable optimization as a layer in neural networks, 2021.
- Improved bounds for cross-Sperner systems. arXiv preprint arXiv:2302.02516, 2023.
- A note on the random greedy independent set algorithm. Random Structures & Algorithms, 49(3):479–502, 2016.
- Estimates for representation numbers of quadratic forms. 2006.
- An improvement to the Kelley-Meka bounds on three-term arithmetic progressions. arXiv preprint arXiv:2309.02353, 2023.
- Edge deletion preserving the diameter of the hypercube. Discrete Applied Mathematics, 63(1):91–95, 1995.
- Roger W Brockett. Dynamical systems that sort lists, diagonalize matrices, and solve linear programming problems. Linear Algebra and its applications, 146:79–91, 1991.
- Language models are few-shot learners. arXiv preprint arXiv:2005.14165, 2020.
- Pattern-avoiding (0, 1)-matrices. arXiv preprint arXiv:2005.00379, 2020.
- François Charton. Linear algebra with transformers, 2022.
- Neural optimization machine: A neural network approach for optimization, 2022.
- Neural ordinary differential equations, 2019.
- George Cybenko. Approximation by superpositions of a sigmoidal function. Mathematics of control, signals and systems, 2(4):303–314, 1989.
- Advancing mathematics by guiding human intuition with AI. Nature, 2021.
- A neural network based approach to mechanical design optimization. Engineering Optimization, 20(3):187–203, 1992.
- Convergence rates for ordinal embedding. arXiv preprint arXiv:1904.12994, 2019.
- Hypercube subgraphs with minimal detours. Journal of Graph Theory, 23(2):119–128, 1996.
- The history of degenerate (bipartite) extremal graph problems. In Erdős centennial, pages 169–264. Springer, 2013.
- Saturating Sperner families. Graphs and Combinatorics, 29(5):1355–1364, 2013.
- Timothy Gowers. Turán’s theorem for triangles. https://wtgowers.github.io/human-style-atp/2023/01/11/turan.html. Accessed: 2024-09-29.
- Changing and unchanging the diameter of a hypercube. Discrete Applied Mathematics, 37:265–274, 1992.
- Gurobi Optimization, LLC. Gurobi Optimizer Reference Manual, 2024.
- Gaussian Error Linear Units (GELUs), 2023.
- Computing with neural circuits: A model. Science, 233(4764):625–633, 1986.
- Kurt Hornik. Approximation capabilities of multilayer feedforward networks. Neural networks, 4(2):251–257, 1991.
- Andrej Karpathy. Makemore. https://github.com/karpathy/makemore. Accessed: 2024-09-29.
- Finite algebras of finite complexity. Discrete mathematics, 207(1-3):89–135, 1999.
- Deep learning for symbolic mathematics. arXiv preprint arXiv:1912.01412, 2019.
- Decomposing the complete r-graph. Journal of Combinatorial Theory, Series A, 154:21–31, 2018.
- Teaching arithmetic to small transformers, 2023.
- Saturation of k𝑘kitalic_k-chains in the Boolean lattice. arXiv preprint arXiv:2402.14113, 2024.
- Finding increasingly large extremal graphs with AlphaZero and Tabu search. arXiv preprint arXiv:2311.03583, 2023.
- On saturated k𝑘kitalic_k-Sperner systems. arXiv preprint arXiv:1402.5646, 2014.
- Bounded degree spanners of the hypercube. arXiv preprint arXiv:1910.09868, 2019.
- Investigating the limitations of transformers with simple arithmetic tasks. arXiv preprint arXiv:2102.13019, 2021.
- OEIS Foundation Inc. The On-Line Encyclopedia of Integer Sequences, 2024. Published electronically at http://oeis.org.
- Language models are unsupervised multitask learners. 2019.
- Mathematical discoveries from program search with large language models. Nature, 625(7995):468–475, 2024.
- Michael Saks. Kleitman and combinatorics. Discrete mathematics, 257(2-3):225–247, 2002.
- Torsten Thiele. Geometric selection problems and hypergraphs. PhD thesis, Citeseer, 1995.
- LLaMA: Open and Efficient Foundation Language Models. arXiv preprint arXiv:2302.13971, 2023.
- Solving olympiad geometry without human demonstrations. Nature, 625(7995):476–482, 2024.
- Attention is all you need. In Advances in Neural Information Processing Systems, pages 6000–6010, 2017.
- Adam Zsolt Wagner. Constructions in combinatorics via neural networks. arXiv preprint arXiv:2104.14516, 2021.
- Chai Wah Wu. Counting the number of isosceles triangles in rectangular regular grids. arXiv preprint arXiv:1605.00180, 2016.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.