- The paper presents a formal Isabelle/HOL proof that constructs a universal pair (11, δ) to show the undecidability of Diophantine equations with bounded complexity.
- Key methodologies include the Coding Theorem, Lucas sequences related to Pell’s equation, and the Matiyasevich-Robinson lemma to combine complex Diophantine conditions.
- The research employs custom metaprogramming infrastructure to manage over 20,000 lines of formal proof, significantly advancing the library of verified mathematical results.
This paper (2505.16963) presents a formal verification in Isabelle/HOL of results concerning the complexity of Diophantine equations, specifically focusing on establishing a universal pair over the integers (∈). This work builds upon the known undecidability of Hilbert's Tenth Problem, which states there is no general algorithm to determine if an arbitrary Diophantine equation has integer solutions. The paper explores the problem when restricted to equations with bounded complexity, measured by the number of variables (ν) and total degree (δ). A pair (ν,δ) is considered "universal" if every Diophantine set (a set of parameters for which a Diophantine equation has solutions) can be represented by a polynomial within these complexity bounds. If such a pair is universal, the class of equations defined by these bounds is also undecidable.
The primary contribution is the formalization of a construction that demonstrates the universality of the pair $(11, \eta(\nu, \delta))_$, where η is a function of a known universal pair over natural numbers $(\nu, \delta)_$. This is the first formal derivation of a non-trivial universal pair over the integers, improving upon trivial bounds that would arise from simply converting natural number unknowns (like in the known $(9, \delta)_$ case) to integers using representations like Lagrange's four-square theorem.
The formal proof in Isabelle/HOL, totaling 20,000 lines, relies on extending existing libraries and formalizing concepts from number theory and computability theory. The core of the proof is the construction of a polynomial Q~ with 11 integer unknowns that is equivalent in solvability over the integers to an arbitrary Diophantine polynomial P with any number of natural number unknowns. This construction is broken down into several key stages:
- Coding Theorem: This theorem provides a mechanism to encode the solvability of a general Diophantine equation ν0 (with ν1 and ν2) into an equivalent Diophantine relation involving only the parameter ν3 and two new variables ν4. This relation involves checking if ν5 is a power of two and if a binomial coefficient ν6 is divisible by ν7, where ν8 and ν9 are polynomials in δ0. The formalization of this step uses Isabelle's library for digit expansions and requires formalizing properties of factorials and multinomial coefficients. The equivalence takes the form:
δ1, where δ2 are polynomials.
In Isabelle, this is represented within a locale
coding_theorem that manages the dependencies between the parameter a, variable f, and the generated polynomials:
$(11, \eta(\nu, \delta))_$9
The formal statement of the theorem's reverse direction involves checking the existence of g satisfying the range and divisibility conditions:
η0
- Lucas Sequences and Pell Equation (Bridge Theorem): The exponential and binomial coefficient conditions from the Coding Theorem are not directly elementary polynomial relations. The Bridge Theorem [(2505.16963), Theorem 2] translates these into conditions involving perfect squares, divisibility, and inequality, which are amenable to the Matiyasevich-Robinson relation combining technique. This translation relies heavily on the properties of Lucas sequences and their connection to solutions of Pell's equation δ3. The formalization defines Lucas sequences δ4 and δ5 recursively for natural numbers and extends them to integers.
η1
A key lemma formalized is that pairs δ6 are solutions to δ7:
η2
The Bridge Theorem itself involves a complex definition of auxiliary variables based on the inputs (δ8) and additional unknowns (δ9). These variables are constructed such that specific combinations of them must be perfect squares, satisfy a divisibility condition, and an inequality, if and only if (ν,δ)0 is a power of two and (ν,δ)1. The formalization structures these complex dependencies using a locale
bridge_variables.
- Relation Combining (Matiyasevich-Robinson Lemma): This technique, based on work by Matiyasevich and Robinson [MR75], shows that a conjunction of conditions like (ν,δ)2, (ν,δ)3, and (ν,δ)4 being perfect squares can be expressed as the existence of a single non-negative integer (ν,δ)5 such that (ν,δ)6, where (ν,δ)7 is a specific polynomial. The paper uses the (ν,δ)8 case. This lemma is crucial for consolidating the various Diophantine relations derived from the Bridge Theorem into a single polynomial equation.
η3
Combining these three steps allows the authors to prove an intermediate result (Theorem 3 (2505.16963)), stating that any Diophantine equation over natural numbers (ν,δ)9 is equivalent to the existence of 8 integer unknowns and 1 natural number unknown satisfying a polynomial equation $(11, \eta(\nu, \delta))_$0.
The final step to reach 11 integer unknowns utilizes the Three Squares Theorem ($(11, \eta(\nu, \delta))_$1). By substituting $(11, \eta(\nu, \delta))_$2 for the natural number unknown $(11, \eta(\nu, \delta))_$3 in $(11, \eta(\nu, \delta))_$4, they obtain the polynomial $(11, \eta(\nu, \delta))_$5, which has 11 integer unknowns and is equivalent in solvability to $(11, \eta(\nu, \delta))_$6 over integers (with $(11, \eta(\nu, \delta))_$7). This proves Theorem 11-unknowns (2505.16963), showing any Diophantine equation over natural numbers is equivalent to one with 11 integer unknowns.
The practical implementation aspect is less about building a runnable application and more about constructing a complex formal proof within a theorem prover. The authors developed custom metaprogramming infrastructure (\texttt{poly_extract}) to manage the large and complex polynomials arising from the many substitutions. This framework helps bridge the gap between abstract Isabelle terms and the formal int mpoly type, simplifying the definition and manipulation of these polynomials.
η4
A significant outcome of this project is the formalization of a substantial body of mathematical results in number theory and polynomial theory within Isabelle/HOL, contributing to the library of verified mathematics. The paper highlights the benefits of formalizing mathematics in-situ, i.e., in parallel with the mathematical research, as the proof assistant helped refine definitions, identify missing assumptions, and correct errors in the draft manuscript.
While the theorems themselves are fundamental results in computability and number theory with implications for the theoretical limits of algorithms, the practical application discussed in the paper is primarily the formal verification methodology and the development of reusable formal libraries. The proof demonstrates a complex, multi-step construction formalized rigorously, pushing the boundaries of what is formally verified. The explicit universal pair $(11, \eta(\nu, \delta))_$8 provides a concrete bound on the complexity threshold for undecidability of Diophantine equations over integers.