Papers
Topics
Authors
Recent
2000 character limit reached

StageSAT: Staged FP Satisfiability Solver

Updated 15 January 2026
  • StageSAT is a solver for quantifier-free floating-point formulas that integrates numerical optimization with soundness guarantees, ensuring bit-exact IEEE-754 solutions.
  • It uses a three-stage approach—projection-aided objectives, ULP-scale constraint encoding, and lattice refinement—to progressively align real-valued optimization with true FP semantics.
  • Empirical evaluations indicate superior recall, precision, and speed over competing solvers, with no false SAT models across diverse benchmark suites.

StageSAT is a satisfiability solver for quantifier-free floating-point (FP) formulas that integrates numerical optimization with soundness guarantees. Unlike traditional SMT-based solvers, StageSAT conceptualizes floating-point satisfiability as a sequence of real-valued optimization problems with progressively finer alignment to the semantics of IEEE-754 arithmetic, culminating in bit-exact solution validation. The method is fundamentally "black-box": all FP operations are evaluated dynamically, eschewing explicit case analysis, bit-blasting, or SMT encodings. Empirical data demonstrates StageSAT's superior recall, precision, and speed over state-of-the-art numeric and bit-precise alternatives in extensive benchmark competitions (Zhang et al., 8 Jan 2026).

1. Problem Formulation and Three-Stage Optimization Process

StageSAT addresses input formulas C\mathcal{C} in conjunctive normal form (CNF), partitioning constraints into linear equalities L\mathcal{L} (Ax=bA x = b over Rd\mathbb{R}^d) and a remainder N\mathcal{N} covering non-linear and inequality literals. The search proceeds through three stages:

  1. Projection-Aided Squared Objective (S1S_1): Uses the Moore–Penrose pseudoinverse (AAT)†(A A^T)^\dagger to realize the orthogonal projection onto the affine linear manifold:

PL(x)=xāˆ’AT(AAT)†(Axāˆ’b)P_{\mathcal{L}}(x) = x - A^T (A A^T)^\dagger (A x - b)

The squared Euclidean distance to L\mathcal{L} is ∄xāˆ’PL(x)∄22\|x - P_{\mathcal{L}}(x)\|_2^2. Non-linear and inequality literals are encoded via squared residuals in the clause structure:

Distā–”(Ļ•;x)=āˆā„“āˆˆĻ•{h(x)2,ℓ≔h(x)=0Ā (max⁔{0,f(x)āˆ’g(x)})2,ℓ≔f(x)≤g(x)\mathrm{Dist}_{\square}(\phi; x) = \prod_{\ell \in \phi} \begin{cases} h(x)^2, & \ell \equiv h(x) = 0 \ (\max\{0, f(x) - g(x)\})^2, & \ell \equiv f(x) \leq g(x) \end{cases}

Combining, the objective is

S1(x)=∄xāˆ’PL(x)∄22+āˆ‘Ļ•āˆˆCNDistā–”(Ļ•;x)S_1(x) = \|x - P_{\mathcal{L}}(x)\|_2^2 + \sum_{\phi \in \mathcal{C}_{\mathcal{N}}} \mathrm{Dist}_{\square}(\phi; x)

This structure leverages exact geometric distance to the solution manifold for the linear part and ensures predictable descent behavior.

  1. Squared-ULP Objective (S2S_2): Encodes constraints at the granularity of IEEE-754 unit-in-the-last-place (ULP) steps. For each literal ā„“\ell and point xx,

dulp(ā„“;x)={ULP(f(x),g(x)),ℓ≔f(x)=g(x)Ā 0,ℓ≔f(x)≤g(x)Ā (holds)Ā min⁔{ULP(y,g(x))∣y>g(x)},ℓ≔f(x)≤g(x)Ā (violated)d_{\mathrm{ulp}}(\ell; x) = \begin{cases} \mathrm{ULP}(f(x), g(x)), & \ell \equiv f(x) = g(x) \ 0, & \ell \equiv f(x) \leq g(x) \text{ (holds)} \ \min\{\mathrm{ULP}(y, g(x)) \mid y > g(x)\}, & \ell \equiv f(x) \leq g(x) \text{ (violated)} \end{cases}

The clause distance is the squared product over literals, and the total optimization objective is

S2(x)=āˆ‘Ļ•āˆˆCāˆā„“āˆˆĻ•[dulp(ā„“;x)]2S_2(x) = \sum_{\phi \in \mathcal{C}} \prod_{\ell \in \phi} [d_{\mathrm{ulp}}(\ell; x)]^2

S2(x)=0S_2(x) = 0 if and only if xx is a model of C\mathcal{C}, making S2S_2 a representing function for the formula.

  1. nn-ULP Lattice Refinement (S3S_3): Starting from the best continuous candidate x2āˆ—x_2^* after Stage 2, the search is refined along the IEEE-754 lattice by introducing integer offsets n∈Zdn \in \mathbb{Z}^d:

S3(n)=āˆ‘Ļ•āˆˆCāˆā„“āˆˆĻ•[dulp(ā„“;nULP(n,x2āˆ—))]2,S_3(n) = \sum_{\phi \in \mathcal{C}} \prod_{\ell \in \phi} [d_{\mathrm{ulp}}(\ell; \mathrm{nULP}(n, x_2^*))]^2,

where nULP(n,x)\mathrm{nULP}(n, x) denotes shifting xx by nn ULPs per coordinate. The search is carried out within a bounded region, and S3(n)=0S_3(n) = 0 guarantees a bit-exact model.

2. Theoretical Properties

StageSAT admits several notable theoretical guarantees for both correctness and descent behavior:

  • Partial Monotone Descent: For xx and x′x' satisfying all non-linear and inequality literals (N\mathcal{N}), any strict decrease S1(x′)<S1(x)S_1(x') < S_1(x) is equivalent to a strict reduction in Euclidean distance to L\mathcal{L}. If RR is a descent function, then R(x′)<R(x)ā€…ā€ŠāŸ¹ā€…ā€Šd(x′,M)<d(x,M)R(x') < R(x) \implies d(x',\mathcal{M}) < d(x,\mathcal{M}) for the true model set M={x∣x⊨C}\mathcal{M} = \{x \mid x \models \mathcal{C}\}.
  • Stall Avoidance via Geometric Projection: Direct use of the exact projection term eliminates optimizer stalling due to zero gradient in tangential directions; any step reducing ∄xāˆ’PL(x)∄\|x - P_{\mathcal{L}}(x)\| moves toward the solution manifold.
  • Representing-Function Soundness: Both S2S_2 and S3S_3 are representing functions; thus, S2(x)=0S_2(x) = 0 or S3(n)=0S_3(n) = 0 is sufficient and necessary for model existence, eliminating the possibility of false SATs.

3. Implementation Details

  • Black-Box FP Oracle: Every FP operation (addition, multiplication, transcendental functions) appears as a runtime oracle call. No bit-level decomposition, case analysis, or IEEE-flags are handled explicitly.
  • Projection Calculation: The matrix pseudoinverse is computed once per formula, after which each projection is O(d2)O(d^2) in variable count. For formulas without linear constraints, this stage is bypassed.
  • ULP Distance and Lattice Search: ULP computations and nn-ULP lattice navigation are performed by casting floats to integer bit patterns, incrementing/decrementing, and re-casting—an O(1)O(1) operation per dimension with exact IEEE-754 semantics.
  • Optimization: All stages use established black-box optimizers, such as basinhopping and CRS2, leveraging real-valued objectives without reference to bit-level circuit representations.

4. Comparative Empirical Evaluation

StageSAT has been evaluated on the SMT-COMP’25 QF_FP suites across small, middle, and large benchmark categories, and on previously identified challenging cases from tools such as XSat, Grater, and JFS.

Summary of Empirical Performance

Metric StageSAT Best Prior Numeric Solver Best Complete SMT Solver
SAT-Recall 99.4% (345/347) e.g., XSat: 12 SAT on large suite 39/49 (MathSAT-Large)
False SAT Rate 0% Nonzero for prior tools 0%
Median Speedup 5–10Ɨ – –
Max Speedup 20Ɨ – –
Coverage (large) 45/49 ≤60% (goSAT, Grater, JFS) 39/49

StageSAT solved strictly more formulas under identical time budgets than any competing solver, with no spurious SAT models. It outperforms XSat (24 SAT vs. 12 on large files), and surpasses goSAT, Grater, and JFS in both recall (92.3% vs. ≤60%) and soundness (0% spurious SATs). Ablation confirms all three stages are critical to optimum performance and reliability.

5. Methodological Distinctions

Distinct from traditional SMT solvers, StageSAT does not construct bit-blasted or explicit Boolean encodings. Instead, it operates entirely through floating-point evaluations and geometric objectives. This permits direct handling of complex arithmetic expressions (including black-box transcendental functions) and mixed-precision tasks, without specialized abstractions or loss of theoretical soundness.

Furthermore, the staged optimization approach—combining geometric manifold pursuit, ULP-scale minimization, and discrete lattice refinement—circumvents well-known pitfalls of non-monotonic objective landscapes in floating-point domains. The explicit use of representing functions ensures that each reported SAT model corresponds to a bit-exact IEEE-754 assignment; unsatisfiability is only guessed if exhaustive lattice refinement fails to close the gap around the best continuous candidate.

6. Benchmarks, Limitations, and Future Work

StageSAT's evaluation encompasses both standard competition benchmarks and challenging pathological instances that historically stymied numeric and SMT methods. Its design is empirically validated for large and nonlinear benchmarks as well as mixed-precision scenarios.

A plausible implication is that further extension of the staged paradigm, potentially adapting the lattice search to broader classes of discretized theories or integrating symbolic techniques for quantified or higher-order constraints, may extend StageSAT's applicability. Current limitations include reliance on black-box optimizers' ability to explore nonconvex spaces and handling only quantifier-free formulas with real-valued and floating-point atoms.

StageSAT demonstrates that staged optimization, grounded in geometric and IEEE-754-aligned objectives, can achieve both scalability and correctness previously unattainable for FP-satisfiability solvers (Zhang et al., 8 Jan 2026).

Definition Search Book Streamline Icon: https://streamlinehq.com
References (1)

Topic to Video (Beta)

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to StageSAT.