- The paper introduces γ-soft coloring and PRS to achieve parallel exact uniform sampling of proper graph colorings.
- It decomposes the global coloring task into smaller, independent subgraphs of size O(log n) for efficient concurrent resolution.
- Empirical results show up to two orders of magnitude improvement with bounded γ-levels, ensuring robust performance for dense graphs.
Introduction and Motivation
The paper addresses the challenge of perfect, exact uniform sampling of proper k-colorings on general graphs with n vertices, maximum degree Δ, and coloring constraint k≥f(Δ), aiming for practical, parallelizable algorithms. Traditional approaches such as Coupling From The Past (CFTP) [PW96, MH96] are fundamentally sequential, posing a significant barrier to exploiting parallel computation architecture. Partial rejection sampling (PRS) [GJL17] offers inherent parallelizability but degenerates into naïve rejection sampling when applied directly to coloring due to the lack of any always-'safe' states. The current work overcomes this barrier by introducing a novel "soft coloring" relaxation, yielding the first genuinely parallelizable, practical PRS-based method for exact uniform sampling of proper colorings on general graphs.
γ-Soft Coloring: Framework and Algorithmic Structure
The central innovation is the definition of γ-soft coloring: each vertex is augmented with a continuous auxiliary variable Uv∼Unif(0,1). New "passive states" are defined such that Uv≤γdv for vertex v of degree dv, ensuring that certain vertices cannot ever become 'bad' (i.e., their coloring will not lead to a coloring constraint violation). This property was unattainable in standard, ‘hard’ coloring formulations. The parameter n0 interpolates between the trivial measure (n1; unconstrained) and the hard coloring constraint (n2; exact proper coloring).
The sampling approach combines PRS applied at a chosen n3-level, iteratively reducing n4 whenever a configuration is not yet proper, with recursive or hybrid decompositions of the coloring space. Each iteration constructs a resampling set expanded from bad vertices only through non-passive vertices, which are proved to form small, independent connected components for appropriate n5. This enables genuinely concurrent resolution using any existing (potentially sequential) perfect coloring sampler restricted to subgraphs of size n6.
Theoretical Properties and Computational Complexity
The n7-soft PRS scheme's critical advantage is its decomposition of the global coloring task into independent subproblems of logarithmic size in n8. If a component solver requires n9 time for a subgraph of Δ0 vertices, the total time is reduced to Δ1 per level. Any future breakthrough in the component solver thus immediately yields a better global sampler via this framework. Applied to the celebrated BC20 CFTP method (best known for Δ2) [BC20], the hybrid achieves Δ3 expected time, versus direct CFTP's Δ4, an improvement by a factor of Δ5.
The hybrid can be recursively nested, using itself as the component solver. After Δ6 recursions, the coloring problem size is reduced to Δ7 per component, for any fixed Δ8. At the limit Δ9, the number of recursion levels, the total time is k≥f(Δ)0. If, as conjectured and strongly evidenced by simulation results, the number of k≥f(Δ)1 levels k≥f(Δ)2 is k≥f(Δ)3 independent of k≥f(Δ)4, this grants linear-time, scalable, parallel exact uniform sampling for general graphs.
Numerical Results and Strong Claims
The empirical results highlight:
- With the hybrid scheme, for dense graphs (k≥f(Δ)5), the number of effective k≥f(Δ)6-levels (k≥f(Δ)7) remains bounded, generally between 1 and 20, invariant with k≥f(Δ)8.
- The majority of PRS work is completed in a small window of k≥f(Δ)9-values directly above the percolation threshold.
- The size of resampling components stabilizes at γ0, enabling scalable parallelization.
- When using CFTP or NRS as component solvers, the hybrid PRS uses up to two orders of magnitude fewer resampling steps than direct rejection sampling or non-hybrid PRS, even for dense graphs and large γ1.
- For large γ2, convergence is rapid, and the method remains robust as γ3 approaches the critical coloring threshold.
The paper conjectures that γ4 can be bounded independently of γ5—if true, this implies γ6 time parallelizable perfect coloring, an unprecedented outcome for such a general class of graphs.
Structural Implications and Open Questions
The main practical implication is that this PRS-based approach is inherently parallelizable and modular. The method automatically benefits from any advances in sequential perfect sampling for small graphs and is directly applicable on high-performance computing systems. Theoretically, it synthesizes the Lovász Local Lemma's local independence structure and percolation theory with Moser-Tardos-type PRS, establishing a new approach to uniform sampling for hard combinatorial constraints.
The crucial theoretical open problem is the uniform boundedness of the number of necessary γ7-levels γ8 as γ9 and, equivalently, whether, above the percolation threshold, PRS finds a proper coloring with positive probability uniformly in γ0 and γ1, for γ2 (for some constant γ3). The simulation results strongly support a positive answer.
Prospects and Future Developments
Potential directions include:
- Rigorous confirmation of bounded γ4, yielding an unconditional linear-time parallel perfect sampler.
- Extension of the method to other constraint satisfaction and Markov random field models.
- Optimization of the γ5-sequence or component selection to accelerate practical convergence.
- Leveraging the modular nature for hybrid parallel-distributed architectures in large-scale scientific, AI, or statistical applications.
The work connects advances in local-to-global decomposable sampling, percolation-theoretic cutoff analysis, and algorithmic Lovász Local Lemma into a unified and practical framework for a fundamental problem in both theoretical computer science and AI.
Conclusion
This paper introduces a PRS-based parallel algorithm for perfect uniform γ6-coloring sampling on general graphs, enabled by the concept of γ7-soft coloring. The method achieves logarithmic-scale decomposition of the sampling problem and strong runtime improvements, with simulations supporting an unprecedented path toward linear-time, parallel uniform coloring sampling. If the main conjecture holds, the algorithm would resolve a central question at the intersection of randomized algorithms, sampling, and combinatorics. A Python implementation is publicly available, facilitating further investigation and practical adoption.
References:
- (2604.03947)
- [BC20]
- [GJL17]
- [MH96]
- [PW96]
- [FGY22]
- [JSS21]