Maximum Independent Set Problem

Updated 7 February 2026

The Maximum Independent Set problem is defined on an undirected graph where no two selected vertices share an edge, serving as a fundamental NP-complete optimization challenge.
Exact algorithms leverage measure-and-conquer strategies and refined reduction rules to achieve exponential-time improvements on graphs with bounded degrees.
Heuristic and learning-augmented methods, including quantum implementations, provide scalable approaches for tackling large-scale and real-world MIS instances.

The Maximum Independent Set (MIS) problem is a canonical NP-complete optimization problem central to combinatorics, theoretical computer science, discrete optimization, and network analysis. Given a finite undirected graph $G = (V, E)$ , the objective is to find a largest independent set: a subset $S \subseteq V$ such that no two vertices in $S$ share an edge, maximizing $|S|$ . The NP-hardness and structural richness of MIS have spawned algorithmic, complexity-theoretic, and application-driven research spanning exact algorithms, heuristics, relaxations, parameterized techniques, distributed methods, and physical computation platforms.

1. Formal Definition and Structural Properties

Let $G = (V, E)$ be a simple undirected graph. An independent set is a subset $S \subseteq V$ such that

$\forall u,v \in S, \ (u,v) \notin E.$

The MIS problem is to compute

$\alpha(G) = \max\left\{ |S| : S \subseteq V, \ \forall u,v \in S, \ (u,v) \notin E \right\}.$

MIS can be formulated as an integer program: $\begin{aligned} \max\quad & \sum_{j=1}^n x_j \ \text{s.t.}\quad & x_i + x_j \leq 1,\ \forall (i, j) \in E, \ & x_j \in \{0, 1\}\ \forall j. \end{aligned}$ Here $x_j = 1$ indicates vertex $j$ is included in the independent set.

2. Exact Algorithms and Complexity

MIS is among Karp’s original set of NP-complete problems. The best published exponential-time algorithm achieves worst-case complexity $O^*(1.1996^n)$ in polynomial space (Xiao et al., 2013). For graphs of small average degree, refined measure-and-conquer branching combined with strong reduction enables improvements:

Average degree ≤ 3: $O^*(1.0854^n)$
Average degree ≤ 4: $O^*(1.1571^n)$
Average degree ≤ 5: $O^*(1.1969^n)$
Average degree ≤ 6: $O^*(1.2149^n)$ (0901.1563)

Key reduction rules include folding, domination, separator-based decomposition, unconfined vertex removal, and edge-based branching. Hierarchical frameworks lift degree-bounded algorithms to general graphs via recursion on higher-degree vertices or "short" neighborhood connections (Xiao et al., 2013).

3. Heuristic and Metaheuristic Approaches

For large-scale or real-world instances where exact computation is infeasible, heuristic schemes dominate. The ARIR (adaptive restart with inference and reduction) framework (Zhu et al., 2022) is a modern state-of-the-art algorithm, combining:

Kernelization: Applying advanced exact rules (e.g., VCsolver) to reduce the instance.
Local search: ARW $(1,2)$ -swap with restarts upon stagnation.
Recurrent evaluation and adaptive restart: Monitoring progress and triggering search restarts on reduced subgraphs with increasing probability under stagnation.
Sampling-driven inference: Repeated sampling of current solutions, fixing vertices found in all samples, and eliminating them and their neighbors.

Empirically, ARIR-II (using VCsolver) outperforms previous heuristics on 85%+ of real-world benchmark instances under modest time budgets (Zhu et al., 2022). However, solutions are not always certifiably optimal, and approximation guarantees are primarily empirical.

4. Exact and Convex Relaxation-Based Mathematical Programming

MIS admits a natural LP and convex programming relaxation:

LP relaxations: Allow $x_j \in [0, 1]$ , relax the integrality. Edge constraints and a total-size equality $\sum_j x_j = k$ (for fixed $k$ ) induce polyhedral structure, but half-integrality gaps can persist (Manyem, 2022).
CP(k) approach (Manyem, 2022): Introduces a separable convex function $f(x)$ , designed via bin-packing inequalities, minimizing $\sum_j f(x_j)$ subject to the MIS constraints. By carefully tuning the convex objective, fractional solutions are discouraged, increasing likelihood of integral optima at feasible $k$ . Partial solution seeding improves convergence for challenging instances.

Continuous differentiable formulations using quadratic objectives (resembling a two-layer neural network) have emerged (Alkhouri et al., 2024): $f(x) = -\mathbf{e}^T x + \frac{\gamma}{2} x^T A_G x - \frac{1}{2} x^T A_{G'} x, \quad x \in [0, 1]^n,$ with $A_G$ the adjacency matrix and $A_{G'}$ its complement. Parallel momentum-based gradient descent, efficient MIS-checking criteria, and diverse initializations yield competitive large-scale solutions and practical scaling $O(n)$ on GPUs.

5. Specialized Structures: Geometric, String, and Fractal Graph Instances

In geometric settings, the complexity of MIS varies:

B₁-EPG graphs: NP-hard to approximate within better than constant factors; 4-approximation algorithm exists; PTAS is possible when path segments are bounded in length; fixed-parameter tractable for three-shape restrictions (Bougeret et al., 2015).
Outerstring graphs: Dynamic programming yields an $O(n^3)$ time algorithm for constant-complexity strings; grounded y-monotone paths admit $O(n^2)$ -time solution; SETH-based lower bounds rule out truly subquadratic algorithms (Bose et al., 2019).
Fractal structures: In the pseudofractal scale-free web $\mathcal{G}_n$ , the independence number is $\alpha_n = 3^{n-1}$ , with a unique MIS. For the Sierpiński gasket $\mathcal{S}_n$ , $\alpha_n = (3^{n-1} + 3)/2$ and the number of MISs grows exponentially as $2^{(3^{n-2}-1)/2}$ , highlighting the profound impact of symmetry and local structure on MIS count and solution structure (Shan et al., 2018).

6. Distributed and Learning-Augmented Algorithms

Distributed Algorithms

MIS has been extensively studied in distributed and local computation models:

Randomized distributed algorithms: Classic results yield global convergence in $O(\log n)$ rounds (Luby, Alon-Babai-Itai), with local optimality (per-node termination) in $O(\log \deg(v)+\log 1/\epsilon)$ rounds (Ghaffari, 2015).
Anonymous beeping models: Without topology or identifiers, local convergence requires $\Omega(\sqrt{n/\log n})$ rounds unless symmetry-breaking relaxations are added (e.g., upper bound on $n$ , wake-on-beep, sender-side collision detection, or synchronized clocks), enabling $O(\log^2 n)$ – $O(\log^3 n)$ time maximal independent set (not necessarily maximum) solutions (Afek et al., 2012).

Learning-Augmented Algorithms

Recent advances show substantial approximation improvements when augmented with oracular predictions from ML models:

If given an oracle (persistent or non-persistent) that predicts vertex MIS-membership correctly with probability $1/2 + \varepsilon$ , one can achieve $\tilde{O}(\sqrt{\Delta}/\varepsilon)$ - or constant-factor approximations in $\tilde{O}(m)$ time, breaking the traditional NP-hard-to-approximate barrier of $n^{1-\delta}$ for any $\delta > 0$ (Braverman et al., 2024).

7. Physical and Quantum Implementations, Hard Instance Construction

MIS maps directly to Ising Hamiltonians, making it a prime target for quantum annealing and neutral-atom quantum simulators. For example:

King’s Lattice and Unit-Disk Graphs: Rydberg-atom arrays naturally encode hard instances via blockade constraints, mapping atom positions to graph vertices and blockade radii to edges (Kim et al., 2023, Cazals et al., 6 Feb 2025).
- Quantum experiments on up to 141 atoms on King’s lattice deliver large-scale solution datasets; performance is currently limited by coherence, repetition rates, and graph structure.
- For more challenging (“hard”) unit-disk graphs (moderate-to-high density; large treewidth), classical branch-and-cut (CPLEX) is exponentially hard; quantum speedup requires scaling to ~1000 atoms and kHz repetition (Cazals et al., 6 Feb 2025).
Hard deterministic instances: Explicit infinite families of graphs ( $G_p$ , inverse graphs) for which all standard local and LP-based algorithms (cycle–chain certificates) cannot certify tighter bounds than $\alpha(G_p)/|V| = 1/2$ , despite the true ratio being $<1/2$ . This demonstrates provable gaps between local algorithmic power and global integrality (Shiraishi et al., 2018).

8. Structural and Parameterized Insights

Parameterized and structural complexity have provided a spectrum of tractable and intractable frontiers:

MIS is fixed-parameter tractable by solution size on graphs with treewidth $w$ , with runtime $O(2^w w^{O(1)} n)$ , and for unit-disk graphs with bounded "thickness" $t$ , with complexity $O(t^2 2^{2t} n)$ (Cazals et al., 6 Feb 2025).
Highly symmetric constructions (e.g., Sierpiński gasket) yield exponential numbers of MISs; heterogeneity (e.g., unique high-degree hubs) can enforce uniqueness (Shan et al., 2018).

In summary, the Maximum Independent Set problem is a paradigmatic locus for the interplay of combinatorial structure, computational hardness, algorithm engineering, and physical implementation. While state-of-the-art heuristics (e.g., ARIR/ARES (Zhu et al., 2022), differentiable quadratic optimization (Alkhouri et al., 2024)) and advanced exact methods (Xiao et al., 2013, 0901.1563) push practical boundaries, fundamental hardness persists; yet, research continues to bridge worst-case complexity, realistic heuristics, and future quantum or learning-augmented computational frontiers.