Papers
Topics
Authors
Recent
Search
2000 character limit reached

Cograph Deletion Problem

Updated 7 January 2026
  • Cograph Deletion is the NP-hard problem of removing at most k edges from a graph to eliminate all induced P4 paths, ensuring a P4-free (cograph) structure.
  • The approach leverages modular decomposition and module merges, transforming prime modules into merged units to simplify the graph’s structure.
  • Advanced techniques include exact, fixed-parameter, and branch-and-reduce algorithms, offering scalable, parameterized solutions for practical applications.

A cograph is a finite simple graph with no induced path on four vertices (P4P_4-free). The Cograph Deletion Problem asks, given a graph G=(V,E)G = (V,E) and integer k0k \ge 0, whether it is possible to delete at most kk edges from GG such that the resulting graph is a cograph. This classical graph modification problem is central in structural graph theory and finds applications in phylogenetics, the analysis of relational data, and network simplification. The following survey covers the precise statements, solution frameworks based on modular decomposition and module merges, algorithmic advances, fixed-parameter tractability, and recent theoretical breakthroughs.

1. Problem Definition and Basic Properties

The Cograph Deletion Problem is defined formally as follows. Given G=(V,E)G = (V,E) (an undirected simple graph) and kNk \in \mathbb{N}, decide if there exists DED \subseteq E with Dk|D| \le k such that the graph G=(V,ED)G' = (V, E \setminus D) contains no induced path P4P_4 (sequence of vertices v1,v2,v3,v4v_1,v_2,v_3,v_4 with edges v1v2,v2v3,v3v4v_1v_2,v_2v_3,v_3v_4 and no others, i.e., chordless path). The optimization variant asks for the smallest DD such that GDG \setminus D is a cograph.

Cographs are the graphs generated from the single-vertex graph under disjoint union and complement operations. Cographs are precisely those graphs whose modular decomposition lacks any prime nodes; equivalently, every internal node of their modular decomposition tree is labeled “parallel” (union/disconnected) or “series” (join/co-disconnected), but never “prime” (Hellmuth et al., 2015, Hellmuth et al., 2017).

The problem is NP-hard under standard reductions. In particular, Cograph Deletion (edge-deletion) is a special case of the general Cograph Editing problem, which is known to be NP-complete, and a direct reduction from Vertex Cover provides its NP-hardness (Hellmuth et al., 2015).

2. Modular Decomposition and Prime Modules

Modular decomposition is a canonical tool in the analysis and solution of cograph modification problems. A set MVM \subseteq V is a module of GG if for all x,yMx, y \in M and vVMv \in V \setminus M we have vv adjacent to xx if and only if vv is adjacent to yy. Trivial modules are singletons and VV itself. GG is prime if its only strong modules are trivial.

The modular decomposition tree (MDT) of GG is a rooted tree whose nodes correspond to strong modules, with leaves representing single vertices. Each internal node is labeled as:

  • “parallel” if the induced subgraph G[M]G[M] is disconnected,
  • “series” if the complement G[M]\overline{G[M]} is disconnected,
  • “prime” if both G[M]G[M] and its complement are connected.

The children of a module MM are the maximal strong submodules of MM. For any fixed graph, MDT(G)(G) can be computed in O(n+m)O(n + m) time (Hellmuth et al., 2015, Hellmuth et al., 2017). The decomposition allows divide-and-conquer for edge modification problems: one solves the problem independently on each prime strong module and their quotient graphs.

The following decomposition lemma is central: if every minimal forbidden induced subgraph is prime, then

opt(G)=opt(G/M)+MiMopt(G[Mi])\operatorname{opt}(G) = \operatorname{opt}(G/\mathcal{M}) + \sum_{M_i \in \mathcal{M}} \operatorname{opt}(G[M_i])

where M\mathcal{M} is any modular partition (Lafond et al., 5 Jan 2026).

3. Equivalence of P4P_4 Deletion and Module Merges

A fundamental insight is that all induced P4P_4s in GG are contained entirely in some prime strong module—a fact implied by the Folkman–Fulkerson property (Hellmuth et al., 2015). Rather than searching for and destroying P4P_4s individually, it is sufficient (and optimal) to “resolve” each prime module by merging certain children into larger modules until no prime structure remains.

The merge operation is defined as follows: let M1,,MpM_1, \ldots, M_p be disjoint modules in GG (children of a prime module MM). In GG', M1,,MpM_1, \ldots, M_p are merged into M:=i=1pMiM := \bigcup_{i=1}^p M_i if (i) each MiM_i and MM are modules of GG' but MM is not a module of GG (Hellmuth et al., 2015, Hellmuth et al., 2017). This corresponds to editing all edges between MM and VMV\setminus M required to synchronize their adjacencies and collapse M1,,MpM_1,\ldots,M_p into a larger module.

The equivalence theorem states that every optimal cograph deletion set can be realized as a sequence of pairwise module merges across prime modules, and every minimal sequence of such merges corresponds to a unique minimal edge-deletion solution (Hellmuth et al., 2015, Hellmuth et al., 2017). This equivalence underpins algorithmic strategies—solving globally difficult graph editing by orchestrating a local sequence of module merges.

4. Exact and Parameterized Algorithms

The module-merge perspective leads directly to an exact branching algorithm:

  • Compute the modular decomposition of GG.
  • Recurse on each prime module MM with children M1,,MM_1,\ldots, M_\ell.
  • For each unordered pair (i,j)(i,j), compute the minimum edge set DijD_{ij} whose deletion merges MiM_i and MjM_j (forcing twin structure).
  • Branch on deleting DijD_{ij}, and recurse with an updated budget kDijk-|D_{ij}|.
  • Repeat until all prime modules are eliminated.

This branching scheme explores all pairwise merges, guaranteed by key lemmas to capture all minimal solutions [(Hellmuth et al., 2015), Lemma 5] [(Hellmuth et al., 2017), Theorem 4.4]. The running time is O((2)kpoly(n))O((\ell^2)^k \cdot \text{poly}(n)) where \ell is the maximum number of children in any prime module. Since the process is recursive on disjoint modules and at most kk branches occur, the algorithm is fixed-parameter tractable (FPT) in kk. By reparameterizing by modular-width, one achieves O(n2(k!)22k)O(n^2 (k!)^2 2^k) for the editing problem; the same holds for deletion-only (Hellmuth et al., 2017).

In practice, the following heuristics are fast and almost always yield high-quality solutions:

  • O(n3)O(n^3) time: greedily merge the smallest-cost pair of modules at every step, updating adjacency tables.
  • O(n2)O(n^2) time: randomly select pairs to merge, maintain the quotient graph only, and realize edits after all merges (Hellmuth et al., 2017).

The correctness of these algorithms relies on the preservation of strong modules under edge deletions and the fact that all new strong modules in GG' correspond to unions of children of a prime module of GG (Hellmuth et al., 2015, Hellmuth et al., 2017).

5. Branch-and-Reduce and Bounded Search Tree Methods

Independent of modular decomposition, several FPT branching algorithms exploit forbidden subgraph relaxations, particularly P4P_4-sparse graphs. A P4P_4-sparse graph is one where every five-vertex induced subgraph contains at most one P4P_4. The seven forbidden subgraphs on five vertices (e.g., C5C_5, P5P_5, kite, pan, fork, co-4-pan, P5\overline{P_5}) provide short constant-size patterns to branch upon.

The basic procedure (Nastos et al., 2010, Tsur, 2019):

  • Recursively search for a forbidden pattern.
  • For each minimal edge-deletion set covering all P4P_4s in the pattern, branch, decrease budget, and repeat.
  • Once the graph is P4P_4-sparse, use the linear-time “spider” decomposition for a direct cograph completion.

The worst-case search tree for edge-deletion, governed by the 4-pan forbidden subgraph, leads to recurrence T(k)T(k1)+4T(k2)T(k)\le T(k-1)+4T(k-2), yielding O(2.562k)O(2.562^k) (Nastos et al., 2010). More refined approaches (including larger forbidden patterns and automated branching rule generation) reduce this exponent to O(2.303k)O(2.303^k) (Tsur, 2019). The bottleneck is the fraction of P4P_4s destroyed in each branch, optimized via analysis of inclusion-minimal deletion sets.

Notably, every branching rule is exhaustive and safe, ensuring that all solutions are enumerated, and once in P4P_4-sparse class, decomposition allows for optimal resolution in O(n+m)O(n+m) time (Nastos et al., 2010).

6. Recent Advances: O((2+ε)k)O^*((2+\varepsilon)^k) Algorithm

Lafond and Sarrazin (2026) introduced a new direction by leveraging structural theorems for large prime graphs in concert with modular decomposition (Lafond et al., 5 Jan 2026). Their strategy employs the result (Chudnovsky–Kim–Oum–Seymour, 2015) that every sufficiently large prime graph contains as an induced subgraph one of seven “unavoidable” structures (and their complements): 1-subdivision of K1,cK_{1,c}, line graph of K2,cK_{2,c}, thin spiders, half-graphs, and specific chain graphs, each guaranteeing Ω(c)\Omega(c) induced P4P_4s.

This insight enables branching rules where, on encountering a large prime quotient, the process:

  • Identifies one of the unavoidable subgraphs with Θ(c)\Theta(c) P4P_4s.
  • Branches by deleting a small “pivot” edge (cost $1$ or a few) or, if not, pays a large linear penalty forced by the overlap structure (cost Θ(c)\Theta(c)).
  • As cc\to\infty, the branching vector yields an exponent arbitrarily close to $2$.

The overall algorithm applies modular decomposition, recursing on prime modules and quotient graphs, and, crucially, controls the branching number via linear-cost pivots: T(k)2T(k1)+T(k(c1)),T(k) \leq 2T(k-1) + T(k-(c-1)), with the largest root tending to 2+ε2+\varepsilon for sufficiently large cc. Thus, the subexponential bottleneck of the 2.303k2.303^k-type algorithms is circumvented for the first time (Lafond et al., 5 Jan 2026). The correctness is underpinned by decomposition lemmas (all minimal forbidden induced subgraphs are prime) and the universality of the structural theorem for prime graphs.

This approach generalizes to any hereditary HH-free editing problem where all minimal forbidden graphs are prime, including cluster editing, chordal editing, and trivial-perfect editing, provided this structural criterion holds.

7. Summary Table: Algorithmic Landscape for Cograph Deletion

Algorithmic Approach Exponential Factor Core Technique Reference
Forbidden P4P_4 branching (naive) O(3k)O(3^k) P4P_4-search (Tsur, 2019)
P4P_4-sparse subgraph branching O(2.562k)O(2.562^k) Forbidden 5-vertex (Nastos et al., 2010)
Automated pattern branching O(2.303k)O(2.303^k) Enumerate all minimal (Tsur, 2019)
Modular decomposition + pairwise merges FPT in kk Module merges (Hellmuth et al., 2015, Hellmuth et al., 2017)
Modular-width parameterization O(n2(k!)22k)O(n^2 (k!)^2 2^k) Brute-force over modules (Hellmuth et al., 2017)
Large-prime subgraph structural branching O((2+ε)k)O^*((2+\varepsilon)^k) Exploit prime substructures (Lafond et al., 5 Jan 2026)

Each advance channeled structural insight—first by local forbidden subgraphs, then local-to-global module merges, and most recently by recognizing and exploiting global prime-induced subgraphs—to improve the exponent of the FPT runtime. The methodologies not only precisely solve Cograph Deletion but have implications for a broad class of hereditary graph modification problems.


References

  • (Hellmuth et al., 2015) Techniques for the Cograph Editing Problem: Module Merge is equivalent to Editing P4s
  • (Hellmuth et al., 2017) Cograph Editing: Merging Modules is equivalent to Editing P4's
  • (Nastos et al., 2010) Bounded Search Tree Algorithms for Parameterized Cograph Deletion: Efficient Branching Rules by Exploiting Structures of Special Graph Classes
  • (Tsur, 2019) Faster algorithms for cograph edge modification problems
  • (Lafond et al., 5 Jan 2026) A O((2+ε)k)O^*((2 + ε)^k) Time Algorithm for Cograph Deletion Using Unavoidable Subgraphs in Large Prime Graphs

Topic to Video (Beta)

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Cograph Deletion Problem.