Greedy Equivalence Search (GES)
- Greedy Equivalence Search (GES) is a causal discovery algorithm that learns DAG structures by moving along the edges of the characteristic imset polytope.
- It reframes structure learning as a linear optimization problem, leveraging edge-addition, deletion, and reorientation moves for enhanced performance.
- Extensions like the greedy CIM algorithm integrate skeleton constraints with polytope moves to improve model recovery and computational efficiency.
The Greedy Equivalence Search (GES) algorithm is a foundational method for causal discovery, particularly for estimating the structure of directed acyclic graphs (DAGs) that encode observable conditional independencies. Recent geometric analysis reframes GES as a discrete edge-walk on the characteristic imset polytope (), thereby offering unified conceptual clarity and extending the algorithmic toolkit for structure learning. All technical definitions and results below follow the original account by Studený, Hemmecke, Lindner and subsequent researchers (Linusson et al., 2021).
1. The Characteristic Imset Polytope Framework
GES operates on the space of Markov equivalence classes of DAGs. Each DAG on nodes is associated with its characteristic imset , defined by iff such that . The vector encodes all membership in Markov equivalence classes.
The polytope has as vertices exactly the characteristic imsets, uniquely representing Markov equivalence classes. This geometric reformulation turns causal structure learning into a linear optimization problem.
2. Skeleton Constraints and Faces of
Hybrid approaches to structure learning often begin by estimating an undirected skeleton of the target DAG. Correspondingly, the subset forms a face of the full polytope . By selecting a linear functional supported only on 2-subsets, the maximizers are precisely DAGs whose skeleton is a subgraph of . This structural correspondence clarifies the constraint-enforcement mechanism in hybrid algorithms.
3. Geometric Realization of GES Moves as Edge-Walks
GES comprises three classes of elementary moves, each corresponding precisely to an edge of the polytope :
A) Edge-addition (“forward phase”): For DAG and pair non-adjacent in the skeleton, one can sometimes add without acyclicity violation. Proposition 3.10 and Theorem 3.12 show and is an edge of . This move is called an edge pair.
B) Edge-deletion (“backward phase”): Removal of from the essential graph is the reverse operation, and since is a 0/1-polytope, if two vertices differ in exactly one coordinate they span an edge (Lemma 3.1). Every valid deletion is therefore an edge of .
C) Edge-reorientation (“turn phase”): GES also reverses existing arrows (“Chickering-turn” operations) within the same equivalence class; when this switch creates a new MEC, the corresponding imsets again span an edge—see Proposition 3.5 and Theorem 3.8 for the explicit combinatorics.
In summary, every GES add, delete, or turn move is a walk along an edge of .
4. Extension Beyond Classical GES: Larger Classes of Polytope Moves
The edge pairs and turn pairs in strictly generalize the classical GES moves—there exist polytope edges corresponding to equivalence class transitions that do not arise from simple edge reorientations in any DAG of the current class (Example 3.7), but still realize valid greedy steps. This suggests potential improvements over standard GES by expanding the move set at each sweep.
5. The Greedy CIM Algorithm
Leveraging the geometric generalization, the “greedy CIM” algorithm performs a simplex-type walk on , considering all available edge-pairs (additions and deletions) and turn-pairs permitted by acyclicity. At each step, it selects the move yielding the maximal increase in a chosen score (e.g., BIC). Its algorithmic skeleton is:
- Initialize to the empty DAG.
- Edge-phase: evaluate all valid edge-pairs with , and take the best.
- Turn-phase: evaluate all valid turn-pairs (consistent with acyclicity) with , and take the best.
- Repeat until convergence.
All moves correspond to polytope edges, so the procedure is a greedy edge-walk in the sense of linear programming.
6. Empirical Performance and Hybrid Variants
On Gaussian linear-SEM data with and , the breadth-first, phased version of greedy CIM matches or outperforms GES and GIES in structure recovery. The “skeletal greedy CIM” hybrid variant—where a skeleton is first learned by CI tests, followed by greedy CIM turn-pairs—outperforms MMHC and PC in model and SHD metrics. Reported computational times are (skeletal) to (full) per replication in an R+Python implementation.
7. Synthesis: Unifying Geometric Perspective
GES is a restricted greedy edge-walk on the characteristic imset polytope . Casting GES, GIES, and MMHC in this geometric framework exposes the combinatorial structure of their operations, justifies new algorithmic extensions, and connects skeleton constraints with polytope faces. The greedy CIM algorithm demonstrates principled improvement by exploiting polytope edges not traversable by standard GES, offering both a theoretical unification and a practically robust methodology for greedy causal discovery (Linusson et al., 2021).