Top ZDDs: Compressed ZDD Structures
- Top ZDDs are advanced compressed Zero-suppressed Binary Decision Diagrams that merge identical subgraphs across varied heights using top-tree compression.
- They are constructed by extracting a spanning tree and forming a binary top tree that is then compressed into a top DAG for efficient navigation and membership queries.
- Empirical results demonstrate that Top ZDDs can reduce space from exponential to linear in some cases, though with a 2–5× slower traversal compared to traditional methods.
A Top ZDD is an advanced, compressed form of Zero-suppressed Binary Decision Diagram (ZDD), optimized for memory efficiency via top-tree-based compression of directed acyclic graphs (DAGs). Top ZDDs facilitate compact representation and polylogarithmic-time navigation for large and highly repetitive set families, extending the core structural benefits of traditional ZDDs. Their utility and variants are particularly relevant in combinatorial set representation and symbolic model checking, especially for large, sparse, and asymmetric boolean functions and Kripke structures (Miedema et al., 2023, Matsuda et al., 2020).
1. Foundations: ZDDs and Their Compression Limits
Zero-suppressed Binary Decision Diagrams (ZDDs) are reduced, ordered decision diagrams over a finite variable set , represented as DAGs with internal nodes labeled by . Each node has "else" (low) and "then" (high) children, following a fixed variable order. Classical ZDD reduction rules replace the symmetric BDD "identical children" elimination with the zero-suppression rule: any node where is removed, with incoming arcs redirected to . This succinctly encodes itemsets where variables are usually absent, making ZDDs efficient for sparse families (Miedema et al., 2023, Matsuda et al., 2020).
However, for large or extremely regular set families (e.g., the power set), even ZDDs can require exponential space because standard compression only merges isomorphic subtrees at the same height and cannot exploit deeper repetition, leading to impractically large diagrams (Matsuda et al., 2020).
2. Top ZDDs: Definition and Construction
Top ZDDs apply the top-tree compression paradigm to ZDDs by recognizing and merging identical subgraphs across different heights in the DAG. Construction proceeds as follows:
- Spanning Tree Extraction: A spanning tree is extracted from the branching-node subgraph of the original ZDD, ignoring 0/1 distinction. Non-tree edges are termed complement edges.
- Top Tree Formation: A binary top tree is constructed on via merges (vertical/horizontal) that produce clusters—subtrees with shared boundaries. A greedy bottom-up method yields height.
- DAG Compression: The minimal DAG of (top DAG) is produced by identifying isomorphic subtrees.
- Complement Edge Integration: Complement edges are attached at appropriate clusters, storing local-preorder and delta-label information.
- Final Top ZDD: The result is a labeled DAG where each node represents a possibly repeated cluster, with succinct navigation structures (balanced parentheses, bit-vectors, rank/select for fast access).
A Top ZDD thus consists of:
- A succinctly encoded top DAG.
- For each node: type bits, local label-differences, and complement-edge data.
- Navigation methods supporting in and in .
For certain set families, including the power-set of , Top ZDDs can reduce the representation size from ZDD nodes to Top ZDD nodes (Matsuda et al., 2020).
3. ZDD Variants and Suppression Rules for Model Checking
In symbolic model checking and knowledge compilation, the choice of ZDD suppression rule affects representation size and efficiency:
- Classic ZDD (T0): Then-0 suppression; node with is removed.
- E0/T1/E1 Variants: Defined via generalized rules:
- : BDD-style, symmetric elimination.
Other practical variants (simulated via edge/leaf-flip transformations on or its negations), provide a toolkit for matching ZDD structure to Boolean function asymmetries. Canonical-ness and interrelations among these variants hold as established (Miedema et al., 2023).
Suppression Rule Patterns
| Variant | Suppression Condition | Typical Advantage Scenario |
|---|---|---|
| T0 | high | Highly sparse models, dead 1-branches |
| E0 | low | Dually sparse, dead 0-branches |
| T1 | high | Large disjunctions (OR) |
| E1 | low | Large conjunctions (AND) |
| EQ | low(n)$</td>
<td>Balanced/symmetric</td>
</tr>
</tbody></table></div><h2 class='paper-heading' id='empirical-memory-efficiency-and-use-cases'>4. Empirical Memory Efficiency and Use Cases</h2>
<p>Top ZDD and ZDD variant effectiveness is demonstrated empirically in model checking of Dynamic Epistemic Logic (DEL) puzzles and large itemset datasets:</p>
<ul>
<li><strong>Model Checking Benchmarks</strong> (<a href="/papers/2307.05067" title="" rel="nofollow" data-turbo="false" class="assistant-link" x-data x-tooltip.raw="">Miedema et al., 2023</a>):
<ul>
<li><strong>Sum & Product Puzzle</strong>: T0/E0 can shrink node count to 40–60% of standard BDD. Gains arise from very sparse models where most high-order bits are 0.</li>
<li><strong>Dining Cryptographers</strong>: With sparsity 0.25–0.0625, T0/E0 outperform BDDs by 3–5%.</li>
<li><strong>Muddy Children</strong>: Choice of variant depends on the state-law; T1/E0 excel at extremal rounds (large disjunction/conjunction), collapsing diagrams to a single node.</li>
<li><strong>Intermediate Cases</strong>: BDDc (complement-edge BDD) often captures most of the symmetry benefit; ZDDs yield further improvement with careful asymmetry matching.</li>
</ul></li>
<li><strong>Set Family Compression Benchmarks</strong> (<a href="/papers/2004.04586" title="" rel="nofollow" data-turbo="false" class="assistant-link" x-data x-tooltip.raw="">Matsuda et al., 2020</a>):
<ul>
<li><strong>Power Sets</strong>: Top ZDD achieves exponential compression—$O(m)2^mA=1000A=1000nn=9O(c\,\log^2 n)cO(n_T/\log n_T\,\log\log n_T)n_TO(\log n)O(\log^2 n)O(c\cdot \log^2 n)$.
7. Limitations and Open DirectionsAlthough Top ZDDs compress highly repetitive families exponentially and maintain polylogarithmic-time navigability, they incur navigation overhead (2–5× slower traversal than DenseZDD), and compression efficacy depends strongly on the redundancy exploitable by spanning-tree selection. Finding optimal or improved spanning trees and extending top-DAG-based compression to broader classes of Boolean or DAG-encoded structures are characterized as substantive open problems. Further research is needed to support advanced operations (beyond membership and navigation) in polylogarithmic time (Matsuda et al., 2020). A plausible implication is that for applications prioritizing maximal compression over traversal speed—such as static storage, canonical set-family enumeration, or massively combinatorial symbolic model checking—Top ZDDs are preferable. For dynamic or access-latency-critical scenarios, trade-off analyses may be required. Cited Papers: (Miedema et al., 2023) Exploiting Asymmetry in Logic Puzzles: Using ZDDs for Symbolic Model Checking Dynamic Epistemic Logic (Matsuda et al., 2020) Storing Set Families More Compactly with Top ZDDs |