Ontology-Based Depth–Breadth Matrix
- Depth–Breadth Matrix (Ontology-Based) is a framework that quantifies hierarchical taxonomies by measuring maximum tree depth and category breadth using ontological methods.
- It utilizes local ontologies and semantic distance thresholds to dynamically decide when to merge, add a sibling, or insert a child category.
- The approach enables algorithmic evaluation and reconfiguration of directory structures to maintain balance and improve semantic organization.
A Depth–Breadth Matrix in the context of ontology-based administration of Web directories formalizes the structural organization of hierarchical taxonomies by quantifying both their depth (the longest root-to-leaf path) and breadth (the number of categories at each level). This abstraction provides a framework for analyzing, comparing, and algorithmically evolving directory structures using semantic criteria derived from ontologies. While the term "Depth–Breadth Matrix" does not explicitly appear in the foundational work by Horvat et al., the essential mathematical and algorithmic components for constructing such a representation underlie their approach to Web directory management and automation (Horvat et al., 2013).
1. Hierarchical Web Directory Structure
A Web directory is defined as a rooted tree of categories , where each category occupies a level , with the root always at . The depth is formally the maximal level:
Breadth at a given level is quantified as the number of categories present at that level:
Locally, a per-node branching factor is given by the number of children of a category :
This structure serves as the backbone for semantic organization and evaluation using ontological similarity metrics.
2. Ontology-Driven Semantics for Categorization
Each directory category possesses a local ontology generated via a semantic-extraction function $\sem\colon R \to O$ applied either to the set of resources contained in or recursively constructed from the ontologies of its child categories. This semantic enrichment enables automated, meaning-preserving directory construction and refinement, as sub-categories and mergers are not solely determined by syntactic or heuristic mechanisms but by quantitative semantic similarity (Horvat et al., 2013).
3. Ontological Distance and Decision Criteria
Dynamic construction and maintenance of the directory’s structure is governed by the ontological distance between categories:
$\dist(O_a, O_b) = \frac{1}{\sim(O_a, O_b)}$
where denotes the semantic similarity function. Two primary thresholds regulate when new categories emerge and at which position:
- Minimal horizontal semantic distance (): Threshold for adding siblings at the same level.
- Minimal vertical semantic distance (): Threshold for adding children, increasing the depth.
A resource ontology is:
- Merged if $\dist \le \min\{\mathit{mindist}_H, \mathit{mindist}_V\}$,
- Inserted as a sibling if $\mathit{mindist}_H < \dist \le \mathit{mindist}_V$,
- Inserted as a child category (increasing depth) if $\dist > \mathit{mindist}_V$.
These thresholds render the depth and breadth dynamics responsive to semantic rather than purely structure-driven considerations.
4. Formal Representation: Depth–Breadth Matrix
Although the explicit terminology "Depth–Breadth Matrix" is not used, the following representation is directly constructed from the paper's definitions:
- For directory depth and breadths at each level:
Or as a column vector:
This vector suffices to profile a directory’s level widths, facilitating comparative analysis across directories, and tracking evolutionary changes as resources are added or hierarchical reconfigurations occur.
| Level () | Number of Categories () |
|---|---|
| 1 | |
A full matrix embedding both breadth (per level) and depth (level index) can further be constructed, but for single-directory analysis, the breadth vector captures all essential structure information.
5. Ontology-Based Directory Construction Algorithm
The construction process automates hierarchy formation via iterative evaluation against semantic distance thresholds. The pseudocode based on Horvat et al. is as follows:
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 |
for each resource r in R: Onew = sem(r) if only root exists: create new category at level 2 with ontology Onew place r in new category else: c_star = argmin_c dist(Onew, O_c) d = dist(Onew, O_{c_star}) if d > mindistV: create Cnew as child of c_star, level = l(c_star)+1 place r in Cnew elif d > mindistH: create Cnew as sibling of c_star, level = l(c_star) place r in Cnew else: merge r into c_star O_{c_star} = O_{c_star} + Onew # unify ontologies |
6. Application: Directory Analysis and Quality Evaluation
The depth and breadth vector function as objective, quantifiable measures of a directory’s structure. Algorithmic procedures can automatically reconfigure the hierarchy to optimize for a desirable “Depth–Breadth Matrix”—for instance, balancing the tree or enforcing targets for maximum depth or even-level widths. These measurements serve as a basis for structural quality evaluation, longitudinal monitoring, and feedback in the ongoing development or realignment of Web directories (Horvat et al., 2013).
A sample scenario demonstrates these processes:
- Initial root, .
- Inserting : directory becomes , .
- Inserting (depending on $\dist$ and thresholds): possible outcomes , (new child), or , (new sibling), or a merge if ontologies are highly similar.
7. Significance and Extensions
The ontology-based approach enables the Depth–Breadth Matrix (or vector) to not only document structural properties but also connect directly to semantic considerations. The semantic thresholds and introduce a principled mechanism for structure evolution closely tied to the meanings and relationships within the knowledge domain. A plausible implication is that such ontologically-driven matrices could guide the automated optimization or refactoring of directory structures towards arbitrarily specified depth–breadth profiles, supporting a broad class of use cases in knowledge organization and retrieval system design (Horvat et al., 2013).