Monotonic Relative Neighborhood Graph (MRNG)
- MRNG is a formally defined proximity graph that guarantees strictly decreasing, monotonic paths in greedy best-first search for ANNS.
- It is constructed by ordering neighbors by increasing Euclidean distance and selecting edges based on an empty-lune condition to maintain constant out-degree.
- MRNG underpins scalable, efficient graph-based ANN indices and inspires extensions like block-aware MRNG for disk-resident data environments.
A Monotonic Relative Neighborhood Graph (MRNG) is a formally defined proximity graph designed to guarantee strictly decreasing distance paths in greedy best-first search, primarily for approximate nearest neighbor search (ANNS) in high-dimensional spaces. MRNG ensures connectedness, compactness (constant out-degree), minimal search complexity, and forms the theoretical core for practical graph-based ANN indices, including recent disk-aware extensions.
1. Formal Definition and Construction
MRNG is constructed over a finite set with the metric . For each pair , define the lune
where . The MRNG is the directed graph with
Equivalently, for construction: order by increasing , initialize the out-neighbor list with the nearest neighbor, and for each successive candidate , include if no satisfies .
Pseudocode (all-pairs naive construction, time):
1 2 3 4 5 6 7 8 9 10 11 12 13 |
for each p in S:
R ← S \ {p}, sorted by δ(p, ·) ascending
L ← { R[1] }
for q in R[2:]:
conflict ← false
for r in L:
if δ(r, q) < δ(p, q):
conflict ← true
break
if not conflict:
L.add(q)
for q in L:
add edge (p→q) to MRNG |
2. Monotonicity and Connectivity Properties
A path from to is monotonic if
MRNG is a Monotonic Search Network (MSNET), meaning for every pair there exists a monotonic path from to . Greedy best-first search in MRNG—always moving to the neighbor closest to the query that is strictly closer—never backtracks, with each hop reducing the distance to the target. This ensures strong connectivity: for any , there is a directed path from to (Fu et al., 2017, Zhu et al., 2021).
MRNG is the unique minimal graph structure guaranteeing this property. Any deletion of edges would violate monotonic search coverage for some node pairs (Zhu et al., 2021).
3. Degree Bound and Space Complexity
The out-degree in MRNG is bounded only by the ambient dimension , not by the cardinality ; for -dimensional Euclidean space, by sphere-packing arguments, the degree is
Empirical statistics show average node degrees plateau with increasing ; for , mean degree , max , and for mean degree , max (Zhu et al., 2021). The total number of edges is , yielding a compact index size (Fu et al., 2017).
4. Search-Time Complexity and Greedy Guarantees
For query and source , let denote the length of the monotonic path in MRNG, and the average node out-degree, which is . The greedy search procedure inspects at most candidates per hop, with total time
Under uniform random distribution in ,
where is the minimal difference in triangle side-lengths in . In high dimensions, this growth is close to logarithmic in (Fu et al., 2017). Greedy monotonic search in MRNG always terminates at the true nearest neighbor in at most steps; empirical scaling closely matches or exceeds alternatives like k-NN graphs (Zhu et al., 2021).
5. Generalizations, Approximation Techniques, and Hidden Structures
To accommodate large , generalizations of MRNG restrict neighbor candidate sets and cap the out-degree:
- Degree cap ()
- Candidate pool
Generic MRNG (GenMRNG) applies identical neighbor construction, limited to candidates from and up to neighbors. With , build time is and storage (Zhu et al., 2021).
MRNG possesses hidden structure termed conflicting nodes: for edge , the conflicting node set contains all such that —these are nodes "blocked" during construction. Recording conflict lists per edge facilitates escape from local minima in search, enabling improved recall and faster query time. The conflict-search subroutine, upon encountering a local minimum, scans for nodes closer to ; this strictly accelerates search convergence (Zhu et al., 2021).
Empirical findings indicate that capping degree at roughly half the mean degree of MRNG retains nearly full recall, with a phase transition where adding more edges yields negligible improvement (Zhu et al., 2021).
6. Block-Aware Monotonic Extensions for Disk-Based ANNS
The Block-aware Monotonic Relative Neighborhood Graph (BMRNG) extends MRNG principles to disk-resident graphs where disk I/O cost dominates. Vertex set is partitioned into blocks ; block-aware pruning jointly considers both geometric and storage layout to ensure monotonic search over block transitions.
- Intra-block edges: Retain the full block-wise MRNG.
- Cross-block edges: Pruned aggressively; only kept if no intra-block monotonic shortcut exists.
A monotonic I/O path is defined as a sequence of block-transitions and node steps, with strictly decreasing distances to at every step. BMRNG guarantees such monotonic I/O paths for every pair . The expected number of block transitions is
The Block-Aware Monotonic Graph (BAMG) is a practical, linear-time index that approximates BMRNG by integrating block layout into edge selection and leverages in-memory multi-layer navigation graphs for efficient search entry. BAMG achieves up to 2.1× higher throughput and 13–52% fewer disk I/Os compared to prior approaches, with only a small index size overhead (Li et al., 3 Sep 2025).
7. Practical Implications and Empirical Performance
MRNG serves as a theoretical foundation for scalable ANNS indices such as Navigating Spreading-out Graphs (NSG), which are practical MRNG approximations with nearly build cost and billion-node scalability (Fu et al., 2017).
Guidance for implementation includes using genMRNG with degree caps at approximately half the empirical mean, candidate pools drawn from k-NN graphs or random samples, and conflict lists for heavy edges, optimizing both index build and search times (Zhu et al., 2021).
Compared to classical Relative Neighborhood Graphs (RNG), MRNG relaxes the empty-lune condition, enforcing forward-only monotonic greedy paths and precluding backtracking. BMRNG/BAMG further extend this property to disk-based environments, making graph construction and block assignment a joint optimization problem, resulting in significant query throughput and I/O gains while preserving monotonicity and recall (Li et al., 3 Sep 2025).