Papers
Topics
Authors
Recent
Search
2000 character limit reached

LM-A*: Dual-Landmark Heuristic for Large Graphs

Updated 26 May 2026
  • LM-A* is a dual-landmark heuristic framework that generalizes ALT by leveraging polygon inequalities to compute tighter lower bounds for shortest-path search.
  • It reduces preprocessing storage from Θ(|V|·|L|) to Θ(|V|+|L|²), enhancing efficiency in memory-critical applications like large road networks.
  • Empirical results show LM-A* can cut query times by 30–60% and reduce node expansions by 2–3×, especially in long-range search scenarios.

Landmark Progression (LM-A*)—formally denoted as ALP (A*, Landmarks, Polygon inequalities)—is a dual-landmark heuristic framework that generalizes the classic ALT (A*, Landmarks, Triangle inequalities) approach for shortest-path search on large graphs. By leveraging generalized polygon inequalities over pairs of landmarks, LM-A* achieves significantly tighter heuristic lower bounds while dramatically reducing the preprocessing storage requirements compared to ALT. This makes it especially effective for large-scale road networks and similar environments where memory efficiency and search speed are critical (Jr, 2016).

1. Classical ALT and Its Generalization via LM-A*

The ALT method augments A* search by preselecting a set LL of landmarks and precomputing all shortest-path distances d(v,)d(v,\ell) for every vertex vv and every landmark L\ell \in L. During search, for a source ss and target tt, the ALT heuristic is defined as:

πtL(s)=maxLd(s,)d(,t)\pi_t^L(s) = \max_{\ell\in L} |d(s,\ell) - d(\ell,t)|

which, by the triangle inequality (d(A,)d(,C)d(A,C)|d(A, \ell) - d(\ell, C)| \leq d(A,C)), provides an admissible lower bound on d(s,t)d(s, t). LM-A* generalizes this by employing pairs of landmarks and applying polygon inequalities. Instead of bounding d(s,t)d(s, t) by a single triangle, LM-A* forms the quadrilateral d(v,)d(v,\ell)0 and extracts several lower-bound estimators, ultimately taking the maximum to define the dual-landmark heuristic.

2. Dual-Landmark Lower Bound Formulation

Given two distinct landmarks d(v,)d(v,\ell)1 and d(v,)d(v,\ell)2, and a search node d(v,)d(v,\ell)3 with target d(v,)d(v,\ell)4, LM-A* (ALP) derives six admissible lower bounds using reverse triangle and Ptolemaic inequalities over the quadrilateral d(v,)d(v,\ell)5. The explicit lower-bound expressions used are:

  • (2a) d(v,)d(v,\ell)6
  • (2b) d(v,)d(v,\ell)7
  • (2c) d(v,)d(v,\ell)8
  • (2d) d(v,)d(v,\ell)9
  • (2e) vv0
  • (2f) vv1

Each satisfies vv2. The LM-A* heuristic at vv3 for target vv4 is defined as:

vv5

This procedure yields a strictly tighter lower bound in many cases, particularly for long-range source-target pairs (Jr, 2016).

3. Preprocessing Strategy and Space Complexity

LM-A* implements a distributed embedding preprocessing paradigm, which minimizes storage:

  • Landmark Selection: The graph vv6 is partitioned by community detection or similar algorithms. Within each partition, a single landmark is chosen (randomly or per ALT heuristics).
  • Distance Storage: For each landmark vv7, shortest-path distances vv8 are stored only for vertices vv9 in its partition (i.e., one per vertex). Additionally, all-pairs shortest-paths L\ell \in L0 are stored for all landmark pairs.
  • Memory Complexity: This approach requires L\ell \in L1 space: one per-vertex distance, plus a full L\ell \in L2 matrix for landmark–landmark distances. By contrast, ALT requires L\ell \in L3 entries (every vertex to every landmark), yielding orders-of-magnitude larger storage for moderately sized L\ell \in L4.

4. LM-A* Query Algorithm

Query-time search proceeds as follows:

  • Given source L\ell \in L5 and target L\ell \in L6, initialize A* search with L\ell \in L7, L\ell \in L8, using a priority queue on L\ell \in L9-values.
  • For each search node ss0, the heuristic ss1 is computed by considering all pairs ss2, where ss3 is the partition landmark for ss4 and ss5. For each such pair, the six lower bounds of (2a–2f) are computed, using the stored ss6, ss7, ss8, and ss9 as required.
  • The maximum over these (for all tt0) is returned as tt1.

The query pseudocode in the original work formalizes these steps and notes that in practice, storing small landmark “neighborhoods” per partition can tighten bounds, though the core approach operates with one landmark per partition (Jr, 2016).

5. Theoretical Properties: Admissibility, Consistency, and Complexity

  • Admissibility: The dual-landmark heuristic tt2 is proven to be admissible; it never overestimates the true shortest-path distance.
  • Consistency: In LM-A*, the heuristic may be inconsistent under distributed embedding, i.e., tt3 can exceed tt4. This necessitates possible node re-openings during A*.
  • Computation: Query time per node is A*’s baseline plus tt5 or tt6 heuristic computations, dependent on the landmark pair strategy.
  • Space Complexity: The main data structure requires tt7 memory, plus overhead for A* search structures.

A comparison table summarizes core complexity properties:

Approach Preprocessing Storage Query Heuristic Cost
ALT tt8 tt9
LM-A* πtL(s)=maxLd(s,)d(,t)\pi_t^L(s) = \max_{\ell\in L} |d(s,\ell) - d(\ell,t)|0 πtL(s)=maxLd(s,)d(,t)\pi_t^L(s) = \max_{\ell\in L} |d(s,\ell) - d(\ell,t)|1–πtL(s)=maxLd(s,)d(,t)\pi_t^L(s) = \max_{\ell\in L} |d(s,\ell) - d(\ell,t)|2

6. Empirical Performance and Benchmarks

Benchmarks on DIMACS road networks and synthetic graphs (up to πtL(s)=maxLd(s,)d(,t)\pi_t^L(s) = \max_{\ell\in L} |d(s,\ell) - d(\ell,t)|3 nodes) demonstrate the following:

  • For short path queries (<50 hops), ALT and LM-A* have similar performance.
  • For longer distances (>100 hops), LM-A* reduces the number of nodes expanded by up to πtL(s)=maxLd(s,)d(,t)\pi_t^L(s) = \max_{\ell\in L} |d(s,\ell) - d(\ell,t)|4–πtL(s)=maxLd(s,)d(,t)\pi_t^L(s) = \max_{\ell\in L} |d(s,\ell) - d(\ell,t)|5 and achieves query time reductions of πtL(s)=maxLd(s,)d(,t)\pi_t^L(s) = \max_{\ell\in L} |d(s,\ell) - d(\ell,t)|6–πtL(s)=maxLd(s,)d(,t)\pi_t^L(s) = \max_{\ell\in L} |d(s,\ell) - d(\ell,t)|7.
  • With πtL(s)=maxLd(s,)d(,t)\pi_t^L(s) = \max_{\ell\in L} |d(s,\ell) - d(\ell,t)|8, ALT requires πtL(s)=maxLd(s,)d(,t)\pi_t^L(s) = \max_{\ell\in L} |d(s,\ell) - d(\ell,t)|9 distance entries for a d(A,)d(,C)d(A,C)|d(A, \ell) - d(\ell, C)| \leq d(A,C)0 node graph, while LM-A* only needs approximately d(A,)d(,C)d(A,C)|d(A, \ell) - d(\ell, C)| \leq d(A,C)1 entries, representing a two-order-of-magnitude reduction in memory footprint (Jr, 2016).

7. Limitations, Open Questions, and Extensions

Key limitations and avenues for further investigation include:

  • Heuristic Inconsistency: Node re-opening due to inconsistency can occur; the impact on performance in very large graphs has not been fully characterized.
  • Landmark Selection: Current landmark choices (random, one-per-partition) may not be optimal. The potential of farthest-landmark or other advanced selection methods within distributed embedding remains open.
  • Extension to Higher-Order Polygons: While LM-A* applies quadrilateral inequalities, hypothetically employing pentagon or higher-order polygon inequalities could further tighten bounds, but would increase preprocessing and query overhead.
  • Parameter Balancing: Determining optimal d(A,)d(,C)d(A,C)|d(A, \ell) - d(\ell, C)| \leq d(A,C)2, which governs both d(A,)d(,C)d(A,C)|d(A, \ell) - d(\ell, C)| \leq d(A,C)3 space and heuristic evaluation cost, versus query speedup is left for future tuning.

A plausible implication is that further advances in the theory and practice of distributed embedding, polygonal inequalities, and landmark selection may yield even more effective preprocessing/search trade-offs while deepening our understanding of heuristic search on massive graphs (Jr, 2016).

Definition Search Book Streamline Icon: https://streamlinehq.com
References (1)

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Landmark Progression (LM-A*).