LM-A*: Dual-Landmark Heuristic for Large Graphs

Updated 26 May 2026

LM-A* is a dual-landmark heuristic framework that generalizes ALT by leveraging polygon inequalities to compute tighter lower bounds for shortest-path search.
It reduces preprocessing storage from Θ(|V|·|L|) to Θ(|V|+|L|²), enhancing efficiency in memory-critical applications like large road networks.
Empirical results show LM-A* can cut query times by 30–60% and reduce node expansions by 2–3×, especially in long-range search scenarios.

Landmark Progression (LM-A*), formally denoted ALP (A*, Landmarks, Polygon inequalities), is a dual-landmark heuristic framework that generalizes the classic ALT (A*, Landmarks, Triangle inequalities) approach for shortest-path search on large graphs. By applying generalized polygon inequalities over pairs of landmarks, LM-A* obtains tighter heuristic lower bounds than ALT while reducing preprocessing storage from Θ(|V|·|L|) to Θ(|V|+|L|²). This makes it especially effective for large-scale road networks and similar settings where memory efficiency and search speed are critical (Jr, 2016).

1. Classical ALT and Its Generalization via LM-A*

The ALT method augments A* search by preselecting a set $L$ of landmarks and precomputing all shortest-path distances $d(v,\ell)$ for every vertex $v$ and every landmark $\ell \in L$. During search, for a source $s$ and target $t$, the ALT heuristic is defined as:

$\pi_t^L(s) = \max_{\ell\in L} |d(s,\ell) - d(\ell,t)|$

which, by the triangle inequality ($|d(A, \ell) - d(\ell, C)| \leq d(A,C)$), provides an admissible lower bound on $d(s, t)$. LM-A* generalizes this by employing pairs of landmarks and applying polygon inequalities. Instead of bounding $d(s, t)$ by a single triangle, LM-A* forms the quadrilateral $(s, \ell_1, \ell_2, t)$ over a pair of landmarks $\ell_1, \ell_2 \in L$ and extracts several lower-bound estimators, taking their maximum to define the dual-landmark heuristic.
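The ALT bound above can be illustrated with a minimal Python sketch. The toy graph, the landmark choice (`a` and `e`), and the names `dijkstra`, `alt_heuristic`, `GRAPH`, and `LANDMARK_DIST` are assumptions made for illustration; the graph is taken to be undirected so that $d(\ell, t) = d(t, \ell)$.

```python
import heapq

def dijkstra(graph, source):
    """Shortest-path distances from `source` in a non-negatively weighted graph."""
    dist = {source: 0.0}
    heap = [(0.0, source)]
    while heap:
        d, u = heapq.heappop(heap)
        if d > dist[u]:
            continue  # stale queue entry
        for v, w in graph[u]:
            nd = d + w
            if nd < dist.get(v, float("inf")):
                dist[v] = nd
                heapq.heappush(heap, (nd, v))
    return dist

def alt_heuristic(landmark_dist, s, t):
    """max over landmarks of |d(s, l) - d(l, t)|, assuming symmetric distances."""
    return max(abs(d[s] - d[t]) for d in landmark_dist.values())

# Hypothetical undirected toy graph: node -> [(neighbor, weight), ...]
EDGES = [("a", "b", 1.0), ("b", "c", 1.0), ("c", "d", 1.0),
         ("d", "e", 1.0), ("c", "f", 2.0)]
GRAPH = {}
for u, v, w in EDGES:
    GRAPH.setdefault(u, []).append((v, w))
    GRAPH.setdefault(v, []).append((u, w))

# ALT preprocessing: one full distance table per landmark (|V| * |L| entries).
LANDMARK_DIST = {l: dijkstra(GRAPH, l) for l in ("a", "e")}

print(alt_heuristic(LANDMARK_DIST, "f", "d"))
```

For the pair (`f`, `d`), landmark `a` alone gives a loose bound while landmark `e` gives a bound equal to the true distance, which is why ALT takes the maximum over all landmarks.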

2. Dual-Landmark Lower Bound Formulation

Given two distinct landmarks $\ell_1$ and $\ell_2$, and a search node $v$ with target $t$, LM-A* (ALP) derives six admissible lower bounds using reverse triangle and Ptolemaic inequalities over the quadrilateral $(v, \ell_1, \ell_2, t)$. The explicit lower-bound expressions used are:

(2a) $d(v,\ell)$ 6
(2b) $d(v,\ell)$ 7
(2c) $d(v,\ell)$ 8
(2d) $d(v,\ell)$ 9
(2e) $v$ 0
(2f) $v$ 1

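Each is a valid lower bound, i.e., at most $d(v, t)$. The LM-A* heuristic at $v$ for target $t$ is defined as the maximum of the six:

$h_t(v) = \max\{\text{(2a)}, \text{(2b)}, \text{(2c)}, \text{(2d)}, \text{(2e)}, \text{(2f)}\}$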

This procedure yields a strictly tighter lower bound in many cases, particularly for long-range source-target pairs (Jr, 2016).
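To make the idea of a quadrilateral bound concrete, here is one generic polygon-inequality bound as a Python sketch. It is not claimed to be one of the six expressions (2a) to (2f) from the source; it is simply a bound that holds in any symmetric metric, and the function name `quad_lower_bound` and the example numbers are assumptions. It follows from $d(\ell_1, \ell_2) \le d(\ell_1, v) + d(v, t) + d(t, \ell_2)$.

```python
def quad_lower_bound(d_v_l1, d_l1_l2, d_l2_t):
    """Lower bound on d(v, t) from the quadrilateral (v, l1, l2, t).

    Walking l1 -> v -> t -> l2 cannot be shorter than d(l1, l2), so
    d(l1, l2) <= d(l1, v) + d(v, t) + d(t, l2); rearranging gives the bound.
    Assumes symmetric, non-negative shortest-path distances.
    """
    return max(0.0, d_l1_l2 - d_v_l1 - d_l2_t)

# Hypothetical numbers: four points on a line at positions
# 0 (l1), 2 (v), 7 (t), 10 (l2), so the true distance d(v, t) is 5.0.
bound = quad_lower_bound(d_v_l1=2.0, d_l1_l2=10.0, d_l2_t=3.0)
print(bound)
```

Note that this bound needs only one vertex-to-landmark distance for $v$, one for $t$, and one landmark-to-landmark distance, which is the kind of information a per-partition storage scheme can supply.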

3. Preprocessing Strategy and Space Complexity

LM-A* implements a distributed embedding preprocessing paradigm, which minimizes storage:

Landmark Selection: The graph $G = (V, E)$ is partitioned by community detection or similar algorithms. Within each partition, a single landmark is chosen (randomly or per ALT heuristics).
Distance Storage: For each landmark $\ell$, shortest-path distances $d(v,\ell)$ are stored only for vertices $v$ in its partition (i.e., one per vertex). Additionally, all-pairs shortest-path distances $d(\ell, \ell')$ are stored for all landmark pairs.
Memory Complexity: This approach requires $\Theta(|V| + |L|^2)$ space: one per-vertex distance, plus a full $|L| \times |L|$ matrix for landmark–landmark distances. By contrast, ALT requires $\Theta(|V| \cdot |L|)$ entries (every vertex to every landmark), yielding orders-of-magnitude larger storage for moderately sized $|L|$.
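The storage layout described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the source's implementation: the partitioning step is not shown and is replaced by a hypothetical, hand-written mapping `LANDMARK_OF` from each vertex to its partition landmark; the graph is assumed connected and undirected; `preprocess`, `OWN_DIST`, and `LM_DIST` are illustrative names.

```python
import heapq

def dijkstra(graph, source):
    """Shortest-path distances from `source` in a non-negatively weighted graph."""
    dist = {source: 0.0}
    heap = [(0.0, source)]
    while heap:
        d, u = heapq.heappop(heap)
        if d > dist[u]:
            continue  # stale queue entry
        for v, w in graph[u]:
            nd = d + w
            if nd < dist.get(v, float("inf")):
                dist[v] = nd
                heapq.heappush(heap, (nd, v))
    return dist

def preprocess(graph, landmark_of):
    """Keep one distance per vertex plus a full landmark-to-landmark table."""
    landmarks = sorted(set(landmark_of.values()))
    from_landmark = {l: dijkstra(graph, l) for l in landmarks}
    # d(v, l_v): distance from each vertex to its own partition landmark only.
    own_dist = {v: from_landmark[l][v] for v, l in landmark_of.items()}
    # d(l, l'): |L| x |L| matrix over all landmark pairs.
    lm_dist = {(a, b): from_landmark[a][b] for a in landmarks for b in landmarks}
    return own_dist, lm_dist

# Hypothetical path graph 0-1-2-3-4-5 with unit weights, split into two partitions.
GRAPH = {i: [] for i in range(6)}
for i in range(5):
    GRAPH[i].append((i + 1, 1.0))
    GRAPH[i + 1].append((i, 1.0))
LANDMARK_OF = {0: 0, 1: 0, 2: 0, 3: 5, 4: 5, 5: 5}

OWN_DIST, LM_DIST = preprocess(GRAPH, LANDMARK_OF)
lm_astar_entries = len(OWN_DIST) + len(LM_DIST)             # |V| + |L|^2
alt_entries = len(GRAPH) * len(set(LANDMARK_OF.values()))   # |V| * |L|
print(lm_astar_entries, alt_entries)
```

On a graph this small the two counts are close; the gap only becomes large when $|V|$ is much bigger than $|L|^2$.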

4. LM-A* Query Algorithm

Query-time search proceeds as follows:

Given source $s$ and target $t$, initialize A* search with $g(s) = 0$ and $f(s) = h_t(s)$, using a priority queue on $f$-values.
For each search node $v$, the heuristic $h_t(v)$ is computed by considering all pairs $(\ell_v, \ell')$, where $\ell_v$ is the partition landmark for $v$ and $\ell' \in L$. For each such pair, the six lower bounds of (2a–2f) are computed, using the stored vertex-to-landmark and landmark-to-landmark distances as required.
The maximum over these (for all $\ell'$) is returned as $h_t(v)$.

The query pseudocode in the original work formalizes these steps and notes that in practice, storing small landmark “neighborhoods” per partition can tighten bounds, though the core approach operates with one landmark per partition (Jr, 2016).
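The query steps can be sketched in Python as below. This is a simplified illustration, not the source's pseudocode: the heuristic uses only the single pair formed by the partition landmarks of $v$ and $t$, with one generic quadrilateral bound (or the ordinary triangle bound when both share a landmark), rather than the six bounds over all pairs. The tables `OWN`, `LANDMARK_OF`, and `LM` are hand-filled for a hypothetical unit-weight path graph 0-1-2-3-4-5 with landmarks 0 and 5, and the A* loop allows nodes to be re-opened.

```python
import heapq

# Hypothetical preprocessed tables for the path graph 0-1-2-3-4-5 (unit weights).
GRAPH = {i: [] for i in range(6)}
for i in range(5):
    GRAPH[i].append((i + 1, 1.0))
    GRAPH[i + 1].append((i, 1.0))
LANDMARK_OF = {0: 0, 1: 0, 2: 0, 3: 5, 4: 5, 5: 5}           # vertex -> partition landmark
OWN = {0: 0.0, 1: 1.0, 2: 2.0, 3: 2.0, 4: 1.0, 5: 0.0}        # d(v, l_v)
LM = {(0, 0): 0.0, (0, 5): 5.0, (5, 0): 5.0, (5, 5): 0.0}     # d(l, l')

def make_heuristic(t):
    """Lower bound on d(v, t) using only the stored tables (simplified)."""
    lt = LANDMARK_OF[t]
    def h(v):
        lv = LANDMARK_OF[v]
        if lv == lt:
            return abs(OWN[v] - OWN[t])                    # triangle bound, shared landmark
        return max(0.0, LM[(lv, lt)] - OWN[v] - OWN[t])    # quadrilateral bound
    return h

def astar(graph, s, t, h):
    """A* that tolerates an inconsistent heuristic by re-opening nodes."""
    g = {s: 0.0}
    heap = [(h(s), s)]
    expansions = 0
    while heap:
        f, u = heapq.heappop(heap)
        if f > g[u] + h(u):
            continue  # stale entry: a shorter path to u was found later
        if u == t:
            return g[u], expansions
        expansions += 1
        for v, w in graph[u]:
            ng = g[u] + w
            if ng < g.get(v, float("inf")):
                g[v] = ng  # improved path; v is (re-)inserted into the queue
                heapq.heappush(heap, (ng + h(v), v))
    return float("inf"), expansions

distance, expanded = astar(GRAPH, 1, 4, make_heuristic(4))
print(distance, expanded)
```

Passing `lambda v: 0.0` as the heuristic turns the same loop into Dijkstra's algorithm, which gives a simple baseline for comparing expansion counts.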

5. Theoretical Properties: Admissibility, Consistency, and Complexity

Admissibility: The dual-landmark heuristic $h_t(v)$ is proven to be admissible; it never overestimates the true shortest-path distance $d(v, t)$.
Consistency: In LM-A*, the heuristic may be inconsistent under distributed embedding, i.e., for an edge $(u, v)$ of weight $w(u, v)$, $h_t(u)$ can exceed $w(u, v) + h_t(v)$. A* may therefore need to re-open nodes.
Computation: Query time per node is A*’s baseline plus $O(|L|)$ or $O(|L|^2)$ heuristic computations, depending on the landmark pair strategy (one landmark fixed to the node's partition landmark, or all landmark pairs).
Space Complexity: The main data structure requires $\Theta(|V| + |L|^2)$ memory, plus overhead for A* search structures.
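The consistency condition can be checked edge by edge. The sketch below is an illustration with assumed names (`inconsistent_edges`, `H`) and a deliberately constructed heuristic on a hypothetical unit-weight path graph 0-1-2-3 with target 3; the values in `H` are admissible but not consistent.

```python
def inconsistent_edges(graph, h, eps=1e-12):
    """Return directed edges (u, v) where h(u) > w(u, v) + h(v)."""
    return [(u, v) for u in graph for v, w in graph[u] if h(u) > w + h(v) + eps]

# Hypothetical path graph 0-1-2-3 with unit weights; true distances to 3 are 3, 2, 1, 0.
GRAPH = {0: [(1, 1.0)], 1: [(0, 1.0), (2, 1.0)], 2: [(1, 1.0), (3, 1.0)], 3: [(2, 1.0)]}

# Admissible (never above the true distance) but inconsistent: h drops by 3 across edge (0, 1).
H = {0: 3.0, 1: 0.0, 2: 1.0, 3: 0.0}

violations = inconsistent_edges(GRAPH, lambda v: H[v])
print(violations)
```

An empty result means the heuristic is consistent on that graph, in which case A* never needs to re-open a node.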

A comparison table summarizes core complexity properties:

Approach	Preprocessing Storage	Query Heuristic Cost
ALT	$\Theta(|V| \cdot |L|)$	$O(|L|)$
LM-A*	$\Theta(|V| + |L|^2)$	$O(|L|)$–$O(|L|^2)$

6. Empirical Performance and Benchmarks

Benchmarks on DIMACS road networks and synthetic graphs (up to $\pi_t^L(s) = \max_{\ell\in L} |d(s,\ell) - d(\ell,t)|$ 3 nodes) demonstrate the following:

For short path queries (<50 hops), ALT and LM-A* have similar performance.
For longer distances (>100 hops), LM-A* reduces the number of nodes expanded by up to $2\times$–$3\times$ and achieves query time reductions of $30\%$–$60\%$.
For a graph with $|V|$ nodes and $|L|$ landmarks, ALT requires $|V| \cdot |L|$ distance entries, while LM-A* needs only approximately $|V| + |L|^2$ entries, which the source reports as a two-order-of-magnitude reduction in memory footprint for its benchmark configuration (Jr, 2016).

7. Limitations, Open Questions, and Extensions

Key limitations and avenues for further investigation include:

Heuristic Inconsistency: Node re-opening due to inconsistency can occur; the impact on performance in very large graphs has not been fully characterized.
Landmark Selection: Current landmark choices (random, one-per-partition) may not be optimal. The potential of farthest-landmark or other advanced selection methods within distributed embedding remains open.
Extension to Higher-Order Polygons: While LM-A* applies quadrilateral inequalities, hypothetically employing pentagon or higher-order polygon inequalities could further tighten bounds, but would increase preprocessing and query overhead.
Parameter Balancing: Determining the optimal $|L|$, which governs both the $|L|^2$ space term and the heuristic evaluation cost, versus query speedup is left for future tuning.

A plausible implication is that further advances in the theory and practice of distributed embedding, polygonal inequalities, and landmark selection may yield even more effective preprocessing/search trade-offs while deepening our understanding of heuristic search on massive graphs (Jr, 2016).


References (1)

Computing Shortest Paths Using A*, Landmarks, and Polygon Inequalities (Abstract) (2016)
