Bounded-Suboptimal Bidirectional Search
- Bounded-suboptimal bidirectional search is a framework that finds paths with a guarantee of cost not exceeding a specified multiple of the optimal path cost.
- The method leverages dual frontiers and consistent heuristics, using tailored priority functions to balance rapid solution discovery with error correction.
- Empirical results show that these techniques can significantly reduce node expansions and runtime compared to traditional unidirectional search algorithms.
Bounded-suboptimal bidirectional search is a family of algorithms for graph search that seeks paths between designated states subject to an explicit suboptimality bound on solution cost, while leveraging bidirectional strategies and consistent heuristics. These methods build upon optimal bidirectional search frameworks, generalizing them to produce paths whose cost is guaranteed not to exceed a specified multiple of the optimal cost, providing a rigorous performance guarantee with potential for significant empirical efficiency gains (Shperberg et al., 13 Nov 2025).
1. Problem Formulation and Preliminaries
Given a graph (directed or undirected) with non-negative edge costs , a start node , and a goal node , the objective is to find a path whose cost does not exceed times the optimal cost : where the optimal cost is
The parameter defines the allowed suboptimality, commonly expressed as , with . This framework generalizes the classical A* and Weighted A* (WA*) paradigms to the bidirectional setting with explicit suboptimality guarantees.
2. Algorithmic Foundations: BAE* and its Bounded-Suboptimal Variants
The Bidirectional A* front-to-end algorithm (BAE*) is defined for consistent heuristics. It maintains two simultaneous frontiers: one forward from (F) using , and one backward from (B) using . For each node, it computes the heuristic error in the forward direction, and analogously in the backward direction. The node expansion priorities are given as
Expansion alternates or uses a direction-choice rule, always expanding the node with minimum (). BAE* terminates when a node appears in both open lists and .
Bounded-suboptimal variants ("WBAE*") modify these priorities to
where is a parameter controlling the weighting of the error term. yields weighted bidirectional A* (WBiA*); yields full error correction, and intermediate values trade off solution-finding speed and the tightness of the lower bound. The algorithm terminates when
3. Theoretical Properties: Consistency, Suboptimality, and Completeness
A heuristic is consistent if
for all . Consistency ensures that a node is never reexpanded, provided that ties are broken in favor of higher -values.
Bounded-suboptimality is rigorously guaranteed: Thus, when WBAE* halts, the incumbent solution cost satisfies .
Completeness is assured if the underlying costs are non-negative and the heuristics are consistent. Bidirectional search exhausts both frontiers unless a connecting path is discovered or a negative result is proved.
4. Practical Algorithmic Enhancements
Key algorithmic details include:
- Direction alternation: Expansions alternate between forward and backward frontiers round-robin.
- Node selection: Expand the open node minimizing in the selected frontier.
- Tie-breaking: Prefer larger values to avoid re-expansions under consistency.
- Open/closed management: Standard A* bookkeeping is performed for each frontier independently.
- Lower-bound strengthening:
- GCD rounding: If is the GCD of all edge costs, round up to the next multiple of to potentially reduce node expansions.
- Alternative lower bound (ALB): Track and take the tighter lower bound; particularly effective for small , with up to 50% fewer expansions for but can increase per-node cost for larger .
5. Empirical Evaluation and Comparative Performance
Extensive experiments compared WA*, WBiA*, WMM, WBS, and WBAE (for multiple choices and a tuned ) across the following domains:
- Towers of Hanoi (12 disks)
- 15-puzzle (standard and "heavy" variant)
- 18-pancake (GAP variants)
- DAO (grid pathfinding)
- Mazes (grid pathfinding)
The key metrics were node expansions, runtime, and solution cost ratio . Table 1 summarizes average node expansions (in thousands):
| Algorithm | ToH | STP | Pancake | DAO | Mazes |
|---|---|---|---|---|---|
| WA* | 697 | 15,585 | 2,024 | 524 | 56 |
| WBiA* | 601 | 14,660 | 1,694 | 502 | 76 |
| WMM | 589 | 8,450 | 606 | 400 | 75 |
| WBS* | 644 | 15,640 | 1,985 | 497 | 67 |
| WBAE* () | 607 | 14,470 | 1,603 | 465 | 66 |
| WBAE* ($1/W$) | 540 | 14,470 | 1,646 | 450 | 66 |
| WBAE* ($1$) | 534 | 14,680 | 1,717 | 428 | 65 |
| WBAE* () | 523 | 14,640 | 1,667 | 336 | 61 |
| WBAE* () | 478 | 14,540 | 1,709 | 430 | 60 |
For optimal search (), WBAE*() ≡ BAE* dominates the alternatives, outperforming WA* and WBiA* by a factor of 5–10. For large (), the -term’s significance diminishes; WBiA* () often becomes preferable. Tuning () nearly matches the best fixed choice in all cases. GCD rounding yields 5–20% reductions in node expansions at no runtime cost, while ALB provides further reductions for small but can slow computation for larger .
6. Parameter Selection, Domain Dependence, and Guidelines
The choice of is critical for balancing speed and lower-bound tightness:
- For strong heuristics and/or large , rapid solution finding is prioritized; a small (including , i.e. WBiA*) is effective.
- For weak heuristics or near 1, suboptimality proof becomes dominant, necessitating larger (even , i.e. full error correction).
- Tuning on a small training set identifies an efficient trade-off, with empirically observed correlation that decreases as heuristic quality or increases.
A plausible implication is that domain characteristics and suboptimality requirements must both be actively considered in selecting the optimal weighting scheme.
7. Theoretical and Empirical Synthesis
By generalizing the BAE* error-correcting priorities with a heuristic inflation parameter and a free error-weighting parameter , the WBAE* framework encapsulates a spectrum of bidirectional bounded-suboptimal algorithms. This approach interpolates between weighted bidirectional A* and fully error-corrected BAE*, maintaining completeness, bounded-suboptimality, and no node re-expansion under consistent heuristics. Empirical evidence demonstrates that a small collection of values, including , achieves near-optimal performance across a variety of domains and suboptimality levels, with an additional tuned parameter providing further refinement (Shperberg et al., 13 Nov 2025).