SuReNav: Human-Like Navigation in Constrained Environments
- SuReNav is a navigation framework that models environments as superpixel graphs to clearly differentiate hard and soft constraints in over-constrained settings.
- It leverages a graph neural network to learn regional relaxation costs, integrating these into a differentiable A* planner for human-like, efficient path planning.
- Evaluations and real-world trials show SuReNav’s superior performance in safety, efficiency, and human-likeness compared to conventional navigation methods.
SuReNav is a navigation framework designed for over-constrained environments where regions present both hard (impassable) and soft (risky, but traversable) constraints. It addresses the challenge of guiding robots or agents through semi-static, real-world environments in a manner that closely matches human preferences: minimizing risk by avoiding hard constraints while traversing soft regions only when necessary, and doing so in an efficient, human-like manner. SuReNav combines superpixel-based map abstraction, regional-constraint relaxation via graph neural networks, differentiable path planning, and a closed-loop execution pipeline (Koh et al., 6 Feb 2026).
1. Superpixel Graph Map Construction
SuReNav represents the environment as a superpixel region adjacency graph, abstracted from either 2D semantic maps or rasterized 3D OpenStreetMap (OSM) data. Each input map is over-segmented using SLIC into compact superpixels , which serve as graph nodes. This segmentation preserves semantic boundaries to ensure that each superpixel corresponds to a homogeneous environmental region.
Node features include:
- : Centroid for spatial reasoning.
- : One-hot vector encoding semantic class and goal/start region membership.
Edges are created between adjacent superpixels if the shared boundary exceeds a threshold :
- .
- : Edge cost is the Euclidean centroid distance.
Regions are categorized as:
- : Strictly traversable (e.g., sidewalk).
- : Soft constraints (road, grass).
- : Hard constraints (building, water).
Hard constraints enforce ; such nodes are excluded from all paths. For , an initial or learned risk cost quantifies the traversability penalty.
2. Regional-Constraint Relaxation via GNN
A graph neural network (GNN) models the function , predicting the cost of traversing each soft-constraint superpixel and thereby parameterizing the trade-off between path length and risk. The architecture comprises stacked GraphGPS layers, combining GatedGCN message passing with sparse global attention transformers for effective spatial-semantic reasoning.
Inputs:
- : Node semantic attributes.
- : Edge geometric costs.
The output is a non-negative vector for each node, controlling relaxation penalties.
Differentiable path-cost coupling is achieved by integrating into a soft A* variant (GraphMP), where the cost for expanding node is
Soft selection and backpropagation allow end-to-end training from expert demonstrations.
The loss for matching expert relaxation choices is:
where indicates predicted relaxed nodes, are ground truth, and strongly penalizes missed necessary relaxations.
Training proceeds via projection of demonstration paths onto , generation of region sequences, differentiable planning, and batched loss accumulation:
with stabilization via weighted examples.
3. Interleaved Relaxation, Planning, and Execution
SuReNav interleaves map updating, GNN-based relaxation, graph planning, and continuous path execution in an online feedback loop aimed at robust navigation under environment dynamics. At every time step, the system:
- Updates semantic regions based on new sensor observations, fusing camera and LiDAR data.
- Regenerates the superpixel graph as necessary (e.g., upon significant environmental change).
- Evaluates to produce current node relaxation costs.
- Uses differentiable A* planning over to determine which soft regions to relax, strictly avoiding .
- Extracts the set of soft superpixels included in the planned path.
- Performs fine-grained A* planning at the continuous level over .
- Executes the next motion primitive.
- Repeats until the goal is reached or a failure occurs.
The total planning cost for a sequence is given by:
subject to .
4. Performance Evaluation and Results
SuReNav is evaluated against state-of-the-art baselines (D* Lite, COA*, RCR) on held-out test sets encompassing both 2D and OSM-derived map types. Human-likeness is quantified using normalized Fréchet distance between the planned and demonstration paths and intersection-over-union (IoU) over relaxed soft-regions.
| Method | Fréchet↓ | Relax. IoU↑ |
|---|---|---|
| D* Lite | 0.429 | 0.223 |
| COA* | 0.701 | 0.271 |
| RCR | 0.626 | 0.259 |
| SuReNav | 0.334 | 0.416 |
SuReNav yields the lowest Fréchet distance and substantially higher relaxation IoU compared to the best alternatives, closely aligning planned trajectories with human demonstrations.
Efficiency and safety are quantified using Success Rate (SR), Success weighted by Path Length (SPL), and estimated risk (summed pixel-wise LLM-inferred risk scores). Across 300 environments, SuReNav maintains SR=100%, SPL in the 0.80–0.85 range, and risk ≈0.09–0.15, achieving a favorable trade-off not matched by alternatives.
Qualitative analysis shows that SuReNav selectively relaxes soft constraints (e.g., taking short grass or crosswalk traversals only when safe) and otherwise adheres to low-risk, human-like paths, demonstrating both fidelity and efficiency (Koh et al., 6 Feb 2026).
5. Real-World Robotic Deployment
SuReNav's generalization and real-world applicability are established via deployment on a Boston Dynamics Spot quadruped equipped with RGB-D and LiDAR sensors. The online pipeline fuses semantic segmentation (PIDNet) with real-time occupancy data, re-segments via SLIC at regular intervals or upon significant map change, and uses the same GNN weights as in simulation.
Continuous A* planning is performed over a 10 cm grid, respecting Spot's locomotion constraints. In diverse campus navigation trials, SuReNav robustly selects minimal-risk soft traversals (e.g., crossing a single narrow parking lot only as needed), and otherwise opts for safe, longer detours, closely matching expert policy.
Direct transfer from simulation to hardware is achieved without recalibration or manual tuning of relaxation costs, underscoring the system's robustness and practicality for semi-static urban navigation scenarios.
6. Comparative Analysis and Significance
SuReNav establishes a new approach to navigation in over-constrained environments, distinguished by:
- A region-based abstraction leveraging superpixel adjacency graphs for scalable constraint representation.
- Learned regional relaxation costs via GNNs trained on human demonstrations, successfully generalizing costs to unseen environments.
- Differentiable planning that allows loss signal to shape relaxation and global efficiency.
- Superior trade-off between path efficiency (SPL), risk, and human-likeness metrics over both hand-designed (COA*, D* Lite) and learned relaxation baselines.
A plausible implication is that regional constraint relaxation, rather than uniform pixel- or cell-level cost assignments, enables more faithful imitation of human navigation in settings where certain risky regions are unavoidable. The architecture is amenable to new modalities and scalable to large urban maps (Koh et al., 6 Feb 2026).