PRADMM for Pose Graph Optimization
- The paper introduces PRADMM, which decouples global interactions via variable splitting and closed-form per-vertex updates, reducing computation compared to traditional methods.
- PRADMM leverages an augmented Lagrangian and Riemannian convergence proofs under mild manifold conditions to ensure robust, global convergence.
- Empirical evaluations on synthetic and real-world datasets demonstrate that PRADMM achieves 2–10× speedups and competitive accuracy (Rel.Err and NRMSE) over state-of-the-art optimizers.
The Parallelizable Riemannian Alternating Direction Method of Multipliers (PRADMM) is a specialized optimization algorithm for non-convex pose graph optimization (PGO), targeting large-scale robotics and SLAM applications. By leveraging variable splitting and equality-constrained reformulation, PRADMM achieves closed-form updates per vertex and enables efficient parallelization, ensuring near-linear scalability with graph size. Its convergence is established via Riemannian arguments under mild smoothness and manifold assumptions, and empirical benchmarks demonstrate superior performance compared to traditional and state-of-the-art methods (Chen et al., 22 Jan 2026).
1. Non-Convex Pose Graph Optimization Formulation
Pose graph optimization underpins numerous robot perception and navigation systems, providing the core estimation for SLAM. Let be a directed graph with poses and relative-pose measurements. Each pose is characterized by a rotation and translation , with measurements .
The maximum-likelihood formulation is:
An equivalent representation uses unit quaternions :
This formulation poses global coupling across the graph, impeding parallelization and closed-form updates.
2. Variable-Splitting and Reformulation
To decouple global interactions, PRADMM duplicates quaternion () and translation () variables by introducing auxiliary variables and , with equality constraints:
The objective splits as , where aggregates translation penalties and rotation penalties, subject to linear constraints , . This approach is designed to enable per-vertex closed-form updates.
3. Augmented Lagrangian Construction
PRADMM employs an augmented Lagrangian with dual multipliers for and for , with penalties :
where , , , . This structure ensures that the update subproblems for each variable block can be solved independently with closed-form solutions.
4. PRADMM Iterative Update Scheme
Each iteration applies a five-step blockwise update with relaxation parameter and block-diagonal proximal regularizers :
- -subproblem (sphere TRS):
For each :
- Isotropic case: is the normalized .
- Anisotropic case: rightmost eigenpair solution.
- -subproblem (Euclidean least-squares):
- -subproblem (Euclidean least-squares):
- -subproblem (Euclidean least-squares):
- Dual updates (over-relaxed):
All updates are performed per-vertex in parallel. The per-vertex cost is , and when average degree is constant, overall iteration cost is .
PRADMM Pseudocode
1 2 3 4 5 6 7 8 9 10 11 |
Initialize p⁰,q⁰,t⁰,s⁰ randomly (q⁰ on 𝕌), λ⁰,z⁰, choose β₁,β₂>0, τ∈(0,2), H₁…H₄.
for k=0,1,2,… until convergence do
for i=1…n (in parallel) do
compute A_{1,i}^k,b_{1,i}^k → p_i^{k+1} (TRS on sphere)
compute A_{2,i}^k,b_{2,i}^k → q_i^{k+1} (LS)
compute A_{3,i}^k,b_{3,i}^k → t_i^{k+1} (LS)
compute A_{4,i}^k,b_{4,i}^k → s_i^{k+1} (LS)
end for
λ^{k+1} = λ^k − τ β₁ (p^{k+1}−q^{k+1})
z^{k+1} = z^k − τ β₂ (t^{k+1}−s^{k+1})
end for |
5. Convergence Properties
Under the assumptions that are Lipschitz-smooth, bounded below, block-multi-convex, and that the -block lies on a smooth compact manifold (the sphere), PRADMM converges as follows for any and sufficiently large :
- A merit function decreases monotonically and is lower bounded.
- The primal and dual residuals vanish asymptotically.
- All limit points reached by the iterates are first-order stationary.
- Invoking a Riemannian Kurdyka–Łojasiewicz argument, the entire sequence is guaranteed to converge (finite-length property).
This suggests robust theoretical guarantees for practical deployment in large-scale graph optimization tasks.
6. Computational Complexity and Parallelization
With constant average graph degree (), each per-vertex subproblem for , , , and incurs computational cost. Consequently, a full iteration entails work. In comparison, traditional methods such as Gauss–Newton or factor-graph solvers generally exhibit or worse scaling. PRADMM's per-vertex update structure lends itself to trivial parallelization, facilitating near-constant memory and computation scaling with respect to the graph size.
7. Empirical Evaluation
Extensive validation is provided on synthetic and real-world datasets:
- Synthetic circular ring and cube datasets (up to ): PRADMM matches or exceeds state-of-the-art accuracy metrics (Rel.Err, NRMSE), while achieving 2–10 speedups over SE-Sync and over 100 speedups compared to manifold Gauss–Newton for large .
- Cube grids ( up to ): PRADMM maintains sub-second solutions for and Rel.Err0.14.
- Real-world 3D SLAM benchmarks (tinyGrid, garage, sphere1/sphere2, torus3D; up to 9,000 edges): PRADMM executes in 0.3–1.0s, matches SE-Sync’s error performance in rotation and translation, and is 3–20 faster.
Performance Metrics Table
| Dataset | Solver | Rel.Err | NRMSE | Time (s) |
|---|---|---|---|---|
| Circular Ring, n=100 | SE-Sync | 0.0711 | 0.0354 | 0.18 |
| PRADMM | 0.0689 | 0.0343 | 0.065 | |
| Circular Ring, n=1000 | SE-Sync | 0.0463 | 0.0232 | 0.38 |
| PRADMM | 0.0457 | 0.0229 | 0.22 | |
| Circular Ring, n=5000 | SE-Sync | 0.0451 | 0.0225 | 0.72 |
| PRADMM | 0.0439 | 0.0219 | 0.26 |
A plausible implication is that PRADMM is well-suited for real-time and large-scale graph optimization in SLAM, with a clear advantage in scalability and computational efficiency compared to established solvers.
8. Summary of Algorithmic Features
PRADMM, as developed by Chen et al. (2025), combines the following key features:
- Structured variable splitting into four blocks, each block solvable in closed-form or via simple computations.
- Natural parallelization across vertices with near-constant computation per vertex.
- Over-relaxed dual ascent permitting relaxation parameters .
- Provable global convergence to stationary points, contingent on mild smoothness and manifold properties.
- Demonstrable superiority in computational time and robustness when applied to large synthetic and real-world PGO datasets (Chen et al., 22 Jan 2026).