Distributed Pose-Graph Optimization
- Distributed pose-graph optimization is a technique that collaboratively estimates robot poses from noisy relative measurements by partitioning a global pose graph among multiple agents.
- It employs various algorithmic paradigms—such as consensus-driven, proximal, and ADMM-based methods—to decouple complex nonlinear constraints and improve convergence.
- Effective graph partitioning and inter-agent communication strategies underpin its scalability, resilience, and applicability in multi-robot SLAM and sensor network localization.
Distributed pose-graph optimization (DPGO) refers to a class of algorithms and architectures in which the estimation of robot poses from noisy relative measurements is performed collaboratively by a team of agents, each maintaining and optimizing local portions of the global pose graph. This paradigm is fundamental to collaborative simultaneous localization and mapping (CSLAM), multi-robot SLAM, sensor network localization, and distributed mapping. In DPGO, computation and communication are partitioned among agents, eliminating dependence on a central coordinator and enabling scalability, improved resilience, and privacy preservation.
1. Mathematical Foundations of Distributed Pose-Graph Optimization
A pose graph is an undirected or directed graph $\mathcal{G} = (\mathcal{V}, \mathcal{E})$, where each node $i \in \mathcal{V}$ represents a robot pose $(R_i, t_i)$ with $R_i \in SO(d)$ (usually $d = 2, 3$ for SE(2)/SE(3)) and $t_i \in \mathbb{R}^d$, and each edge $(i, j) \in \mathcal{E}$ is associated with a noisy relative measurement $(\tilde{R}_{ij}, \tilde{t}_{ij})$ approximating $(R_i^\top R_j,\; R_i^\top (t_j - t_i))$. The general maximum-likelihood pose-graph optimization takes the form

$$\min_{\{(R_i, t_i)\}} \sum_{(i,j) \in \mathcal{E}} \kappa_{ij} \left\| R_j - R_i \tilde{R}_{ij} \right\|_F^2 + \tau_{ij} \left\| t_j - t_i - R_i \tilde{t}_{ij} \right\|_2^2,$$

or, equivalently, in chordal, geodesic, or information-theoretic loss forms (Cristofalo et al., 2020, Fan et al., 2020, Li et al., 2024, Xu et al., 2021).
Distributed PGO partitions the global problem into local subproblems, typically by assigning subsets of variables and edges to each agent. Each agent minimizes a local cost over its own variables and those coupled by shared measurements, requiring inter-agent communication at partition boundaries. Classically, the distributed algorithms exploit the sparsity of the underlying factor graph to limit communication to neighbors as induced by the graph topology.
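As a concrete illustration, the chordal cost and its per-agent decomposition can be sketched as follows (a minimal sketch, not any cited paper's implementation; the edge-to-agent assignment rule, which gives each edge to the owner of its tail node, is an assumption for exposition):

```python
import numpy as np

def chordal_cost(R, t, edges):
    """Chordal PGO cost: sum over edges (i, j, R_ij, t_ij) of
    ||R_j - R_i R_ij||_F^2 + ||t_j - t_i - R_i t_ij||^2."""
    cost = 0.0
    for i, j, R_ij, t_ij in edges:
        cost += np.linalg.norm(R[j] - R[i] @ R_ij, "fro") ** 2
        cost += np.linalg.norm(t[j] - t[i] - R[i] @ t_ij) ** 2
    return cost

def local_costs(R, t, edges, owner):
    """Split the global cost by assigning each edge to the agent
    owning its tail node; edges whose endpoints belong to different
    agents are exactly the ones requiring inter-agent communication."""
    costs = {}
    for e in edges:
        a = owner[e[0]]
        costs[a] = costs.get(a, 0.0) + chordal_cost(R, t, [e])
    return costs
```

By construction, the local costs sum to the global cost, so each agent can evaluate and minimize its own share while exchanging only the poses that appear in cross-partition edges.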
2. Algorithmic Paradigms in Distributed Pose-Graph Optimization
DPGO encompasses diverse algorithmic strategies, each tailored to exploit locality, convergence properties, communication constraints, and computational structure.
a. Consensus-Based and Riemannian Optimization
GeoD introduces a continuous-time, consensus-driven gradient flow on the SE(3) pose-graph cost, where each node evolves its pose by integrating neighbor-induced correction terms in both translation and rotation (using matrix logarithms and "vee" operators), leading to provable convergence under mild consistency conditions (Cristofalo et al., 2020). Riemannian gradient descent and block-coordinate descent methods generalize these ideas, treating SE(d)-synchronization as a global optimization on a product manifold, e.g., via IRBCD (Li et al., 2024) and ASAPP (Tian et al., 2020).
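A minimal SO(2) analogue of such a consensus-driven update is sketched below (angles stand in for rotation matrices, angle wrapping plays the role of the matrix logarithm, and the discretized step size is an illustrative assumption, not GeoD's continuous-time flow):

```python
import numpy as np

def wrap(a):
    """Wrap an angle to (-pi, pi] -- the SO(2) analogue of log/vee."""
    return (a + np.pi) % (2 * np.pi) - np.pi

def consensus_step(theta, edges, rel, step=0.1):
    """One discretized consensus-gradient step: each node integrates
    the corrections implied by its neighbors' current estimates and
    the relative measurements rel[(i, j)] ~ theta[j] - theta[i]."""
    new = theta.copy()
    for i in range(len(theta)):
        corr = 0.0
        for (a, b) in edges:
            if a == i:
                corr += wrap(theta[b] - theta[i] - rel[(a, b)])
            elif b == i:
                corr += wrap(theta[a] + rel[(a, b)] - theta[i])
        new[i] = theta[i] + step * corr
    return new
```

With consistent measurements, repeated steps drive every edge residual to zero up to the global gauge freedom (an unobservable common rotation), mirroring the mild consistency conditions in the convergence proof.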
b. Proximal and Majorization-Minimization Methods
A family of approaches leverages block-diagonal quadratic upper bounds (majorizers or generalized proximal operators) to decouple the nonlinear coupled PGO objective into tractable local problems. Each agent solves a node-wise subproblem, often via SVD for the rotation and linear updates for translation (Fan et al., 2020, Fan et al., 2021, Fan et al., 2020). Nesterov acceleration and adaptive restart further accelerate convergence (Fan et al., 2021, Fan et al., 2020).
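The node-wise rotation subproblem mentioned above has a closed-form SVD solution (the special orthogonal Procrustes projection); a sketch, with an illustrative function name:

```python
import numpy as np

def project_to_rotation(M):
    """Closed-form solution of the node-wise rotation subproblem
    argmin_{R in SO(d)} ||R - M||_F via SVD (special Procrustes)."""
    U, _, Vt = np.linalg.svd(M)
    D = np.eye(M.shape[0])
    # Flip the last singular direction if needed so that det(R) = +1.
    D[-1, -1] = np.sign(np.linalg.det(U @ Vt))
    return U @ D @ Vt
```

Because each agent applies this projection independently to its own blocks, the rotation updates parallelize trivially once the majorizer has decoupled the objective.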
c. Splitting and ADMM-Type Approaches
ADMM and Bregman splitting techniques separate the nonconvex orthogonality constraints and enforce consensus by alternating local quadratic solves with closed-form projections (e.g., SVD for nearest-orthogonal matrices). Translation parameters are updated, typically by distributed conjugate gradient or block Jacobi steps, while primal and dual variables are driven toward agreement (Ebrahimi et al., 10 Mar 2025, Chen et al., 2024).
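The alternating local-solve / consensus / dual-update structure can be illustrated on a toy scalar consensus problem (deliberately not SE(3); the quadratic objectives and all names are illustrative assumptions, standing in for each agent's local subproblem and the duplicated separator variable):

```python
def consensus_admm(a, rho=1.0, iters=100):
    """Consensus ADMM for min sum_i (x_i - a_i)^2 s.t. x_i = z:
    each 'agent' i solves its local quadratic in closed form, then a
    consensus (z) and dual (u) update reconcile the copies, mirroring
    how duplicated separator poses are reconciled in distributed PGO."""
    n = len(a)
    x = [0.0] * n
    u = [0.0] * n
    z = 0.0
    for _ in range(iters):
        # Local solve: argmin_x (x - a_i)^2 + (rho/2)(x - z + u_i)^2.
        x = [(2 * a_i + rho * (z - u_i)) / (2 + rho)
             for a_i, u_i in zip(a, u)]
        # Consensus update: average the shifted local copies.
        z = sum(x_i + u_i for x_i, u_i in zip(x, u)) / n
        # Dual update: accumulate the remaining disagreement.
        u = [u_i + x_i - z for u_i, x_i in zip(u, x)]
    return z
```

In the pose-graph setting the local solve is a quadratic in the translations plus an SVD projection for the rotations, but the three-step alternation is the same.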
d. Block-Coordinate Descent and Certificates of Optimality
Distributed Riemannian block coordinate descent (RBCD) methods operate over factorized low-rank SDP relaxations of the PGO problem (Tian et al., 2019, Li et al., 2024). Agents update blocks corresponding to their variables, while distributed KKT certificates and saddle-escape strategies provide certifiably correct global solutions in moderate-noise regimes (Tian et al., 2019).
e. Classic Over-Relaxation and Gauss-Seidel/Jacobi Methods
Early distributed implementations linearize the local optimization (e.g., after chordal relaxation or initialization) and solve block-sparse normal equations via Jacobi or Gauss-Seidel over-relaxation, requiring only exchange of separator variables at each iteration (Choudhary et al., 2017). DGS (Distributed Gauss-Seidel) remains a baseline in empirical studies.
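A minimal over-relaxation sweep on linearized normal equations $A x = b$ can be sketched as follows (dense and single-process here for brevity; in the distributed setting each agent sweeps only its own rows and exchanges separator entries of $x$ with neighbors after each iteration):

```python
import numpy as np

def sor_solve(A, b, omega=1.3, iters=200):
    """Successive over-relaxation (Gauss-Seidel for omega = 1) on the
    linearized normal equations A x = b; converges for SPD A with
    0 < omega < 2."""
    n = len(b)
    x = np.zeros(n)
    for _ in range(iters):
        for i in range(n):
            # Residual of row i excluding the diagonal term.
            sigma = A[i] @ x - A[i, i] * x[i]
            x[i] = (1 - omega) * x[i] + omega * (b[i] - sigma) / A[i, i]
    return x
```

The communication pattern follows the sparsity of $A$: agent boundaries only exchange the entries of $x$ appearing in cross-partition rows, which is exactly the separator exchange described above.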
f. Learning-Based and Hybrid Protocols
Recent advances formulate DPGO as a multi-agent partially observable Markov game, where distributed policy networks based on recurrent edge-conditioned GNNs with adaptive gating enable outlier rejection and rapid inference via MARL (Ghanta et al., 26 Oct 2025). Local policies refine pose estimates via edge corrections, and a consensus scheme harmonizes separators' estimates.
3. Partitioning, Communication, and Graph Structure
Partitioning the global pose graph is central to scalability, load balancing, and minimizing communication. Naive assignment (e.g., one robot per trajectory) leads to subgraph size imbalance and excessive inter-partition edges, increasing communication and straggler effects (Xu et al., 2021, Li et al., 2024). Recent frameworks employ:
- Multi-level Graph Partitioning: Multi-stage (coarsen/partition/refine) algorithms, such as KaHIP variants, produce balanced subgraphs and minimize cut edges. Highest-cut schemes empirically minimize cross-partition communication volume (Li et al., 2024). Streaming and periodic repartitioning can adapt to dynamic keyframe arrival or network topology changes (Xu et al., 2021).
- Streaming Partitioning: Assigns new nodes online using greedy heuristics (e.g., FENNEL) to maintain load balance and reduce cuts (Xu et al., 2021).
- ADMM-based Consensus across Separators: When graph partitioning leads to duplicated separator nodes, consensus steps using information-weighted ADMM reconcile estimates efficiently (Ghanta et al., 26 Oct 2025).
Effective partitioning directly impacts load-balancing, communication per iteration (proportional to cut edges), and overall throughput.
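A FENNEL-style greedy streaming assignment can be sketched as follows (the exact balance penalty, parameter values, and the hard capacity cap are illustrative assumptions following the general form of the heuristic, not the published algorithm verbatim):

```python
def fennel_assign(node, neighbors, parts, capacity, gamma=1.5, alpha=1.0):
    """FENNEL-style streaming assignment: place `node` in the partition
    maximizing (edges to already-placed members) minus the marginal
    balance penalty alpha * (|S|+1)^gamma - alpha * |S|^gamma.
    `parts` maps partition id -> set of already-placed nodes."""
    best, best_score = None, -float("inf")
    for p, members in parts.items():
        if len(members) >= capacity:
            continue  # hard cap keeps partitions load-balanced
        gain = len(members & neighbors)
        penalty = alpha * ((len(members) + 1) ** gamma
                           - len(members) ** gamma)
        score = gain - penalty
        if score > best_score:
            best, best_score = p, score
    parts[best].add(node)
    return best
```

The gain term pulls a new keyframe toward the partition holding its graph neighbors (fewer cut edges, less communication), while the superlinear penalty discourages any partition from growing much faster than the others.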
4. Convergence Analysis and Theoretical Guarantees
Rigorous convergence properties vary by algorithm:
- Consensus/Gradient Methods: Lyapunov arguments under pairwise consistency and minimal sum-translation constraints yield convergence to local minima for continuous-time flows and Riemannian gradient-based methods (Cristofalo et al., 2020, Tian et al., 2020).
- Majorization-Minimization Frameworks: With suitable proximal regularizers, MM methods are guaranteed to produce nonincreasing cost sequences converging to first-order critical points, with provable convergence rates for MM and substantial acceleration via Nesterov-type schemes (Fan et al., 2021, Fan et al., 2020).
- Semidefinite Programming and Riemannian Staircase: The sparse SDP relaxations are tight under moderate-noise, and low-rank factorization with distributed KKT certification and saddle-escape yields certifiably globally optimal solutions (Tian et al., 2019, Li et al., 2024).
- ADMM and Splitting Methods: While lacking universal global optimality certificates under nonconvex constraints, splitting-based ADMM or PieADMM achieves convergence to approximate stationary points, with iteration bounds under mild regularity (Chen et al., 2024, Ebrahimi et al., 10 Mar 2025).
- Asynchrony and Delay Tolerance: ASAPP and similar asynchronous protocols are proven globally convergent under bounded message delays and guarantee sublinear convergence under stepsize constraints that depend explicitly on delay bound and network degree (Tian et al., 2020).
5. Practical Implementations and Empirical Evaluation
Extensive empirical validations benchmark distributed PGO solvers on standard synthetic and real-world SLAM datasets (e.g., Parking Garage, Cubicle, Rim, Sphere, Torus, Manhattan, KITTI, Intel, City10000) (Cristofalo et al., 2020, Fan et al., 2021, Li et al., 2024, Ghanta et al., 26 Oct 2025).
Summary of observed properties:
| Framework | Key Features | Empirical Performance |
|---|---|---|
| GeoD (Cristofalo et al., 2020) | Consensus-gradient, Lyapunov proof | 717× faster than SE-Sync, 3.4× lower error than DGS; robust up to 1000+ nodes |
| ASAPP (Tian et al., 2020) | Asynchronous, delay-tolerant | Matches/lowers DGS cost; robust under delays |
| IRBCD+Partition (Li et al., 2024) | Multilevel partition, block-opt | Fewest comm. edges ("Highest"), 2–6× faster than DGS; scalable to 16+ robots |
| MM-PGO/AMM-PGO (Fan et al., 2021) | Surrogate, inertia, acceleration | 5–10× faster than DGS, robust to outliers |
| RBCD–SDP (Tian et al., 2019) | Certifiable optimal, KKT/saddle escape | Exact recovery under moderate noise, faster than DGS; scalable to large graphs |
| SOC-ADMM (Ebrahimi et al., 10 Mar 2025) | Closed-form splitting, Bregman | Near-global minima, outperforms DGS on large graphs, low per-iteration overhead |
| PieADMM (Chen et al., 2024) | Quaternion, Riemannian ADMM | Parallel local steps, O(1/ε²) convergence |
| BDPGO (Xu et al., 2021) | Streaming+offline partition, resilience | 2–5× speedup, 60–75% lower communication, seamless recovery under failures |
| MARL-GNN (Ghanta et al., 26 Oct 2025) | Actor-critic, GNN+edge-gating | 37.5% lower cost F(x) than SOTA, 6–20× faster inference, scalable deployment |
Consensus-based, MM/proximal, and Riemannian methods consistently outperform early DGS/Jacobi schemes in both speed and quality. Partition quality (balance, cut size) is critical, and adaptive approaches further boost robustness to network changes and failures (Xu et al., 2021, Li et al., 2024).
6. Extensions, Applications, and Open Challenges
Key extensions and practical deployment aspects include:
- Multi-robot SLAM and collaborative mapping: DPGO is foundational in DCSLAM, collaborative visual/inertial mapping, and distributed sensor network localization (Li et al., 2024, Xu et al., 2021).
- Object-level and semantic SLAM: Object-based SLAM models can be optimized via distributed PGO while drastically reducing communication volume and data privacy exposure (Choudhary et al., 2017).
- Dynamic and Adversarial Environments: Fully distributed, resilient protocols adapt partitioning and optimization dynamically in response to robot failures, changing network topology, or sensor dropouts (Xu et al., 2021).
- Learning-based methods: MARL and GNN-encoded actor policies provide robustness to non-Gaussian outliers and enable constant inference cost per agent as the team size scales (Ghanta et al., 26 Oct 2025).
Open challenges include fully decentralized graph partitioning, distributed dynamic repartitioning for time-varying topologies, rigorous theoretical convergence for nonconvex ADMM in SE(d), and machine learning integration for outlier rejection, initialization, and solver warm-starting.
7. Comparative Table of Distributed PGO Algorithms
| Algorithm | Partitioning | Convergence Guarantee | Typical Use Case | Ref |
|---|---|---|---|---|
| GeoD | Flat, adjacency | Lyapunov (local) | Large graphs, consensus | (Cristofalo et al., 2020) |
| ASAPP | Flat, stateless | Sublinear, async, delay | Delay-tolerant, large scales | (Tian et al., 2020) |
| MM/AMM-PGO | Flat/partitioned | First-order critical points | Robust and fast, moderate scale | (Fan et al., 2021, Fan et al., 2020) |
| RBCD+SDP | Flat/partitioned | Certifiable global opt. | Moderate-noise, high-precision | (Tian et al., 2019) |
| IRBCD+Highest | Multilevel | First-order, global | Large-scale, communication & load balancing | (Li et al., 2024) |
| SOC-ADMM | Flat | Empirical, near-global | SE(3), closed-form subproblems | (Ebrahimi et al., 10 Mar 2025) |
| BDPGO | 2-stage, dynamic | Empirical, resilient | Swarms, disconnects, mapping | (Xu et al., 2021) |
| MARL-GNN | Multilevel partition | Empirical, learning-based | Outlier-prone, rapid inference | (Ghanta et al., 26 Oct 2025) |
All frameworks above rely only on neighbor-to-neighbor communication and substantially advance the state of the art over centralized and naive distributed baselines in accuracy, scalability, and computational and communication efficiency.