
Selective Partial Optimization (SPO)

Updated 20 January 2026
  • Selective Partial Optimization (SPO) is a block-decomposition method that updates only significant variable blocks, reducing computational cost while maintaining strong convergence guarantees.
  • It employs adaptive partial linearization and selective Gauss–Newton techniques to tackle convex composite optimization and incremental estimation problems such as SLAM.
  • SPO dynamically adjusts active sets based on optimality gaps and measurement changes, achieving substantial reductions in cumulative computational cost.

Selective Partial Optimization (SPO) is a family of block-decomposition algorithms for large-scale separable optimization, aimed at reducing per-iteration computational cost while maintaining strong convergence guarantees. The essential principle is to update and relinearize only “significant” or “active” variable blocks at each iteration—those that exhibit substantial violation of optimality or that are most affected by new information. Applications span convex composite optimization, sparse learning, and incremental nonlinear estimation such as SLAM, where SPO enables scalable and accurate solutions by adaptively focusing computational effort.

1. Problem Settings and Mathematical Formulation

SPO addresses structured composite minimization and nonlinear least-squares problems over product spaces. Consider the canonical composite optimization problem as formulated in (Konnov, 2016): let $X = X_1 \times \cdots \times X_n$, with each $X_i \subset \mathbb{R}^{n_i}$ nonempty, closed, and convex (often compact for simplicity). The objective is

$$\min_{x \in X} \; F(x) = f(x) + \sum_{i=1}^n h_i(x_i)$$

where

  • $f : \mathbb{R}^N \rightarrow \mathbb{R}$ is continuously differentiable (not necessarily convex),
  • $h_i : \mathbb{R}^{n_i} \rightarrow \mathbb{R} \cup \{+\infty\}$ are proper, convex, lower-semicontinuous, possibly nonsmooth, separable terms.

Block gradients $g(x) = \nabla f(x) = (g_1(x), \dots, g_n(x))$ are defined, with block-wise optimality gaps

$$p_i(x) = \max_{y_i \in X_i} \left\{ \langle g_i(x), x_i - y_i \rangle + h_i(x_i) - h_i(y_i) \right\}$$

A point $x^*$ is block-stationary if $p_i(x^*) = 0$ for all $i$, equivalently satisfying the associated mixed variational inequality condition.
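For intuition, the gap $p_i(x)$ has a closed form in the simple special case of a box set $X_i$ with $h_i = 0$, since the inner maximum is attained coordinate-wise at a corner of the box. A minimal sketch (the function name and the quadratic example are illustrative, not from the paper):

```python
import numpy as np

def block_gap(g_i, x_i, lo, hi):
    """Optimality gap p_i(x) for one block in the special case
    X_i = [lo, hi] (a box) and h_i = 0, so that
    p_i(x) = <g_i, x_i> - min_{y in X_i} <g_i, y>,
    with the minimum attained coordinate-wise at a box corner."""
    inner_min = np.minimum(g_i * lo, g_i * hi).sum()
    return float(g_i @ x_i - inner_min)

# Example: f(x) = 0.5 * ||x||^2 on [-1, 1]^2, so g(x) = x.
lo, hi = np.full(2, -1.0), np.full(2, 1.0)
x_i = np.array([0.5, -0.8])
gap = block_gap(x_i, x_i, lo, hi)                        # positive: not stationary
zero_gap = block_gap(np.zeros(2), np.zeros(2), lo, hi)   # 0 at the minimizer
```

A block is selected for an update exactly when this quantity exceeds the current tolerance $\delta$.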

For incremental nonlinear optimization such as SLAM, the optimization is written as

$$\min_{\mathbf{x}\in \mathbb{R}^N} \; c(\mathbf{x}) = \frac{1}{2}\sum_{j=1}^M \|\mathbf{m}_j - \mathbf{f}_j(\mathbf{x}_{\mathcal{V}_j})\|_{\boldsymbol\Sigma_j}^2$$

with state vector $\mathbf{x}$, measurement models $\mathbf{f}_j$, and measurement covariance matrices $\boldsymbol\Sigma_j$ (Arablouei, 13 Jan 2026).

2. Core Algorithms and Selective Update Mechanisms

The defining characteristic of SPO is the adaptive restriction of update and linearization steps to a subset of relevant variables.

Adaptive Partial Linearization (Konnov, 2016):

  • At each Basic Cycle iteration, compute the block gaps $p_i(x)$ and select any block $s$ with $p_s(x) \geq \delta$ (for a prescribed tolerance $\delta$).
  • Solve the linearized partial-minimization subproblem for block $s$; all other blocks remain fixed.
  • Perform an inexact Armijo-type line search along this subspace direction.
  • The outer loop decreases the tolerance $\delta_\ell \to 0$ geometrically, advancing to the next stage only when all $p_i(x) < \delta_\ell$.
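The steps above can be sketched as follows for the same simplified setting ($h_i = 0$, box sets $X_i$); the parameter names, default values, and the quadratic toy problem are illustrative assumptions, not values from (Konnov, 2016):

```python
import numpy as np

def adaptive_partial_linearization(f, grad, blocks, lo, hi, x0,
                                   delta0=1.0, shrink=0.5, tol=1e-2,
                                   beta=0.5, theta=1e-4, max_iter=50_000):
    """Basic Cycle with geometrically shrinking outer tolerance stages,
    for h_i = 0 and box sets X_i = [lo, hi] (illustrative sketch)."""
    x = x0.astype(float).copy()
    delta = delta0
    for _ in range(max_iter):
        g = grad(x)
        # Linearized minimizers over a box are corner points.
        y = np.where(g >= 0, lo, hi)
        # Block optimality gaps p_i(x) = <g_i, x_i - y_i>.
        gaps = [g[b] @ (x[b] - y[b]) for b in blocks]
        s = int(np.argmax(gaps))
        if gaps[s] < delta:        # all gaps below current tolerance
            if delta <= tol:       # final stage reached: stop
                break
            delta *= shrink        # tighten tolerance, start next stage
            continue
        # Subspace direction: move only the selected block s.
        d = np.zeros_like(x)
        d[blocks[s]] = y[blocks[s]] - x[blocks[s]]
        # Inexact Armijo-type backtracking line search along d.
        t, fx = 1.0, f(x)
        while f(x + t * d) > fx - theta * t * gaps[s]:
            t *= beta
        x = x + t * d
    return x

# Toy problem: f(x) = 0.5 * ||x - c||^2 over [-1, 1]^4, two blocks.
c = np.array([0.3, -0.4, 0.5, 0.2])
f = lambda x: 0.5 * np.sum((x - c) ** 2)
grad = lambda x: x - c
blocks = [np.arange(0, 2), np.arange(2, 4)]
lo, hi = np.full(4, -1.0), np.full(4, 1.0)
x = adaptive_partial_linearization(f, grad, blocks, lo, hi, np.zeros(4))
```

Only one block gradient and one low-dimensional subproblem are touched per iteration, which is the source of the per-iteration savings discussed below.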

Selective Partial Gauss–Newton for SLAM (Arablouei, 13 Jan 2026):

  • Partition the variables into an active set $\mathcal{S}$ (to be updated) and a static set $\mathcal{U}$ (held fixed).
  • At each GN iteration, solve the block-reduced normal equations only on $\mathcal{S}$.
  • After each solve, prune $\mathcal{S}$ by removing variables with small updates ($|d_i| \leq \tau_d$); expand $\mathcal{S}$ by including neighbors directly impacted by measurement changes.
  • Relinearize only those measurements (edges) incident to the current $\mathcal{S}$.
  • Terminate the GN loop when $\mathcal{S} = \emptyset$.
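A compact sketch of the prune-and-solve loop on a toy least-squares problem (the neighbor-expansion and relinearization bookkeeping of the full method are omitted for brevity; names, thresholds, and the toy residuals are assumptions):

```python
import numpy as np

def selective_partial_gn(r, J, x0, active0, tau_d=1e-8, max_iter=50):
    """Selective partial Gauss-Newton: solve the reduced normal
    equations on the active set S only, pruning converged variables
    after each pass (illustrative sketch)."""
    x = x0.astype(float).copy()
    S = sorted(active0)                    # active variable indices
    for _ in range(max_iter):
        if not S:                          # terminate when S is empty
            break
        Jx, rx = J(x), r(x)
        Js = Jx[:, S]                      # block-reduced Jacobian
        # Reduced normal equations (J_S^T J_S) d_S = -J_S^T r.
        d_S = np.linalg.solve(Js.T @ Js, -Js.T @ rx)
        x[S] += d_S
        # Prune: drop variables whose update magnitude is <= tau_d.
        S = [i for i, d in zip(S, d_S) if abs(d) > tau_d]
    return x

# Toy problem: residuals r(x) = (x0 + x1 - 1, x0 - x1), solution (0.5, 0.5).
r = lambda x: np.array([x[0] + x[1] - 1.0, x[0] - x[1]])
J = lambda x: np.array([[1.0, 1.0], [1.0, -1.0]])
x = selective_partial_gn(r, J, np.zeros(2), active0=[0, 1])
```

Because every variable's update shrinks below $\tau_d$ once it has converged, the active set empties itself and the loop terminates without a separate global convergence test.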

3. Theoretical Properties and Convergence Guarantees

The mathematical foundation of SPO relies on block-wise variational inequalities, classical block-coordinate descent principles, and convergence results for partial linearization.

Global Convergence (Konnov, 2016):

For product-domain problems under standard regularity assumptions (convexity and compactness of the sets $X_i$, uniform continuity of the gradient, convexity and lower semicontinuity of the $h_i$), the sequence of stage points converges to a block-stationary point. If $f$ is convex, this limit is globally optimal.

Finite Basic-Cycle Termination:

In every outer stage (fixed $\delta$), only a finite number of block updates is needed before all block gaps fall below the tolerance.

Rate and Complexity:

For smooth convex $f$ with block-Lipschitz gradients, the number of block updates $V(\epsilon)$ needed to reach $F(x) - F^* \leq \epsilon$ satisfies

$$V(\epsilon) \leq C \cdot \frac{\Delta/\epsilon - 1}{1 - v}$$

with $v \in (0,1)$ the geometric decay rate of the tolerances (Konnov, 2016).
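As a sanity check on how this bound scales, one can evaluate it for made-up constants ($C$, $\Delta$, and $v$ are problem-dependent; the values below are purely illustrative):

```python
# Bound on the number of block updates, V(eps) <= C * (Delta/eps - 1) / (1 - v),
# evaluated for made-up constants (C, Delta, v are problem-dependent).
C, Delta, v = 1.0, 10.0, 0.5

def block_update_bound(eps):
    return C * (Delta / eps - 1.0) / (1.0 - v)

# Tightening eps by 10x grows the bound roughly 10x: O(1/eps) scaling.
bounds = [block_update_bound(10.0 ** -k) for k in range(4)]
```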

Nonlinear Least-Squares Context (Arablouei, 13 Jan 2026):

Under a Lipschitz GN Hessian and positive-definite blocks, selective partial GN converges to the same stationary point as full GN, with local linear convergence; the rate becomes superlinear if the convergence threshold is driven to zero and the residuals vanish at the solution.

4. Computational Cost and Efficiency Analysis

SPO algorithms significantly reduce per-iteration cost by confining expensive updates to a dynamically chosen subset of variables.

| Step | Full CG / Frank–Wolfe | Block-Coord. CG | SPO |
|---|---|---|---|
| Blocks updated per iteration | All | Single (cyclic/random) | Significant blocks only |
| Block gradients required | $O(n)$ | 1 | 1 (for the selected block) |
| Gap reduction | Fast | Slower, uncontrolled | Adaptive, focused |

Per iteration, SPO computes only one block gradient and solves a low-dimensional subproblem. This enables reusing precomputed data and leveraging parallel structure, with the total number of block updates scaling sublinearly with the required solution accuracy.

Let $\kappa_{t,i}$ be the number of nonzeros in column $i$ of the Cholesky factor:

  • Static-block Cholesky up/down-date: $\min\left(2 \sum_{i\in\mathcal{S}_t}\kappa_{t,i}^2,\; \sum_{i=1}^N\kappa_{t,i}^2\right)$
  • Partial solve: $2\sum_{i\in\mathcal{S}_t}\kappa_{t,i}$

If $|\mathcal{S}_t| \ll N$, the computational load is much lower than that of full GN. Experiments demonstrate a 2–8× reduction in cumulative FLOPs at equivalent estimation accuracy (Arablouei, 13 Jan 2026).
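These counts translate directly into flop estimates. A small sketch comparing them against a full refactorization, using made-up nonzero counts (interpreting $\sum_i \kappa_{t,i}^2$ as the full-refactorization cost is an assumption here):

```python
import numpy as np

def spo_step_costs(kappa, S):
    """Flop estimates from the column nonzero counts kappa_i of the
    Cholesky factor, per the formulas in the text (illustrative;
    treating sum_i kappa_i^2 as the full refactorization cost is an
    assumption)."""
    kappa = np.asarray(kappa, dtype=float)
    full_refactor = np.sum(kappa ** 2)
    up_down_date = min(2.0 * np.sum(kappa[S] ** 2), full_refactor)
    partial_solve = 2.0 * np.sum(kappa[S])
    return up_down_date, partial_solve, full_refactor

# Hypothetical sparse factor: N = 1000 columns with 5 nonzeros each,
# and an active set of only 20 variables.
kappa = np.full(1000, 5)
S = np.arange(20)
ud, ps, full = spo_step_costs(kappa, S)   # 1000 + 200 flops vs 25000
```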

5. Integration with Information-Guided Gating and Hybrid Schemes

In data-rich, incremental settings such as SLAM, SPO is often combined with information-theoretic gating mechanisms (IGG):

  • The log-determinant of the information matrix, $\ln\det(\mathbf{J}_t^\mathsf{T}\mathbf{J}_t)$, quantifies the information gain from new measurements.
  • A detrended threshold $\Delta\eta_t$ determines whether to trigger a full global GN update or a selective (local) GN update on a small active set $\mathcal{S}_t$.
  • SPO then adaptively prunes and expands $\mathcal{S}_t$ across GN iterations, focusing effort where the residual change is largest.
  • This approach preserves global consistency in the estimate graph and maintains convergence and accuracy guarantees while minimizing redundant computation (Arablouei, 13 Jan 2026).
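A minimal sketch of the gating decision (the detrending of $\Delta\eta_t$ described above is simplified here to a plain difference against the previous log-determinant; the function name and thresholds are assumptions):

```python
import numpy as np

def igg_decision(J_t, eta_prev, tau_eta):
    """Information-guided gate: compare the change in ln det(J^T J)
    against a threshold to choose between a full (global) GN update
    and a selective (local) one. Illustrative simplification."""
    _, logdet = np.linalg.slogdet(J_t.T @ J_t)
    delta_eta = logdet - eta_prev
    # Large information gain -> relinearize globally; otherwise run a
    # selective local update on a small active set.
    mode = "global" if delta_eta > tau_eta else "local"
    return mode, logdet

# Toy check with made-up Jacobians and thresholds.
mode_hi, eta = igg_decision(np.eye(3), eta_prev=-5.0, tau_eta=1.0)
mode_lo, _ = igg_decision(0.1 * np.eye(3), eta_prev=0.0, tau_eta=1.0)
```

Using `slogdet` rather than `det` avoids overflow for the large, ill-scaled information matrices typical of SLAM.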

6. Applications and Numerical Results

Convex Optimization (Konnov, 2016)

  • Quadratic objectives with up to $N = 1500$ variables: adaptive SPO required 30–50% as many block-gradient calls as full CGM for comparable accuracy.
  • Composite convex objectives, e.g., $F(x) = f_1(x) + \sum_i c_i/(c_i^T x + T)$: SPO matched or exceeded the efficiency of block-coordinate methods.
  • Practical use-cases: Group-LASSO, block-structured regression, network equilibrium with elastic demands (path-flow decomposition), and resource allocation problems.

Incremental SLAM (Arablouei, 13 Jan 2026)

  • On standard 2D pose-graph datasets, SPO with IGG reduces cumulative solve FLOPs by factors of 2–8 versus full incremental GN, with virtually identical trajectory error and normalized $\chi^2$.
  • Prune/expand rules ensure that only variables with significant GN increments and their direct neighbors are updated and relinearized, exploiting locality in graph structure.
  • Adjustable thresholds ($\tau_d$, $\tau_\eta$) allow trading off computation against estimation accuracy.

7. Relation to Other Methods and Variants

SPO generalizes and strictly improves upon several existing block-decomposition and coordinate-update techniques:

  • Full Conditional Gradient (Frank–Wolfe): Each step involves all blocks and full gradients—a computational bottleneck for large-scale problems.
  • Coordinate or Block-Coordinate CG: Updates a single block randomly or in a fixed sequence without adaptivity to optimality gap sizes, lacking tolerance control and often requiring more iterations for convergence.
  • SPO: Uniquely focuses on blocks with meaningful optimality (or residual) violations, dynamically adjusts the active set, and enables line-search-free operation when block-Lipschitz constants are available. It subsumes block-coordinate approaches as special cases, with significantly better empirical performance on large-scale, sparse, or graph-structured problems.

In summary, Selective Partial Optimization is a theoretically grounded, computationally efficient strategy for solving large separable optimization and estimation problems by focusing resources on the most critical blocks or variables at each iteration, enabling scalability and accuracy across multiple domains (Konnov, 2016, Arablouei, 13 Jan 2026).
