Papers
Topics
Authors
Recent
Search
2000 character limit reached

Lossless Convexification

Updated 27 April 2026
  • Lossless Convexification is a framework that reformulates non-convex problems into convex ones via variable transformations and convex hull constructions.
  • It guarantees global or near-global optimality by ensuring conditions like surjectivity, Slater’s condition, and normality are satisfied.
  • Its applications span semidefinite programming, mixed-integer programming, and learning with discrete losses, enabling robust and scalable optimization.

Lossless convexification is a mathematical framework by which certain non-convex optimization or control problems are transformed, via change of variables or convex hull constructions, into convex programs whose optimal solutions coincide—under precise structural conditions—with those of the original problem. This equivalence can be global (no loss) or nearly global (with bounded violations determined by system dimensions or grid resolution). The framework appears in semidefinite programming, optimal control, mixed-integer quadratic and conic optimization, learning with discrete structures, and combinatorial surrogate loss design.

1. Structural Definition and Core Conditions

The canonical definition involves replacing a non-convex feasible set or cost by a “convexification” such that (a) all original feasible points map to the convex set, and (b) every feasible point of the convexified problem has a preimage in the non-convex set (“surjectivity condition”).

For an optimization problem

minxRn  f(x)s.t.Φi(x)0,i=1,...,N,\min_{x \in \mathbb{R}^n} \; f(x) \quad \text{s.t.} \quad \Phi_i(x) \preceq 0, \quad i=1, ..., N,

where ff and the matrix-valued constraints are non-convex, a lossless convexification exists if there is a variable change v=h(x)v = h(x) and convex f,Φif', \Phi_i' such that

f(x)=f(h(x)),Φi(x)=Φi(h(x)),f(x) = f'(h(x)), \quad \Phi_i(x) = \Phi_i'(h(x)),

and every feasible vv in the convexified problem is v=h(x)v = h(x) for some feasible xx (Lee, 2021). This ensures both problems share the same optimal value and, possibly modulo preimage multiplicity, optimal points.

2. Semidefinite Programming and Strong Duality

In semidefinite programming, Lee (Lee, 2021) rigorously establishes that lossless convexification preserves strong duality for non-convex SDPs under Slater’s condition. The original non-convex program

minxRnf(x)s.t.Φi(x)0\min_{x \in \mathbb{R}^n} f(x) \quad \text{s.t.} \quad \Phi_i(x) \preceq 0

may be non-convex in ff and/or in the constraint maps. After convex change-of-variable, the convexified SDP enjoys zero duality gap provided the surjectivity condition holds and there exists a strictly feasible ff0 (all ff1). This enables full dual KKT characterizations and often Riccati-equation representations in control (Lee, 2021).

3. Lossless Convexification in Optimal Control and Mixed-Integer Programming

Many non-convex and mixed-integer optimal control problems admit lossless convexifications via convex epigraphical relaxations in continuous time, and under certain dimensionality constraints, in discretized time.

3.1 Semi-continuous Inputs and SOCP

For actuator selection with inputs constrained to be either ff2 (inactive) or norm-bounded (ff3), the convex relaxation replaces integer logic variables ff4 by ff5 and introduces slack variables ff6 with second-order cone (SOC) constraints (Malyuta et al., 2019, Malyuta et al., 2019). If the system satisfies observability, normality, and transversality (“no abnormal multiplier”) conditions, then the SOCP solution recovers bang-bang controls ff7 and norm constraints at solution, yielding global optimality (Malyuta et al., 2019, Malyuta et al., 2019).

3.2 Discrete-Time and DLCvx

In discrete-time, lossless convexification is more delicate: exactness may hold at all grid points only up to ff8 violations, where ff9 is the state-space dimension. Perturbation analysis and normality characterization show that, after an arbitrarily small generic perturbation to the recursion matrix, at most v=h(x)v = h(x)0 grid points may fail to satisfy the original non-convex constraint (e.g., control norm lower bound) (Luo et al., 2024, Kiami, 2024). For first-order-hold parameterizations, the bound increases to v=h(x)v = h(x)1 vertices and v=h(x)v = h(x)2 edges. For “long-horizon” problems, a value-function bisection restores near-lossless properties (Luo et al., 2024, Kiami, 2024).

For discrete-time pointing and mixed-integer sector constraints, a dual multiplier–based analysis bounds possible slack violations, and the convex relaxation remains globally tight modulo a small, quantifiable number of grid anomalies (Luo et al., 12 Jan 2025).

3.3 Discrete-Valued and Fuel-Optimal Control

In fuel-optimal control with discrete actuator profiles, lossless convexification by convexification to a (cross-)polytopic relaxation (e.g., v=h(x)v = h(x)3-ball) preserves the extremal/bang-bang property provided a unique Hamiltonian minimization holds (adjoint vector normality). All convexified solutions automatically saturate at the true (discrete) control values (Arenas-Uribe et al., 11 Nov 2025).

4. Lossless Convexification in Quadratic and Perspective Reformulation

The lossless convexification principle underlies the “perspective reformulation” for epigraphs of mixed-integer quadratic or convex programs involving step or semicontinuous penalties. The convex hull of a set v=h(x)v = h(x)4 is

v=h(x)v = h(x)5

where v=h(x)v = h(x)6 is convex ("perspective function") (Lee et al., 2020). This convex envelope strictly dominates product and “big-M” relaxations and is SOCP-representable for v=h(x)v = h(x)7 (Lee et al., 2020).

For quadratic programs with finite-valued step-function penalties (e.g., SVM with v=h(x)v = h(x)8-v=h(x)v = h(x)9 loss), rank-one convexification yields ideal SDP or copositive relaxations, recovers all integral optima, and is robust to label noise (Choi et al., 23 Apr 2025).

5. Convexification of Discrete Losses and Learning from Constraints

In empirical risk minimization with non-modular, discrete losses (e.g., Dice or multi-label losses), lossless convexification is achieved via unique submodular-supermodular decompositions and tight convex surrogates (e.g., Lovász-hinge for submodular part, slack-rescaling for supermodular part) (Yu et al., 2016). The resulting convex surrogate is piecewise-linear, matches the discrete loss at the “vertices” (i.e., hard labels), and is polynomial-time subgradient computable—lossless convexification in the sense of extension to the original discrete problem (Yu et al., 2016).

In learning from constraints with MIP structure and complex loss/regularization, the tightest convex extension (Legendre-Fenchel biconjugate) is lossless on the vertices; efficient per-instance decompositions permit scalable computation and preserve exact minima with fractional relaxations (Shcherbatyi et al., 2016).

6. Functionals with Structural Penalties: Convex Envelope

For objectives of the form f,Φif', \Phi_i'0, the lower semi-continuous convex envelope can be computed losslessly via the f,Φif', \Phi_i'1-transform when f,Φif', \Phi_i'2 is identity (or under spectral majorization/minorization otherwise) (Carlsson, 2016). The condition f,Φif', \Phi_i'3 certifies global optimality and lossless recovery of structured minimizers (sparse, low rank) with no relaxation gap. This construction generalizes f,Φif', \Phi_i'4 or nuclear norm relaxations and bridges to the convex hull of step-function and rank constraints.

7. Unified Perspective and Applications

Lossless convexification is a structurally sensitive paradigm: its global optimality hinges on observability/normality/Slater-type conditions, surjectivity of variable change, or combinatorial geometric properties. It enables polynomial-time, robust global optimization in settings previously considered intractable, including mixed-integer optimal control, quadratic programming with indicator variables, and learning under hard combinatorial constraints.

Key properties:

Domain Structural Condition Nature of Guarantee
Non-convex SDP Surjective change of variables, Slater Zero duality gap, KKT necessary/sufficient
Mixed-integer OC Observability/normality, SOC relaxation Global optima almost everywhere
Perspective/MINLO Convex perspective, epigraph tightness Convex hull, exact SOCP
Discrete surrogate Extension property, unique decomposition Lossless surrogate, polynomial-time computation
General structured f,Φif', \Phi_i'5 f,Φif', \Phi_i'6-transform, spectrum comparison Envelope losslessness at minimizers

The approach unifies and subsumes classical relaxations, offering sharp a priori or a posteriori certificates of losslessness, and informs the design of scalable algorithms for non-convex and mixed-integer problems across control, learning, and combinatorial optimization (Lee, 2021, Yu et al., 2016, Malyuta et al., 2019, Choi et al., 23 Apr 2025, Luo et al., 2024, Arenas-Uribe et al., 11 Nov 2025, Carlsson, 2016).

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Lossless Convexification.