Lossless Convexification
- Lossless Convexification is a framework that reformulates non-convex problems into convex ones via variable transformations and convex hull constructions.
- It guarantees global or near-global optimality by ensuring conditions like surjectivity, Slater’s condition, and normality are satisfied.
- Its applications span semidefinite programming, mixed-integer programming, and learning with discrete losses, enabling robust and scalable optimization.
Lossless convexification is a mathematical framework by which certain non-convex optimization or control problems are transformed, via change of variables or convex hull constructions, into convex programs whose optimal solutions coincide—under precise structural conditions—with those of the original problem. This equivalence can be global (no loss) or nearly global (with bounded violations determined by system dimensions or grid resolution). The framework appears in semidefinite programming, optimal control, mixed-integer quadratic and conic optimization, learning with discrete structures, and combinatorial surrogate loss design.
1. Structural Definition and Core Conditions
The canonical definition involves replacing a non-convex feasible set or cost by a “convexification” such that (a) all original feasible points map to the convex set, and (b) every feasible point of the convexified problem has a preimage in the non-convex set (“surjectivity condition”).
For an optimization problem
where and the matrix-valued constraints are non-convex, a lossless convexification exists if there is a variable change and convex such that
and every feasible in the convexified problem is for some feasible (Lee, 2021). This ensures both problems share the same optimal value and, possibly modulo preimage multiplicity, optimal points.
2. Semidefinite Programming and Strong Duality
In semidefinite programming, Lee (Lee, 2021) rigorously establishes that lossless convexification preserves strong duality for non-convex SDPs under Slater’s condition. The original non-convex program
may be non-convex in and/or in the constraint maps. After convex change-of-variable, the convexified SDP enjoys zero duality gap provided the surjectivity condition holds and there exists a strictly feasible 0 (all 1). This enables full dual KKT characterizations and often Riccati-equation representations in control (Lee, 2021).
3. Lossless Convexification in Optimal Control and Mixed-Integer Programming
Many non-convex and mixed-integer optimal control problems admit lossless convexifications via convex epigraphical relaxations in continuous time, and under certain dimensionality constraints, in discretized time.
3.1 Semi-continuous Inputs and SOCP
For actuator selection with inputs constrained to be either 2 (inactive) or norm-bounded (3), the convex relaxation replaces integer logic variables 4 by 5 and introduces slack variables 6 with second-order cone (SOC) constraints (Malyuta et al., 2019, Malyuta et al., 2019). If the system satisfies observability, normality, and transversality (“no abnormal multiplier”) conditions, then the SOCP solution recovers bang-bang controls 7 and norm constraints at solution, yielding global optimality (Malyuta et al., 2019, Malyuta et al., 2019).
3.2 Discrete-Time and DLCvx
In discrete-time, lossless convexification is more delicate: exactness may hold at all grid points only up to 8 violations, where 9 is the state-space dimension. Perturbation analysis and normality characterization show that, after an arbitrarily small generic perturbation to the recursion matrix, at most 0 grid points may fail to satisfy the original non-convex constraint (e.g., control norm lower bound) (Luo et al., 2024, Kiami, 2024). For first-order-hold parameterizations, the bound increases to 1 vertices and 2 edges. For “long-horizon” problems, a value-function bisection restores near-lossless properties (Luo et al., 2024, Kiami, 2024).
For discrete-time pointing and mixed-integer sector constraints, a dual multiplier–based analysis bounds possible slack violations, and the convex relaxation remains globally tight modulo a small, quantifiable number of grid anomalies (Luo et al., 12 Jan 2025).
3.3 Discrete-Valued and Fuel-Optimal Control
In fuel-optimal control with discrete actuator profiles, lossless convexification by convexification to a (cross-)polytopic relaxation (e.g., 3-ball) preserves the extremal/bang-bang property provided a unique Hamiltonian minimization holds (adjoint vector normality). All convexified solutions automatically saturate at the true (discrete) control values (Arenas-Uribe et al., 11 Nov 2025).
4. Lossless Convexification in Quadratic and Perspective Reformulation
The lossless convexification principle underlies the “perspective reformulation” for epigraphs of mixed-integer quadratic or convex programs involving step or semicontinuous penalties. The convex hull of a set 4 is
5
where 6 is convex ("perspective function") (Lee et al., 2020). This convex envelope strictly dominates product and “big-M” relaxations and is SOCP-representable for 7 (Lee et al., 2020).
For quadratic programs with finite-valued step-function penalties (e.g., SVM with 8-9 loss), rank-one convexification yields ideal SDP or copositive relaxations, recovers all integral optima, and is robust to label noise (Choi et al., 23 Apr 2025).
5. Convexification of Discrete Losses and Learning from Constraints
In empirical risk minimization with non-modular, discrete losses (e.g., Dice or multi-label losses), lossless convexification is achieved via unique submodular-supermodular decompositions and tight convex surrogates (e.g., Lovász-hinge for submodular part, slack-rescaling for supermodular part) (Yu et al., 2016). The resulting convex surrogate is piecewise-linear, matches the discrete loss at the “vertices” (i.e., hard labels), and is polynomial-time subgradient computable—lossless convexification in the sense of extension to the original discrete problem (Yu et al., 2016).
In learning from constraints with MIP structure and complex loss/regularization, the tightest convex extension (Legendre-Fenchel biconjugate) is lossless on the vertices; efficient per-instance decompositions permit scalable computation and preserve exact minima with fractional relaxations (Shcherbatyi et al., 2016).
6. Functionals with Structural Penalties: Convex Envelope
For objectives of the form 0, the lower semi-continuous convex envelope can be computed losslessly via the 1-transform when 2 is identity (or under spectral majorization/minorization otherwise) (Carlsson, 2016). The condition 3 certifies global optimality and lossless recovery of structured minimizers (sparse, low rank) with no relaxation gap. This construction generalizes 4 or nuclear norm relaxations and bridges to the convex hull of step-function and rank constraints.
7. Unified Perspective and Applications
Lossless convexification is a structurally sensitive paradigm: its global optimality hinges on observability/normality/Slater-type conditions, surjectivity of variable change, or combinatorial geometric properties. It enables polynomial-time, robust global optimization in settings previously considered intractable, including mixed-integer optimal control, quadratic programming with indicator variables, and learning under hard combinatorial constraints.
Key properties:
| Domain | Structural Condition | Nature of Guarantee |
|---|---|---|
| Non-convex SDP | Surjective change of variables, Slater | Zero duality gap, KKT necessary/sufficient |
| Mixed-integer OC | Observability/normality, SOC relaxation | Global optima almost everywhere |
| Perspective/MINLO | Convex perspective, epigraph tightness | Convex hull, exact SOCP |
| Discrete surrogate | Extension property, unique decomposition | Lossless surrogate, polynomial-time computation |
| General structured 5 | 6-transform, spectrum comparison | Envelope losslessness at minimizers |
The approach unifies and subsumes classical relaxations, offering sharp a priori or a posteriori certificates of losslessness, and informs the design of scalable algorithms for non-convex and mixed-integer problems across control, learning, and combinatorial optimization (Lee, 2021, Yu et al., 2016, Malyuta et al., 2019, Choi et al., 23 Apr 2025, Luo et al., 2024, Arenas-Uribe et al., 11 Nov 2025, Carlsson, 2016).