Infimal Convolution Cost in Convex Analysis
- Infimal convolution cost is a mathematical framework in convex analysis that blends multiple cost functions or regularizers through variational minimization.
- It is applied in imaging, optimal transport, and learning to decompose energies and model multiple noise types or structural priors.
- The method preserves convexity and duality, facilitating efficient decomposition and the recovery of structured solutions in complex inverse problems.
The infimal convolution cost is a construct originating in convex analysis that allows the blending of multiple cost or regularization structures via a variational minimization. This cost structure appears under diverse guises across mathematical optimization, partial differential equations, image processing, statistical mechanics, optimal transport, and learning theory. Formally, the infimal convolution of two (extended-real) functions $f, g \colon X \to \mathbb{R} \cup \{+\infty\}$ on a vector space $X$ is the function defined by

$$(f \,\square\, g)(x) = \inf_{x_1 + x_2 = x} \big\{ f(x_1) + g(x_2) \big\}.$$
This operation constructs composite fidelity terms in variational models, dualizes to sums under convex conjugation, and defines metric and probabilistic costs in unbalanced optimal transport. Its use as a "cost" is particularly crucial when the underlying mathematical problem, such as regularization, transport, or duality, requires the joint modeling or interpolation of multiple, potentially antagonistic, effects.
1. Definition and General Construction
Infimal convolution cost arises by minimizing over all possible splittings of a variable into two (or more) components, penalizing each component by an associated cost, and aggregating through the infimum. For $f, g \colon X \to \mathbb{R} \cup \{+\infty\}$, the prototypical definition is

$$(f \,\square\, g)(x) = \inf_{y \in X} \big\{ f(y) + g(x - y) \big\}.$$
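As a concrete illustration (not taken from the cited papers), a minimal numerical sketch of this definition on a uniform grid, computing $(f \,\square\, g)(x)$ by brute-force minimization over splittings; the grid, the two cost functions, and the helper name `infimal_convolution` are illustrative choices:

```python
import numpy as np

# Uniform grid on which both functions are sampled (illustrative choice).
x = np.linspace(-5.0, 5.0, 1001)
f = 0.5 * x**2            # quadratic cost
g = np.abs(x)             # absolute-value cost

def infimal_convolution(f_vals, g_vals, grid):
    """Brute-force (f □ g)(x_i) = min_j { f(x_j) + g(x_i - x_j) },
    with g(x_i - x_j) evaluated by linear interpolation on the grid."""
    out = np.empty_like(f_vals)
    for i, xi in enumerate(grid):
        g_shift = np.interp(xi - grid, grid, g_vals)  # g(x_i - x_j) for all j
        out[i] = np.min(f_vals + g_shift)
    return out

h = infimal_convolution(f, g, x)
# For f = t^2/2 and g = |t| the result is the Huber function, which
# gives a quick sanity check up to discretization error:
huber = np.where(np.abs(x) <= 1.0, 0.5 * x**2, np.abs(x) - 0.5)
print(np.max(np.abs(h - huber)))
```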
In the context of optimal transport and metric geometry, the corresponding construction for (pseudo-)distances $d_1, d_2$ on a space $X$ is

$$(d_1 \,\square\, d_2)(x, y) = \inf_{z \in X} \big\{ d_1(x, z) + d_2(z, y) \big\},$$

which is the "one-step" or "metric infimal convolution." Higher-step variants (iterated infimal convolutions) can be defined for $n$-step paths as

$$(d_1 \,\square\, \cdots \,\square\, d_n)(x, y) = \inf_{x = z_0,\, z_1, \dots,\, z_n = y} \; \sum_{i=1}^{n} d_i(z_{i-1}, z_i).$$
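On a finite space, the one-step metric infimal convolution is exactly a min-plus matrix product of the two distance matrices, and iterating that product builds the multi-step variant. A small sketch with hypothetical distance matrices:

```python
import numpy as np

def metric_infconv(D1, D2):
    """One-step metric infimal convolution of two distance matrices:
    (D1 □ D2)[x, y] = min_z D1[x, z] + D2[z, y]  (a min-plus matrix product)."""
    # Entry (x, z, y) of the broadcast sum holds D1[x, z] + D2[z, y];
    # minimizing over z gives the one-step cost.
    return np.min(D1[:, :, None] + D2[None, :, :], axis=1)

# Two toy pseudo-distance matrices on a 3-point space (illustrative values).
D1 = np.array([[0., 1., 4.], [1., 0., 2.], [4., 2., 0.]])
D2 = np.array([[0., 3., 1.], [3., 0., 5.], [1., 5., 0.]])

one_step = metric_infconv(D1, D2)        # (d1 □ d2)(x, y)
two_step = metric_infconv(one_step, D1)  # an iterated, 3-step path cost
print(one_step)
```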
The infimal convolution preserves convexity and, under suitable conditions, regularity and coercivity. A fundamental property is its duality: for proper convex $f, g$, one has $(f \,\square\, g)^* = f^* + g^*$, where $^*$ denotes the Fenchel conjugate (Mahmudov, 2019).
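The conjugacy identity can be checked numerically. The sketch below (an illustration, not from the cited work) compares $(f \,\square\, g)^*$ against $f^* + g^*$ for two quadratics, with both the conjugate and the infimal convolution computed by grid search:

```python
import numpy as np

grid = np.linspace(-4.0, 4.0, 801)
a, b = 1.0, 3.0
f = 0.5 * a * grid**2
g = 0.5 * b * grid**2

def conjugate(vals, grid, duals):
    # Fenchel conjugate h*(s) = sup_x { s*x - h(x) }, by grid search.
    return np.array([np.max(s * grid - vals) for s in duals])

def infconv(f_vals, g_vals, grid):
    # (f □ g)(x) = min_y { f(y) + g(x - y) }, by grid search with interpolation.
    out = np.empty_like(f_vals)
    for i, xi in enumerate(grid):
        out[i] = np.min(f_vals + np.interp(xi - grid, grid, g_vals))
    return out

duals = np.linspace(-1.0, 1.0, 101)  # dual points well inside the grid's slope range
lhs = conjugate(infconv(f, g, grid), grid, duals)            # (f □ g)*
rhs = conjugate(f, grid, duals) + conjugate(g, grid, duals)  # f* + g*
print(np.max(np.abs(lhs - rhs)))  # agree up to discretization error
```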
2. Applications in Variational Regularization and Imaging
Infimal convolution costs have become central in advanced regularization schemes for inverse problems, image denoising, and signal processing. In such problems, the energy to minimize typically splits into a data fidelity term (matching the observed data $f$) and one or several regularization functionals enforcing desired smoothness or sparsity. The infimal convolution allows formulating composite regularizers that interpolate between different structural priors.
A major example is the TVL$^p$ family (Burger et al., 2015, Burger et al., 2015):
| Model | Regularizer Formulation |
|---|---|
| TV–L$^2$ infimal convolution | $\min_{w}\; \alpha\,\|Du - w\|_{\mathcal{M}} + \beta\,\|w\|_{L^2}$ |
| TVL$^p$ ($1 < p \le \infty$) | $\min_{w}\; \alpha\,\|Du - w\|_{\mathcal{M}} + \beta\,\|w\|_{L^p}$ |
The infimal convolution splits the (distributional) derivative into a possibly unbounded and a bounded part, regularizes each, and leads to models that interpolate between standard total variation (TV), Huber-type TV, and total generalized variation (TGV). This approach eliminates artifacts such as staircasing and better preserves complicated geometric structures (Burger et al., 2015, Burger et al., 2015, Gao et al., 2017).
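As an illustrative sketch (not code from the cited papers), a discrete 1-D TV–L$^2$ infimal-convolution denoising model can be written directly in cvxpy; the toy signal, the weights $\alpha, \beta$, and the discretization are hypothetical choices:

```python
import numpy as np
import cvxpy as cp

# Noisy piecewise-linear toy signal (hypothetical data).
rng = np.random.default_rng(0)
n = 200
t = np.linspace(0.0, 1.0, n)
clean = np.where(t < 0.5, 2.0 * t, 2.0 - 2.0 * t)
f = clean + 0.05 * rng.standard_normal(n)

# TV-L^2 infimal-convolution regularizer: split the discrete derivative Du
# into a TV-penalized part (Du - w) and an L^2-penalized part (w).
alpha, beta = 0.5, 5.0
u = cp.Variable(n)
w = cp.Variable(n - 1)
fidelity = 0.5 * cp.sum_squares(u - f)
regularizer = alpha * cp.norm(cp.diff(u) - w, 1) + beta * cp.norm(w, 2)
cp.Problem(cp.Minimize(fidelity + regularizer)).solve()

u_hat = u.value  # denoised signal; w.value is the recovered bounded slope part
```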
In image denoising for mixed noise, the data discrepancy itself is constructed as an infimal convolution to capture, e.g., both salt-and-pepper and Gaussian noise:

$$(\Phi_1 \,\square\, \Phi_2)(f - u) = \inf_{v} \big\{ \lambda_1 \Phi_1(v) + \lambda_2 \Phi_2(f - u - v) \big\},$$

where $\Phi_1$ and $\Phi_2$ penalize, respectively, the different noise types (Calatroni et al., 2016).
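For orientation, a standard computation (not specific to the cited paper): with $\Phi_1 = |\cdot|$ and $\Phi_2 = \tfrac12|\cdot|^2$ applied pointwise to the residual $r = f - u$, the infimal convolution has a closed form,

$$\inf_{v \in \mathbb{R}} \Big\{ \lambda_1 |v| + \tfrac{\lambda_2}{2}\,(r - v)^2 \Big\}
= \begin{cases} \tfrac{\lambda_2}{2}\, r^2, & |r| \le \lambda_1/\lambda_2, \\[2pt] \lambda_1 |r| - \tfrac{\lambda_1^2}{2\lambda_2}, & |r| > \lambda_1/\lambda_2, \end{cases}$$

a Huber-type discrepancy: the minimizer $v^* = \operatorname{sign}(r)\max(|r| - \lambda_1/\lambda_2,\, 0)$ is a soft-thresholding of the residual and isolates the impulsive noise component, while the remainder is penalized quadratically.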
3. Metric Structures and Optimal Transport
The infimal convolution cost plays a key role in the modern theory of unbalanced optimal transport. The celebrated result is the metric infimal convolution decomposition of the Hellinger–Kantorovich (HK) distance on nonnegative measures (Ponti et al., 17 Mar 2025). Given the Hellinger (Fisher–Rao) distance $\mathsf{He}$ and the Wasserstein distance $W_2$ on measures, the squared HK distance is given by

$$\mathsf{HK}^2(\mu_0, \mu_1) = \inf_{\mu} \big\{ \mathsf{He}^2(\mu_0, \mu) + W_2^2(\mu, \mu_1) \big\},$$
with suitable extensions to multi-step (iterated) convolutions and dual forms. This structure is exact in the sense that minimizing paths for the HK geodesic flow correspond to sequences of Hellinger and Wasserstein updates (Ponti et al., 17 Mar 2025).
In multi-marginal optimal transport, the infimal-convolution cost

$$c(x_1, \dots, x_N) = \inf_{y} \; \sum_{i=1}^{N} \lambda_i\, |x_i - y|^2$$
serves as the cost function in both static and dynamical formulations. The associated Benamou–Brenier dynamical version consists of coupled continuity equations whose common initial measure is the Wasserstein barycenter. The equivalence between the static infimal convolution MMOT and the dynamical barycenter–flow formulation is now established (Krannich, 14 Dec 2025).
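As a quick numerical illustration (hypothetical values, assuming the quadratic cost above with weights summing to one): the inner minimization is attained at the weighted Euclidean barycenter $y^* = \sum_i \lambda_i x_i$, which a grid search confirms:

```python
import numpy as np

# Weights summing to one and sample points (hypothetical values).
lam = np.array([0.2, 0.3, 0.5])
xs = np.array([-1.0, 0.5, 2.0])

# Closed form: min_y sum_i lam_i (x_i - y)^2 is attained at the weighted mean.
y_star = np.sum(lam * xs)
c_closed = np.sum(lam * (xs - y_star) ** 2)

# Brute-force check over a fine grid of candidate barycenters y.
ys = np.linspace(-5.0, 5.0, 100001)
c_grid = np.min(np.sum(lam[:, None] * (xs[:, None] - ys[None, :]) ** 2, axis=0))

print(c_closed, c_grid)  # agree up to grid resolution
```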
4. Duality, Conjugation, and the Infimal Convolution Inequality
Infimal convolution naturally arises in convex duality, particularly in the Fenchel–Rockafellar theorem and subdifferential calculus. For proper convex $f, g$, the conjugacy identity

$$(f \,\square\, g)^* = f^* + g^*$$

holds independently of further interiority (qualification) constraints (Mahmudov, 2019).
In probability theory, the convex infimal-convolution inequality (ICI) provides concentration and moment inequalities for random vectors: a measure $\mu$ satisfies the inequality with cost $w$ if, for every convex function $f$,

$$\int e^{\,f \,\square\, w}\, d\mu \cdot \int e^{-f}\, d\mu \;\le\; 1,$$

where $w$ is an optimal cost function, typically a dilation of the Legendre transform of the log-Laplace transform of $\mu$ (Strzelecka et al., 2017).
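For intuition, a classical special case (Maurey's property $(\tau)$, stated here as an illustration rather than the cited paper's result): the standard Gaussian satisfies such an inequality with the quadratic cost $w(x) = x^2/4$. A direct quadrature check for a quadratic test function (the grid and the choice of $f$ are hypothetical):

```python
import numpy as np

# Standard Gaussian density on a truncated grid (tails are negligible here).
x = np.linspace(-12.0, 12.0, 4001)
dx = x[1] - x[0]
gauss = np.exp(-x**2 / 2) / np.sqrt(2 * np.pi)

def infconv(f_vals, g_vals, grid):
    # (f □ g)(x) = min_y { f(y) + g(x - y) }, by grid search with interpolation.
    out = np.empty_like(f_vals)
    for i, xi in enumerate(grid):
        out[i] = np.min(f_vals + np.interp(xi - grid, grid, g_vals))
    return out

f = 0.2 * x**2        # a convex test function (hypothetical choice)
w = x**2 / 4.0        # Maurey's cost for the standard Gaussian

lhs = np.sum(np.exp(infconv(f, w, x)) * gauss) * dx   # integral of e^{f □ w} dmu
rhs = np.sum(np.exp(-f) * gauss) * dx                 # integral of e^{-f} dmu
print(lhs * rhs)  # <= 1, as the infimal convolution inequality predicts
```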
In analysis on metric graphs, the infimal convolution operator generates semigroups satisfying discrete analogues of Hopf–Lax evolution and Hamilton–Jacobi equations, and thereby establishes equivalences between hypercontractivity, (modified) log–Sobolev inequalities, and transport–entropy inequalities (Shu, 2015).
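A minimal sketch of the discrete Hopf–Lax operator $Q_t f(x) = \min_y \{ f(y) + d(x,y)^2/(2t) \}$, which is an infimal convolution of $f$ with a quadratic cost built from the graph distance; the graph, the function $f$, and the helper names are hypothetical:

```python
import numpy as np

# Adjacency weights of a small graph (np.inf = no edge); hypothetical example.
INF = np.inf
W = np.array([[0,   1,   INF, 4],
              [1,   0,   2,   INF],
              [INF, 2,   0,   1],
              [4,   INF, 1,   0]], dtype=float)

# Floyd-Warshall: all-pairs shortest-path distance matrix d(x, y).
d = W.copy()
for k in range(len(d)):
    d = np.minimum(d, d[:, [k]] + d[[k], :])

def hopf_lax(f, d, t):
    # Q_t f(x) = min_y { f(y) + d(x, y)^2 / (2 t) }.
    return np.min(f[None, :] + d**2 / (2.0 * t), axis=1)

f = np.array([0.0, 3.0, 1.0, 2.0])   # initial function on the vertices
print(hopf_lax(f, d, t=0.5))          # one step of the Hopf-Lax semigroup
```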
5. Statistical Inference and Learning
Infimal convolution costs have been leveraged in statistical estimation for robust regression losses and penalization. In functional output regression, the infimal convolution of the squared output-space norm $\tfrac{1}{2}\|\cdot\|_Y^2$ and a convex term $\kappa\,\|\cdot\|_p$ generates Huber-type and $\epsilon$-insensitive losses:

$$H^p_\kappa := \tfrac{1}{2}\,\|\cdot\|_Y^2 \,\square\, \kappa\,\|\cdot\|_p,$$
with closed-form expressions involving $p$-norm projections, yielding losses that combine quadratic and linear penalties for outlier tolerance (Lambert et al., 2022). The dual forms become tractable, and the approach provides a spectrum from purely quadratic to robust or sparsity-promoting behavior, depending on the infimal convolution's secondary component.
In tropical and max-product inference for graphical models, the $(\max,+)$ infimal (max-)convolution for sequences,

$$(f \,\boxdot\, g)(z) = \max_{x + y = z} \big\{ f(x) + g(y) \big\},$$

enables fast algorithms for MAP inference in convolution tree structures and specialized hidden Markov models, critically reducing computational cost via $p$-norm approximations and FFT-based convolution (Serang, 2015).
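A brute-force sketch of the $(\max,+)$ convolution of two finite sequences; the fast $p$-norm/FFT approximations in (Serang, 2015) replace exactly this quadratic-time step, and the values below are hypothetical:

```python
import numpy as np

def maxplus_convolve(f, g):
    """(f ⊡ g)(z) = max over x + y = z of f(x) + g(y), for 0-indexed sequences."""
    out = np.full(len(f) + len(g) - 1, -np.inf)
    for x, fx in enumerate(f):
        for y, gy in enumerate(g):
            out[x + y] = max(out[x + y], fx + gy)
    return out

# Log-scores of two independent discrete factors (hypothetical values).
f = np.log(np.array([0.1, 0.6, 0.3]))
g = np.log(np.array([0.5, 0.5]))
print(maxplus_convolve(f, g))  # log of the max-product score for each total z
```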
6. Abstract, Infinite, and High-Dimensional Generalizations
The abstract form of the infimal convolution extends to infinite-dimensional or parametrized families of one-homogeneous convex functionals, leading to "infinite infimal convolution" regularizers of the (schematic) form

$$\mathcal{R}(u) = \inf\Big\{ \int_A \varphi_a(u_a)\, d\mu(a) \;:\; u = \int_A u_a\, d\mu(a) \Big\},$$
with a sparse representer theorem derived for finite-dimensional measurements and generalized conditional gradient (off-the-grid) optimization schemes. These extensions admit regularizers capable of adapting smoothness and anisotropy, with provable well-posedness, coercivity, and convergence guarantees (Bredies et al., 2023).
7. Significant Theoretical and Practical Properties
Key theoretical properties of the infimal convolution cost include:
- Preservation of convexity: the infimal convolution $f \,\square\, g$ of convex lsc functionals remains convex; lower semicontinuity is preserved under additional assumptions such as exactness or coercivity of the convolution.
- Coercivity: if the constituent functionals are coercive (up to finite-dimensional nullspaces), their infimal convolution remains coercive (Gao et al., 2017, Burger et al., 2015).
- Decomposability and structure recovery: The cost enables splitting the solution into interpretable components, e.g., noise types in imaging (Calatroni et al., 2016), or texture/frequency directions in oscillatory TGV (Gao et al., 2017).
- Duality and exactness: infimal convolution plays a critical role in strong duality results and in the construction of explicit dual problems in convex control, regularization, and path-space problems (Mahmudov, 2019, Shu, 2015).
In summary, infimal convolution costs provide a universal, rigorous approach to synthesizing complex energies across convex analysis, transport theory, statistical estimation, and image regularization, with extensive structural, dual, computational, and interpretability benefits.