Papers
Topics
Authors
Recent
2000 character limit reached

Barycentric Optimal Transport

Updated 27 November 2025
  • Barycentric optimal transport is a weak OT formulation that uses cost functions based on the conditional mean (barycenter) of target variables.
  • Its dual formulation reveals a convex order structure, ensuring uniqueness via Brenier maps in quadratic settings.
  • Applications span constrained map estimation in machine learning to robust economic matching, with efficient mirror descent algorithms for discrete measures.

The barycentric optimal transport problem is a class of weak optimal transport (WOT) problems characterized by cost functions that depend on the source and the barycenter (conditional mean) of the target variables under a chosen coupling. This formulation encompasses classical quadratic cost but generalizes to convex costs and finds applications in constrained transport map estimation, weak matching, robust barycenters, and aggregation models in economics and machine learning.

1. Mathematical Formulation and Static Characterization

Let μP(X)\mu \in \mathcal P(\mathcal X) and νP(Y)\nu \in \mathcal P(\mathcal Y) be probability measures on compact, convex domains. For a coupling πΠ(μ,ν)\pi \in \Pi(\mu,\nu), with disintegration π(dx,dy)=μ(dx)π(dyx)\pi(dx,dy)=\mu(dx)\,\pi(dy|x), the barycentric map at xx is the conditional mean: b(π,x)=Yyπ(dyx)=Eyπ(x)[y].b(\pi,x) = \int_{\mathcal Y} y\,\pi(dy|x) = \mathbb E_{y\sim\pi(\cdot|x)}[y]. Given a cost c(x,z)c(x,z), convex in zz, the barycentric WOT problem is: WOTbary(μ,ν)=infπΠ(μ,ν)Xc(x,b(π,x))μ(dx).\mathsf{WOT}_\mathrm{bary}(\mu,\nu) = \inf_{\pi\in\Pi(\mu,\nu)} \int_{\mathcal X} c\bigl(x,\,b(\pi,x)\bigr)\,\mu(dx). The most studied special case uses quadratic cost c(x,z)=xz2c(x,z)=\|x-z\|^2, yielding the quadratic barycentric optimal transport problem (Gozlan et al., 5 Sep 2025, Guo et al., 26 Nov 2025, Paty et al., 2022).

2. Dual Formulation and Convex Order Structure

Under the condition that zc(x,z)z\mapsto c(x,z) is convex and lower semicontinuous, the dual can be written as (Paty et al., 2022): WOTbary(μ,ν)=infψConvLip(conv(Y)){XQc[ψ](x)μ(dx)+Yψ(y)ν(dy)}\mathsf{WOT}_\mathrm{bary}(\mu,\nu) = \inf_{\psi \in \mathrm{ConvLip}(\mathrm{conv}(\mathcal Y))} \left\{ \int_{\mathcal X} Q_c[\psi](x)\,\mu(dx) + \int_{\mathcal Y}\psi(y)\,\nu(dy) \right\} where Qc[ψ](x)=supz{c(x,z)ψ(z)}Q_c[\psi](x) = \sup_{z} \{c(x,z) - \psi(z)\} is the Hamiltonian (sup-transform).

In the quadratic case, a convex order projection emerges: the barycentric OT cost is equal to the minimal W22W_2^2 distance among all projections of μ\mu onto measures η\eta dominated by ν\nu in convex order (i.e., ηcν\eta \preceq_c \nu) (Guo et al., 26 Nov 2025, Gozlan et al., 5 Sep 2025): T2(μ,ν)=infηcνW22(μ,η).\overline T_2(\mu,\nu) = \inf_{\eta\preceq_c\nu} W_2^2(\mu,\eta). The optimizer ηˉ\bar\eta is unique and given as the pushforward of μ\mu under a 1-Lipschitz convex function (Brenier map).

3. Dynamic (Benamou–Brenier) and Martingale Analogues

The barycentric OT problem admits a dynamic formulation analogous to the Benamou–Brenier formula for classical OT, but only penalizing the drift component (Guo et al., 26 Nov 2025): T2(μ,ν)=inf(Xt)E[01vt2dt],\overline T_2(\mu,\nu) = \inf_{(X_t)} \mathbb E\left[ \int_0^1 |v_t|^2dt \right], where processes XtX_t satisfy dXt=vtdt+σtdBtdX_t = v_tdt + \sigma_t dB_t, X0μX_0 \sim \mu, X1νX_1 \sim \nu, and σt\sigma_t is arbitrary (martingale part is cost-free). This focuses transport cost on movement of conditional expectations, with extra noise/martingales unconstrained.

An interpolation between barycentric and martingale cost yields the α\alphaβ\beta functional (Guo et al., 26 Nov 2025): infπΠ(μ,ν)[αxE[YX=x]2βMCov(πx,γ1d)]μ(dx)\inf_{\pi\in\Pi(\mu,\nu)} \int [\alpha|x-\mathbb E[Y|X=x]|^2 - \beta\,\mathsf{MCov}(\pi_x, \gamma^d_1)]\,\mu(dx) where MCov(ρ,σ)\mathsf{MCov}(\rho,\sigma) is the maximal covariance between measures ρ\rho and σ\sigma, and γ1d\gamma^d_1 is standard Gaussian.

In one dimension and for Gaussians, explicit formulae in terms of convex envelopes or linear SDEs are available (Gozlan et al., 5 Sep 2025, Guo et al., 26 Nov 2025).

4. Numerical Algorithms: Mirror Descent and Barycentric Projections

For discrete or sampled measures, first-order primal and dual mirror descent algorithms with KL (entropic) geometry are efficient in high dimensions (Paty et al., 2022, Deb et al., 2021). The primal maximizes over the coupling matrix PP with objective: f(P)=i=1naic(xi,j=1mPijyj)f(P) = \sum_{i=1}^n a_i\, c\left(x_i,\, \sum_{j=1}^m P_{ij}y_j\right) using gradient (in PP) followed by KL-exponentiation and Sinkhorn projection to enforce marginals.

The dual works with convex potentials (parametrized on support) using subgradients derived from the Hamiltonian optimizer zz^*.

Complexity per iteration is O(nm)O(nm) with convergence rate O(1/T)O(1/\sqrt{T}) for general convex cost and O(1/T)O(1/T) if smooth/strongly convex. For quadratic cost, the barycentric projection of an optimal plan provides a plug-in estimator for maps, with nonparametric and smoothed estimators’ rates analyzed in (Deb et al., 2021).

In the context of constrained map estimation, the barycentric mapping for a plan π\pi provides the conditional mean πˉ(x)=E[YX=x]\bar\pi(x) = \mathbb{E}[Y|X=x], which is then projected onto a given function class GG (e.g., Lipschitz gradients, RKHS, neural nets) via L2L^2-minimization (Tanguy et al., 18 Jul 2024).

5. Theoretical Guarantees: Existence, Uniqueness, and Structure

  • Existence of a barycentric OT optimizer is ensured by compactness and lower semicontinuity of the cost.
  • In the quadratic barycentric setting, the optimizer splits as: Brenier map μμˉ\mu \to \bar\mu (convex order minimal measure) followed by any martingale coupling μˉν\bar\mu \to \nu, as characterized by Strassen-type theorems (Gozlan et al., 5 Sep 2025).
  • Dual optimizers exist as convex potentials, providing a global duality (Paty et al., 2022, Gozlan et al., 5 Sep 2025).
  • In dimension one, explicit solutions exist in terms of quantile functions (Guo et al., 26 Nov 2025).

6. Applications Across Disciplines

  • Economics: Models of labor market matching with aggregation effects use barycentric WOT for capturing production via team composition (firm as xx, worker types yy, cost is output as CES function on barycenters) (Paty et al., 2022).
  • Machine Learning: Robust barycenters, low-regularity OT map estimation, and adversarial or continuous relaxations of statistical independence in generative modeling employ barycentric projection and related weak OT formulations (Tanguy et al., 18 Jul 2024, Deb et al., 2021).
  • Statistics: Barycentric projections underpin plug-in estimators for transport maps and test statistics for independence (Deb et al., 2021).
  • Numerical Linear Programming: Fast combinatorial algorithms for the barycentric problem and its variants leverage the structure of barycentric costs and sparsity of supports (Borgwardt, 2017, Anderes et al., 2015).

7. Connection to Classical OT and Beyond

  • The barycentric OT is a generalization of classical OT; it recovers Monge-Kantorovich when the cost is linear in yy.
  • Its distinguishing feature is cost nonlinear in the conditional distribution, only convex in barycenters, which restricts original OT plans but provides a generalized matching framework.
  • Its dynamic and martingale analogues connect optimal transport to stochastic processes, Markov representations, and more general convex geometries (Guo et al., 26 Nov 2025, Gozlan et al., 5 Sep 2025).
  • Variants include density-dependent minimal action (Lagrangian-weighted costs) (Wang et al., 4 Nov 2025), unbalanced extensions for robust barycenters (Nguyen et al., 10 Oct 2024), and constrained or regularized map estimation through projection (Tanguy et al., 18 Jul 2024).

References:

Whiteboard

Follow Topic

Get notified by email when new papers are published related to Barycentric Optimal Transport Problem.