Papers
Topics
Authors
Recent
2000 character limit reached

Optimal Transport Barycenter Problem

Updated 3 December 2025
  • Optimal Transport Barycenter Problem is a mathematical framework that averages probability measures by minimizing squared Wasserstein distances.
  • It employs rigorous dual formulations and discretization techniques, allowing tractable solutions in high-dimensional computational geometry.
  • Its applications span statistics, computer vision, and generative modeling, with proven properties like sparsity and uniqueness under strict regularity.

The optimal transport barycenter problem is a variational framework for averaging probability distributions under the geometry defined by the 2‐Wasserstein (optimal transport) distance. It aims to find a probability measure—the barycenter—that minimizes the sum of squared Wasserstein distances to a finite set (or family) of input measures, serving as a metric-space analog of the Fréchet mean. Barycenter computation sits at the intersection of analysis, computational geometry, probability, and high-dimensional numerical optimization, with applications ranging from statistics and computer vision to generative modeling.

1. Mathematical Formulation and Theoretical Foundations

Given a metric space (X,d)(X,d) (commonly XRDX \subset \mathbb{R}^D) and NN probability measures μ1,,μN\mu_1, \dots, \mu_N with finite second moments, the 2-Wasserstein barycenter μˉ\bar{\mu} is defined as the minimizer

μˉ=argminνP2(X)1Nj=1NW22(ν,μj),\bar{\mu} = \arg\min_{\nu \in \mathcal{P}_2(X)} \frac{1}{N}\sum_{j=1}^N W_2^2(\nu, \mu_j),

where W22(μ,ν)=infγΓ(μ,ν)X×Xd(x,y)2dγ(x,y)W_2^2(\mu,\nu) = \inf_{\gamma \in \Gamma(\mu, \nu)} \int_{X \times X} d(x,y)^2\,d\gamma(x,y), with Γ(μ,ν)\Gamma(\mu,\nu) denoting couplings with marginals μ,ν\mu, \nu (Claici et al., 2018).

The barycenter generalizes the concept of means to metric measure spaces equipped with the Wasserstein metric. Under strict regularity (e.g., absolute continuity), existence and uniqueness often follow from the strict convexity of the cost functional.

In discrete settings, where each μi\mu_i has finite support, an analogous LP can be posed over a finite set SS of weighted centroids, reducing the infinite-dimensional problem to a tractable finite-dimensional one (Anderes et al., 2015). The support of the discrete barycenter lies in the set of centroids S={1N(x1,k1++xN,kN)}S = \{\frac{1}{N} (x_{1,k_1} + \cdots + x_{N,k_N})\}, with provable sparsity and non-mass-splitting properties.

2. Dual Formulation and Semi-Discrete Structure

For semi-discrete problems—those in which the barycenter is restricted to a measure supported on mm points—the dual formulation plays a crucial role. Given Σ={x1,,xm}X\Sigma = \{x^1, \ldots, x^m\} \subset X, define νΣ=1mi=1mδxi\nu_\Sigma = \frac{1}{m}\sum_{i=1}^m \delta_{x^i}. Then, for each μj\mu_j: FOT[ϕ,Σ;μj]=1mi=1mϕi+Xϕˉ(y)dμj(y),F_{\mathrm{OT}}[\phi, \Sigma; \mu_j] = \frac{1}{m}\sum_{i=1}^m \phi^i + \int_X \bar{\phi}(y) d\mu_j(y), where ϕˉ(y)=mini{yxi2ϕi}\bar{\phi}(y) = \min_i \{\|y - x^i\|^2 - \phi^i\} is the cc-transform. The optimal weights ϕ\phi induce a weighted Voronoi tessellation (power diagram).

Gradients with respect to site locations and dual variables are given by: Fxi=1Nj=1Naji(xibji),Fϕji=1N(1maji),\frac{\partial F}{\partial x^i} = \frac{1}{N} \sum_{j=1}^N a^i_j (x^i - b^i_j), \qquad \frac{\partial F}{\partial \phi^i_j} = \frac{1}{N}\left(\frac{1}{m} - a^i_j\right), where ajia^i_j and bjib^i_j are cell masses and barycenters for μj\mu_j (Claici et al., 2018).

Extension to the fully continuous setting

Definition Search Book Streamline Icon: https://streamlinehq.com
References (2)

Whiteboard

Topic to Video (Beta)

Follow Topic

Get notified by email when new papers are published related to Optimal Transport Barycenter Problem.