Optimal Transport Barycenter Problem
- Optimal Transport Barycenter Problem is a mathematical framework that averages probability measures by minimizing squared Wasserstein distances.
- It employs rigorous dual formulations and discretization techniques, allowing tractable solutions in high-dimensional computational geometry.
- Its applications span statistics, computer vision, and generative modeling, with proven properties like sparsity and uniqueness under strict regularity.
The optimal transport barycenter problem is a variational framework for averaging probability distributions under the geometry defined by the 2‐Wasserstein (optimal transport) distance. It aims to find a probability measure—the barycenter—that minimizes the sum of squared Wasserstein distances to a finite set (or family) of input measures, serving as a metric-space analog of the Fréchet mean. Barycenter computation sits at the intersection of analysis, computational geometry, probability, and high-dimensional numerical optimization, with applications ranging from statistics and computer vision to generative modeling.
1. Mathematical Formulation and Theoretical Foundations
Given a metric space (commonly ) and probability measures with finite second moments, the 2-Wasserstein barycenter is defined as the minimizer
where , with denoting couplings with marginals (Claici et al., 2018).
The barycenter generalizes the concept of means to metric measure spaces equipped with the Wasserstein metric. Under strict regularity (e.g., absolute continuity), existence and uniqueness often follow from the strict convexity of the cost functional.
In discrete settings, where each has finite support, an analogous LP can be posed over a finite set of weighted centroids, reducing the infinite-dimensional problem to a tractable finite-dimensional one (Anderes et al., 2015). The support of the discrete barycenter lies in the set of centroids , with provable sparsity and non-mass-splitting properties.
2. Dual Formulation and Semi-Discrete Structure
For semi-discrete problems—those in which the barycenter is restricted to a measure supported on points—the dual formulation plays a crucial role. Given , define . Then, for each : where is the -transform. The optimal weights induce a weighted Voronoi tessellation (power diagram).
Gradients with respect to site locations and dual variables are given by: where and are cell masses and barycenters for (Claici et al., 2018).
Extension to the fully continuous setting