Wasserstein Formulation & Optimal Transport
- Wasserstein formulation is a framework in optimal transport that minimizes the cost of moving probability measures, typically using squared Euclidean distances.
- It extends to multi-marginal and dual settings with strategies like genetic column updates to enable sparse, tractable solutions for high-dimensional problems.
- Applications include computing barycenters, mesh-free interpolation, and Wasserstein splines, advancing techniques in statistics, machine learning, and computational mathematics.
The Wasserstein formulation is a canonical framework in optimal transport theory that characterizes the minimal cost of transporting one probability measure to another with respect to a specified ground cost, most commonly the squared Euclidean distance. This formulation encompasses both classical two-marginal problems and generalizations to the multi-marginal setting, where optimal transport plans couple more than two marginals. Modern developments include dual formulations, algorithmic advances for tractable high-dimensional computation, as well as applications to Wasserstein barycenters and spline interpolation in the metric space of probability measures. The Wasserstein framework enables both geometric and variational perspectives, which are foundational for recent advances in statistics, machine learning, computational mathematics, and applied sciences (Friesecke et al., 2022).
1. Multi-Marginal Optimal Transport: Primal and Dual Wasserstein Formulations
Let and given probability measures for , the multi-marginal Kantorovich problem is
where denotes the set of all couplings with prescribed marginals
and is a given cost function. For quadratic cost, the Wasserstein-$2$ setting, is typically a sum of squared distances or a symmetric quadratic cost.
After discretization, each marginal is represented on a finite grid , and a plan 0 is an 1 nonnegative tensor, interpreted as a probability mass function. The discrete problem reads
2
where 3 extracts the 4-th marginal, and 5 is the Frobenius inner product. The feasible set is the so-called Kantorovich polytope.
The dual formulation introduces potentials 6, yielding
7
with equality of primal and dual optima by linear programming duality.
2. The GenCol Algorithm for High-Dimensional Multi-Marginal Problems
The GenCol (genetic column generation) algorithm is specifically designed to solve large-scale multi-marginal optimal transport (MMOT) linear programs which arise in mesh-based Wasserstein formulations. The algorithm operates on a restricted subset of the configuration space, iteratively augmenting active constraints to converge to the full optimal solution. The main steps are:
- Column restriction: Start with a small subset 8.
- Reduced LP solve: Solve the primal and dual LPs on 9.
- Feasibility check: If dual feasibility extends globally, terminate; the current solution is optimal.
- Genetic update: Otherwise, select a "parent" 0 with 1, mutate one coordinate to create an "offspring" 2 not in 3, and admit 4 if it violates the dual constraint.
- Pruning: If 5 exceeds a chosen inflation threshold, prune inactive columns.
Key properties include:
- Primal LP on 6 has 7 variables and 8 constraints.
- By sparsity, at most 9 columns are needed for an exact solution.
- The genetic update ensures efficient exploration while controlling computational complexity.
The GenCol algorithm empirically achieves exponential convergence and enables MMOT problems with 0–1 variables to be solved on standard hardware, whereas classical approaches (Sinkhorn iterations, iterative Bregman projections) are infeasible due to memory constraints (Friesecke et al., 2022).
3. Wasserstein Barycenters and Mesh-Free Formulations
The Wasserstein barycenter is the probability measure that minimizes a weighted sum of Wasserstein distances to a given family of marginals. The standard barycenter problem is
2
with 3, 4. In the mesh-free multi-marginal formulation, the cost is
5
with 6. The barycenter measure is then the pushforward 7, where 8 is the optimal MMOT plan, allowing the barycenter to be supported on a grid of size 9 times finer than the input marginals—hence, "mesh-free."
This formulation avoids limitations of grid-based approaches, which restrict barycenter locations, by exploiting the higher-dimensional simplex induced by the multi-marginal coupling (Friesecke et al., 2022).
4. Wasserstein Splines and Geodesic Interpolation
Wasserstein splines generalize classical spline interpolation to the space of probability measures endowed with the Wasserstein metric. Given marginal constraints at discrete times 0, the MMOT problem seeks a joint law 1 on 2 minimizing
3
subject to 4 for all 5. The cost 6 is defined via
7
the cubic-spline energy associated with the knot sequence. The resulting interpolants are 8, where 9 evaluates the spline at intermediate times.
This provides a principled, variational approach to interpolate measures along smooth paths in Wasserstein space, with links to dynamic optimal transport (Friesecke et al., 2022).
5. Sparsity, Correctness, and Complexity of Wasserstein Formulations
Several theoretical properties characterize the tractability of the MMOT Wasserstein formulation:
- Sparsity (Dubins): Any optimizer 0 to the discrete MMOT LP with marginal grid sizes 1 has at most 2 nonzeros. This sharp bound ensures that although the full coupling grows exponentially, optimizers are supported on a much smaller subset of configurations.
- Algorithmic correctness (GenCol): Under the genetic column update, the reduced LP converges to the global optimum once dual feasibility is satisfied, ensuring correctness and finite termination in practice.
- Computational efficiency: Each iteration involves LPs of size 3, with overall complexity dictated by the manageable active set size 4.
Empirical observations confirm rapid (often exponential) convergence and mesh-free numerical accuracy for barycenter and spline tasks in high dimensions (Friesecke et al., 2022).
6. Significance: Applications and Extensions
The general Wasserstein formulation underlies a broad range of optimal transport-based techniques. Key applications include:
- Multi-marginal barycenters: Allowing mesh-free, high-dimensional computation, essential for statistical averaging in geometry and machine learning.
- Wasserstein splines: Enabling smooth interpolation of distributions in generative modeling, time-series, and dynamical inference.
- High-dimensional OT solvers: Via the GenCol scheme and sparsity guarantees, practical in scientific computing regimes previously inaccessible to full dense plan approaches.
- Theoretical innovation: The integration of duality, sparsity, and mesh-free evaluation within a unified mathematical framework has enabled advances across computational mathematics, statistics, and applied sciences.
References and further technical details, such as precise convergence proofs or spline cost expressions, are detailed in (Friesecke et al., 2022).