PW Barycenter in Optimal Transport

Updated 2 July 2025

PW Barycenter is the statistical mean of probability measures in the 2-Wasserstein space, defined via the minimization of expected squared transport distances.
It employs the averaging of optimal transport maps and convex duality to characterize deformations in imaging and manifold statistics.
The empirical formulation ensures strong consistency, enabling practical template estimation in applications like neuroimaging and statistical signal analysis.

A PW Barycenter (“Population Wasserstein barycenter”) is a generalized notion of Fréchet mean for probability measures within the nonlinear metric geometry of the 2-Wasserstein space. It is defined as the minimizer of the expected squared Wasserstein distance to a family of probability measures, and can, under general regularity and compactness assumptions, be characterized as the push-forward of a reference measure by the mean of the optimal transport maps arising from an underlying parametric or random model. This perspective connects duality in optimal transport, convex analysis, and statistical averaging of deformations in stochastic modeling and imaging.

1. Definition and Characterization

The population Wasserstein barycenter is the minimizer $\mu^*$ of the Fréchet functional

$J(\nu) = \int_{\Theta} \frac{1}{2} d_{W_2}^2(\nu, \mu_\theta) \, g(\theta)\, d\theta,$

where $\{\mu_\theta\}_{\theta\in\Theta}$ is a parametric family of compactly supported probability measures indexed by a random parameter $\theta$, $g$ is the probability density of $\theta$ on $\Theta$, and $d_{W_2}$ is the quadratic Wasserstein distance. The barycenter is thus

$\mu^* = \operatorname{argmin}_\nu J(\nu).$

A duality argument from optimal transport theory reveals a deeper structure: if $\mu_0$ is a reference measure and $T_\theta$ is the optimal transport map from $\mu_0$ to $\mu_\theta$, then, under suitable conditions, the barycenter has the form

$\mu^* = \bar{T} \# \mu_0, \quad \text{where} \quad \bar{T}(x) = \int_\Theta T_\theta(x)\, g(\theta)\, d\theta,$

and $\#$ denotes push-forward. In other words, the barycenter is constructed by pushing forward the reference measure by the expectation of the optimal transport maps with respect to the parameter distribution.

In dimension one, the barycenter’s quantile function is given by the average of the input quantile functions: $F_{\mu^*}^{-1}(y) = \int_\Theta F_{\mu_\theta}^{-1}(y) g(\theta) d\theta.$
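The quantile-averaging formula can be sketched numerically as follows. This is a minimal illustration, assuming the input measures are represented by samples and that the weights play the role of $g(\theta)\,d\theta$; the function name `barycenter_quantiles_1d` and the Gaussian test data are hypothetical choices, not part of the source.

```python
import numpy as np

def barycenter_quantiles_1d(samples, weights=None, n_levels=200):
    """Weighted average of empirical quantile functions on a common grid of levels."""
    # Midpoint levels in (0, 1) avoid the extreme quantiles 0 and 1.
    levels = (np.arange(n_levels) + 0.5) / n_levels
    # One row per input measure: its empirical quantile function on the grid.
    quantiles = np.stack(
        [np.quantile(np.asarray(s, dtype=float), levels) for s in samples]
    )
    if weights is None:
        weights = np.ones(len(samples))
    weights = np.asarray(weights, dtype=float)
    weights = weights / weights.sum()  # plays the role of g(theta) d(theta)
    return levels, weights @ quantiles

# Illustrative data (assumption): three Gaussian samples standing in for mu_theta.
rng = np.random.default_rng(0)
params = [(-2.0, 0.5), (0.0, 1.0), (3.0, 1.5)]
samples = [rng.normal(loc=m, scale=s, size=5000) for m, s in params]
levels, q_bar = barycenter_quantiles_1d(samples)
```

For Gaussian inputs the quantile functions are $m + s\,\Phi^{-1}(y)$, so the averaged quantile function should be close to that of a Gaussian with mean $1/3$ and standard deviation $1$ (the averages of the means and of the standard deviations used here).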

2. Mathematical Framework

The barycenter problem is governed by the 2-Wasserstein metric

$d_{W_2}^2(\mu, \nu) = \inf_{\gamma \in \Pi(\mu, \nu)} \int |x - y|^2\, d\gamma(x, y),$

where $\Pi(\mu, \nu)$ is the set of couplings with fixed marginals. The push-forward operation for a measurable map $T$ is defined via

$\int f(x)\, d(T\#\mu)(x) = \int f(T(x))\, d\mu(x).$

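By Brenier’s theorem, when $\mu_0$ is absolutely continuous, the optimal transport map $T_\theta$ from $\mu_0$ to $\mu_\theta$ exists, is unique $\mu_0$-almost everywhere, and is the gradient of a convex function; in particular, $\mu_\theta = T_\theta\#\mu_0$ for each $\theta$.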
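The definitions above can be checked numerically in one dimension, where the optimal coupling between two equal-size samples pairs their sorted values. The sketch below assumes $\mu_0 = N(0,1)$ and an increasing affine map $T(x) = m + sx$ (both illustrative choices, as are the helper names); it compares the two sides of the push-forward identity for $f = \cos$ and compares the empirical $d_{W_2}^2$ with the closed form $m^2 + (s-1)^2$ for these two Gaussians.

```python
import numpy as np

def w2_squared_1d(x, y):
    """Squared W2 distance between two equal-size 1-D samples via the sorted coupling."""
    x_sorted, y_sorted = np.sort(x), np.sort(y)
    return float(np.mean((x_sorted - y_sorted) ** 2))

def transport_map(x, m=2.0, s=0.5):
    """Increasing affine map, hence the optimal map from N(0, 1) to N(m, s^2)."""
    return m + s * x

rng = np.random.default_rng(1)
n = 20000
x = rng.normal(size=n)                    # sample from mu_0 = N(0, 1)
y_direct = rng.normal(2.0, 0.5, size=n)   # independent sample from N(2, 0.25)

# Push-forward identity with f = cos: both sides estimate the same integral.
lhs = float(np.mean(np.cos(y_direct)))          # integral of f d(T # mu_0)
rhs = float(np.mean(np.cos(transport_map(x))))  # integral of f(T(x)) d mu_0

# W2^2 between N(0, 1) and N(m, s^2) is m^2 + (s - 1)^2 in one dimension.
w2_sq_empirical = w2_squared_1d(x, y_direct)
w2_sq_closed_form = 2.0**2 + (0.5 - 1.0) ** 2
```

Both comparisons agree only up to Monte Carlo error, which shrinks as `n` grows.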

The characterization of the barycenter as the push-forward by the average OT map holds under the condition that $T_\theta \circ \bar{T}^{-1}$ is itself the optimal map from $\mu^*$ to $\mu_\theta$ for each $\theta$ (see Proposition 3.6 and Theorem 3.7 in the paper). In this sense, the averaging of optimal transport maps is central to the characterization.
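As a worked check of this condition, consider a hypothetical one-dimensional location-scale model (an illustration, not taken from the paper) in which $\mu_0$ is absolutely continuous and $T_\theta(x) = a_\theta x + b_\theta$ with $a_\theta > 0$. Writing $\bar{a} = \int_\Theta a_\theta\, g(\theta)\, d\theta$ and $\bar{b} = \int_\Theta b_\theta\, g(\theta)\, d\theta$, the mean map and the composed map are

$\bar{T}(x) = \bar{a}\,x + \bar{b}, \qquad T_\theta \circ \bar{T}^{-1}(x) = \frac{a_\theta}{\bar{a}}\,(x - \bar{b}) + b_\theta.$

The composed map is affine with positive slope, hence the derivative of a convex function, so it is the optimal map from $\bar{T}\#\mu_0$ to $\mu_\theta$ and the condition is satisfied. For $\mu_0 = N(0,1)$ this gives $\mu_\theta = N(b_\theta, a_\theta^2)$ and $\mu^* = N(\bar{b}, \bar{a}^2)$.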

The dual formulation, based on convex analysis, expresses the barycenter problem as

$J_P := \inf_\nu \int_\Theta \frac{1}{2} d_{W_2}^2(\nu, \mu_\theta) g(\theta) d\theta = \sup \left\{ \int_\Theta \int_\Omega S_{g(\theta)} f_\theta(x) d\mu_\theta(x) d\theta : \int_\Theta f_\theta(x) d\theta = 0 \ \forall x \right\}$

with $S_{g(\theta)}f(x) = \inf_y \left\{ \frac{g(\theta)}{2}|x-y|^2 - f(y) \right\}$. In one dimension, the barycenter is immediately available as the average of quantile functions.

3. Extensions to Statistical and Imaging Models

The paper extends these abstract results to statistical models for signals and images with geometric variability, known as deformable models. Observations are generated by random deformations $\varphi_i$ of a common template:

$X_i(x) = h(\varphi_i^{-1}(x)), \qquad q_i(x) = |\det D\varphi_i^{-1}(x)| q_0(\varphi_i^{-1}(x)),$

that is, the observed signals $X_i$ or densities $q_i$ are random push-forwards of a template ($h$ or $q_0$) by diffeomorphisms. In this setting, under suitable integrability and regularity conditions, the barycenter’s density is

$\overline{q}(x) = |\det D\overline{\varphi}^{-1}(x)| q_0(\overline{\varphi}^{-1}(x)),$

with $\overline{\varphi}(x) = \mathbb{E}_\theta \varphi_\theta(x)$. Thus, the barycenter captures the mean “template” in a deformation-invariant way, simultaneously accounting for geometric and photometric variations.
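The density formula can be illustrated in one dimension with affine deformations $\varphi(x) = ax + b$, $a > 0$, for which $\varphi^{-1}(x) = (x-b)/a$ and $|\det D\varphi^{-1}(x)| = 1/a$. The sketch below is a toy model under stated assumptions: the standard normal template, the distributions of `a` and `b`, and all function names are hypothetical. It contrasts the barycenter density with the pointwise average of the observed densities.

```python
import numpy as np

def q0(x):
    """Template density (assumption for illustration): standard normal."""
    return np.exp(-0.5 * x**2) / np.sqrt(2.0 * np.pi)

def warped_density(x, a, b):
    """Density of phi # q0 for phi(x) = a*x + b with a > 0."""
    # phi^{-1}(x) = (x - b) / a and |det D phi^{-1}(x)| = 1 / a.
    return q0((x - b) / a) / a

rng = np.random.default_rng(2)
a = rng.uniform(0.5, 1.5, size=500)   # random scalings (hypothetical model)
b = rng.normal(0.0, 2.0, size=500)    # random shifts (hypothetical model)

x = np.linspace(-15.0, 15.0, 3001)
dx = x[1] - x[0]

# Barycenter density: warp the template by the mean deformation.
q_bar = warped_density(x, a.mean(), b.mean())
# Pointwise (Euclidean) average of the observed densities, for contrast.
q_mix = np.mean([warped_density(x, ai, bi) for ai, bi in zip(a, b)], axis=0)

def variance(density):
    """Variance of a density tabulated on the grid x (Riemann sum)."""
    mean = np.sum(x * density) * dx
    return float(np.sum((x - mean) ** 2 * density) * dx)

var_bar, var_mix = variance(q_bar), variance(q_mix)
```

In this toy model the barycenter density keeps the shape of the template, while the pointwise average is spread out by the random shifts, so `var_mix` is much larger than `var_bar`.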

This explicit formula provides a practical and statistically meaningful solution to template estimation under complex geometric warping, especially in contexts such as neuroimaging, atlas construction, and shape or texture summarization.

4. Estimation and Consistency: Empirical Barycenter

Given $n$ i.i.d. random measures $\mu_{\theta_1},\ldots,\mu_{\theta_n}$, the empirical barycenter is

$\overline{\mu}_n = \operatorname{argmin}_\nu \frac{1}{n} \sum_{j=1}^n \frac{1}{2} d_{W_2}^2(\nu, \mu_{\theta_j}).$

Under compact support, existence and uniqueness are ensured. The paper establishes strong statistical consistency: as $n\to\infty$, the empirical barycenter converges in $W_2$ almost surely to the population barycenter. The proof adapts the strong law of large numbers to the Wasserstein setting.

For practical computation, when OT maps $T_{\theta_j}$ from $\mu_0$ to each $\mu_{\theta_j}$ can be computed explicitly, one can use the empirical mean map

$\overline{T}_n(x) = \frac{1}{n} \sum_{j=1}^n T_{\theta_j}(x)$

and approximate the barycenter by $\overline{T}_n\#\mu_0$.
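The plug-in estimate and its convergence can be sketched in a setting where everything is available in closed form. The example assumes a hypothetical one-dimensional model with $\mu_0 = N(0,1)$ and random increasing affine maps $T_\theta(x) = ax + b$, so that $\overline{T}_n\#\mu_0 = N(\bar{b}_n, \bar{a}_n^2)$ and the $W_2$ distance between two one-dimensional Gaussians is $\sqrt{(m_1-m_2)^2 + (s_1-s_2)^2}$. The parameter distributions and function names are illustrative assumptions.

```python
import numpy as np

# Population parameters of the hypothetical model T_theta(x) = a*x + b.
A_MEAN, B_MEAN = 1.0, 0.5

def draw_maps(n, rng):
    """Draw n random increasing affine maps, returned as slopes and intercepts."""
    a = rng.uniform(0.5, 1.5, size=n)      # E[a] = A_MEAN, a > 0
    b = rng.normal(B_MEAN, 2.0, size=n)    # E[b] = B_MEAN
    return a, b

def w2_to_population(a, b):
    """W2 distance between the plug-in estimate and the population barycenter."""
    a_bar, b_bar = a.mean(), b.mean()      # coefficients of the empirical mean map
    # With mu_0 = N(0, 1): estimate is N(b_bar, a_bar^2), target is N(B_MEAN, A_MEAN^2).
    return float(np.sqrt((b_bar - B_MEAN) ** 2 + (a_bar - A_MEAN) ** 2))

rng = np.random.default_rng(3)
errors = {n: w2_to_population(*draw_maps(n, rng)) for n in (10, 100, 1000, 100000)}
for n, err in errors.items():
    print(f"n = {n:6d}   W2 error = {err:.4f}")
```

The printed errors fluctuate from run to run for small $n$ but should become small for large $n$, which is what the consistency result predicts for this model.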

5. Relationship to Prior Work and Broader Interpretation

This framework generalizes the concept of the empirical barycenter given by Agueh and Carlier (2011) to full population models, extending from finitely many fixed measures to general families of random measures. It also relaxes assumptions (e.g., concerning the admissibility of the class of maps to be averaged), and shows that, provided only that the average of OT maps is compatible with the optimal transport structure, the population barycenter is characterized as push-forward by the mean OT map.

Earlier approaches required stronger admissibility conditions or were only justified in finite, discrete, or one-dimensional settings. This extension is significant both for the theory of optimal transport and for applications in statistics where geometric averaging and averaging of deformations are central.

6. Practical Implications and Summary of Core Results

The characterization of the PW barycenter enables practical algorithms for template estimation in imaging, manifold statistics, and registration tasks, especially when statistical models involve random deformations. The central results may be summarized as follows:

Aspect	Mathematical Statement	Interpretation
Population barycenter	$\mu^* = \operatorname{argmin}_\nu \int_\Theta \frac{1}{2} d_{W_2}^2(\nu, \mu_\theta)\, g(\theta)\, d\theta$	Wasserstein Fréchet mean of the distribution
Averaged OT maps	$\mu^* = \bar{T}\#\mu_0$ , $\bar{T}(x)=\mathbb{E}T_\theta(x)$	Barycenter as push-forward by mean transport map
Empirical barycenter	$\overline{\mu}_n = \operatorname{argmin}_\nu \frac{1}{n} \sum_{j=1}^n \frac{1}{2} d_{W_2}^2(\nu, \mu_{\theta_j})$	Sample estimate of barycenter
Strong consistency	$d_{W_2}(\overline{\mu}_n, \mu^*) \to 0$ a.s. as $n \to \infty$	Empirical barycenter converges to population barycenter

Conclusion

For broad classes of random probability measures, including models of random geometric deformations prevalent in modern statistical image and signal analysis, the PW barycenter admits a rigorous and practically computable characterization as the push-forward of a reference probability measure by the mean of the optimal transport maps. This approach underpins a principled, geometry-aware statistical averaging scheme for probability distributions, yielding both consistency guarantees and explicit computational strategies for empirical estimation, and generalizes fundamentally the notion of averaging in non-Euclidean spaces.
