Brenier’s Theorem: Foundations in Optimal Transport
- Brenier’s Theorem is a fundamental result in optimal transport that ensures the existence and uniqueness of an optimal map, represented as the gradient of a convex function.
- The theorem leverages duality, c-concavity, and the Monge–Ampère equation to establish optimality and smooth regularity under quadratic cost conditions.
- Extensions to degenerate and barycentric costs provide insights into numerical algorithms, optimal coupling, and the transition to the Knothe–Rosenblatt rearrangement.
Brenier’s Theorem provides the foundational existence and uniqueness theorem for the Monge–Kantorovich optimal transport problem with quadratic cost. Given Borel probability measures μ and ν on ℝⁿ (or on the flat torus 𝕋ⁿ = ℝⁿ/ℤⁿ), with μ absolutely continuous with respect to Lebesgue measure and both measures having finite second moments, there exists a unique optimal transport pushing μ to ν which minimizes the expected quadratic cost. This unique map is (almost everywhere) the gradient of a convex function and is characterized as the solution to the Monge–Ampère equation. The theorem not only supplies the structure of optimal plans but also relates closely to the theory of convex analysis, PDEs, and the geometry of mass transport.
1. Statement and Mathematical Formulation
Let μ, ν ∈ 𝒫₂(ℝⁿ) be Borel probability measures with finite second moments, and suppose μ ≪ dx. The Monge problem seeks a measurable transport map T: ℝⁿ → ℝⁿ pushing μ onto ν (T♯μ = ν) that minimizes
Brenier’s theorem asserts that there is a unique μ-a.e. optimal map T of the form T = ∇u, where u: ℝⁿ → ℝ is convex. The uniqueness is in the μ-almost everywhere sense; any other optimal plan must coincide with the graph of T (Bonnotte, 2012, Gozlan et al., 2018).
On the torus 𝕋ⁿ, with cost for A ∈ Sₙ{++}, and under smoothness and positivity conditions on densities , , there exists a unique c_A–concave potential ψ: 𝕋ⁿ → ℝ (with ∫ψ = 0) such that T(x) = x – A⁻¹∇ψ(x) pushes μ onto ν, and ψ ∈ C∞(𝕋ⁿ), A − D²ψ(x) > 0 everywhere (Bonnotte, 2012).
2. Duality, c-Concavity, and Cyclic Monotonicity
The optimal transport map’s existence and characterization leverage Kantorovich duality and the notion of c-concavity. For quadratic cost, the dual formulation is
There is always an optimizer of the form ψ = φ, χ = φ, with φ c-convex. Complementary slackness (φ(x) + φ(y) = |x – y|² for π⋆-almost every (x,y)) implies y ∈ ∂φ(x). Under absolute continuity of μ, this subdifferential reduces μ-a.e. to a single point, yielding T(x) = ∇φ(x) (Gozlan et al., 2018).
Cyclical monotonicity, as established by McCann, ensures that T is optimal. For torus costs, ψcc = ψ if and only if is convex. If A − D²ψ > 0, T is a diffeomorphism (Bonnotte, 2012).
3. Monge–Ampère Equation and Regularity Theory
The optimal transport map must satisfy the Monge–Ampère equation. In the standard case (A = I), this reads
with T(x) = ∇u(x). For general A, the formulation in terms of ψ is
The regularity theory asserts that if f, g ∈ C⁰∞_{>0} and all smoothness and positivity assumptions hold, then the Kantorovich potential ψ and the transport map T are smooth and T is a diffeomorphism (Bonnotte, 2012).
4. Evolution Under Degenerate Costs and Connection to Knothe’s Rearrangement
Introducing a family of anisotropic costs (A_t = diag(1, t, ..., t{n−1})), one obtains a smooth evolution of the optimal map as the anisotropy parameter t → 0. The associated PDE for Kantorovich potentials ψ_t is encoded in
[Equation (4.7) in (Bonnotte, 2012)].
In the vanishing limit t → 0, the cost penalizes the x₁-direction more heavily, resulting in the convergence (in L²) of the Brenier map T_t to the Knothe–Rosenblatt rearrangement R, which is formed by successive measures on conditional marginals. The decomposition
aligns with a smooth evolution in t that yields, at t=0, the one-dimensional Kantorovich potentials associated with the Knothe rearrangement (Bonnotte, 2012).
5. Smooth Dependence on Cost Structures and Nash–Moser Theory
For families of costs parameterized by positive-definite matrices A_t, the map A ↦ ψ_A is C¹ from Sₙ{++} to C{n+2,α}(𝕋ⁿ). As t ↓ 0, existence and regularity of solutions to the coupled system for (ψ_t¹, ψ_t²) are guaranteed using the Nash–Moser inverse function theorem, handling loss of derivatives and ensuring C∞ dependence on t ∈ [0, ε) (Bonnotte, 2012).
All linearizations of the nonlinear system exhibit isomorphic behavior with “tame estimates.” This framework recovers the Brenier structure for t > 0 and demonstrates degeneration of the map onto the Knothe rearrangement as t → 0.
6. Extensions: Weak and Barycentric Transport and Mixture Theorems
Generalizations of Brenier’s result appear for weakened quadratic costs, notably the barycentric cost, defined as
with π(dx, dy) = μ(dx) p_x(dy). For such costs, every optimal plan decomposes into (1) a deterministic Brenier transport X ↦ X' = ∇φ(X), pushing μ onto its barycentric projection μ̄, and (2) a martingale coupling from μ̄ to ν (Gozlan et al., 2018).
A full equivalence is established: W_22(μ, ν) = \overline{T}_2(ν|μ) if and only if there is a convex φ ∈ C¹ with 1–Lipschitz gradient pushing μ onto ν, which links to Caffarelli’s contraction theorem. In cases where ν has density proportional to e{-V} with V convex relative to a reference Gaussian, the Brenier map satisfies strict contraction properties (Gozlan et al., 2018).
7. Impact, Applications, and Further Directions
Brenier’s theorem provides the canonical tool for generating optimal transport maps in numerous mathematical, computational, and applied domains. Its structure underlies the analysis and numerical computation of transport maps, with direct connections to partial differential equations (Monge–Ampère), metric geometry, and probability. The smooth continuation from Brenier maps to the Knothe–Rosenblatt rearrangement elucidates the role of the cost function’s anisotropy and its effect on optimal transport structure. These insights inform the analysis of degenerate cases, the regularity of solutions, and the construction of numerical algorithms exploiting the smooth dependence on transport cost parameters (Bonnotte, 2012).
Further implications include the characterization of weak transport costs via barycentric projections and their relation to deterministic and stochastic components of optimal plans, as articulated in the mixture theorem and its connections to contraction principles (Gozlan et al., 2018). This suggests both a deeper structural understanding of transport optimality and new potential computational strategies driven by degenerate-cost limits.
References:
- (Bonnotte, 2012) "From Knothe's rearrangement to Brenier's optimal transport map"
- (Gozlan et al., 2018) "On A Mixture Of Brenier and Strassen Theorems"