Transport-Information Inequalities Overview
- Transport-information inequalities are defined as functional inequalities that link optimal transport costs with Fisher or Donsker–Varadhan information in Markov process settings.
- They establish a hierarchy connecting quadratic, Talagrand, Poincaré, and log-Sobolev inequalities through techniques like Lyapunov drift conditions and variational principles.
- These inequalities offer practical insights into regularity estimates for optimal maps and serve as a bridge between geometric, stochastic, and quantum analyses.
A transport-information inequality is a functional inequality that compares an optimal transport cost with an information functional. In the formulation singled out in the survey literature, one writes
where is an optimal transport cost from to a reference measure , is increasing with , and is Fisher or Donsker–Varadhan information associated with a reversible Markov process (Gozlan et al., 2010). In the quadratic case this becomes the transportation-information inequality , typically written as
or equivalently (Liu, 2015). The subject lies at the intersection of optimal transport, Dirichlet forms, entropy dissipation, concentration of measure, curvature, and PDE regularity; related papers also use the same transport-geometric mechanism to control Sobolev norms of optimal maps and to interpolate between entropy, transport, and Fisher information in both classical and quantum settings (Kolesnikov, 2010, Rouzé et al., 2017).
1. General template and dynamical meaning
The general transport-information template is
0
with 1 defined from a Dirichlet form. For a 2-reversible Markov semigroup 3 with Dirichlet form 4, the relevant information functional is
5
In the standard diffusion setting 6, this becomes
7
and it differs from the more common 8 by a factor 9 (Gozlan et al., 2010).
The dynamical meaning of this choice of information is central. If 0 is a stationary ergodic Markov process with invariant law 1, then the occupation measure
2
has a large deviation principle with rate function 3. In this sense, transport-information inequalities are the Markov-process analogue of transport-entropy inequalities: relative entropy governs i.i.d. empirical measures, while Donsker–Varadhan information governs empirical occupation measures (Gozlan et al., 2010).
This viewpoint yields explicit deviation theory. The survey states that
4
is equivalent to exponential Laplace bounds for time averages of 1-Lipschitz observables and to deviation inequalities of the form
5
for centered 1-Lipschitz 6 (Gozlan et al., 2010). The transport side therefore encodes geometry, while the information side encodes dynamical rarity.
2. The quadratic inequality 7 and its position in the hierarchy
On a connected complete finite-dimensional Riemannian manifold 8, with
9
the quadratic transportation-information inequality is
0
where, for 1 with 2,
3
and 4 otherwise (Liu, 2015). This is the form usually denoted 5.
The hierarchy recorded in the supplied literature is precise. It was already known that 6 implies Talagrand’s quadratic transport-entropy inequality 7, and under lower Bakry–Émery curvature bounds the HWI inequality relates 8 and logarithmic Sobolev inequalities. The survey states the implications
9
and, under
0
it adds that 1 implies
2
when 3 (Gozlan et al., 2010). In the terminology of (Lacker et al., 2020), 4 is stronger than Poincaré and weaker than log-Sobolev in diffusion settings.
A related refinement is the restricted quadratic transportation-information inequality. If 5 is 6-semi-convex and 7, the restricted inequality takes the form
8
The paper proves that this restricted version is equivalent to Talagrand’s 9, with explicit constant conversion: from 0 to 1, one may take 2, and conversely 3 (Liu, 2015). This identifies a sharp “restricted” bridge between entropy and Fisher-information transport control.
At the lower end of the hierarchy, the Poincaré inequality is tied to transportation-variance bounds. One paper proves that Poincaré is equivalent to several quadratic transportation-variance inequalities, including
4
and uses the same argument to derive further characterizations and a direct route from Lyapunov conditions to 5 (Liu, 2019). This places transportation-information theory inside a broader ladder of variance, entropy, and Fisher-information inequalities.
3. Characterizations by Lyapunov drift, variational principles, and dimension-free concentration
A central structural result is the Lyapunov characterization of 6. On a complete Riemannian manifold, the paper (Liu, 2015) proves that the following are equivalent: first, 7 satisfies 8; second, there exists a Lyapunov function 9 with 0 locally bounded such that
1
in the sense of distributions, for some 2, 3, and some point 4. This criterion removes curvature assumptions from the characterization and recasts transport-information control as a coercive quadratic drift condition for the generator.
The same paper proves a bounded-perturbation principle of Holley–Stroock type. If 5 is absolutely continuous with respect to 6 and
7
for some 8, then 9 satisfying 0 implies that 1 also satisfies 2 (Liu, 2015). This transference principle is one of the main robustness properties of the inequality.
A complementary approach is variational. The paper (Fontbona et al., 2015) studies functionals of the form
3
or in the quadratic setting 4. Transport inequalities become statements that 5 is the minimizer of 6. Nontrivial minimizers satisfy an Euler–Lagrange relation involving a Kantorovich potential 7: 8 From this framework the paper recovers the implication
9
and records the constant comparison
0
A further characterization identifies 1 with a dimension-free concentration property for product Markov processes. For an ergodic reversible Markov process with invariant law 2, the paper (Lacker et al., 2020) proves that 3 is equivalent to a family of product-space statements, including the Feynman–Kac semigroup bound
4
for every 5, 6, 7, and every 1-Lipschitz 8, together with the dimension-free deviation estimate
9
This is the Markov-process counterpart of Gozlan’s dimension-free characterization of 0.
4. Sobolev regularity of optimal transport as a transport-information inequality
A distinct but closely related line of work interprets transport-information inequalities as regularity estimates for the optimal map itself. In the global Euclidean setting
1
with optimal map 2, the Monge–Ampère equation reads
3
or equivalently
4
Assuming uniform convexity of the target,
5
Theorem 3.1 of (Kolesnikov, 2010) gives the global, dimension-free Hessian bound
6
This estimate is explicitly presented as a transport-information inequality: the Fisher information of the source controls an 7-Sobolev norm of the transport map.
The same paper develops an 8-family of estimates for second derivatives of 9, with 00, and derives matrix-level bounds such as
01
In the limiting 02 regime, the estimates recover Caffarelli’s contraction theorem. In particular, when 03 is Gaussian and 04, the optimal map from 05 to 06 is 07-Lipschitz (Kolesnikov, 2010).
A key transport-geometric comparison in the paper is
08
valid for every vector 09. This is derived from a generalized Talagrand inequality and then differentiated to obtain the Hessian estimate. In this formulation, the cost of translating the source density is bounded below by squared displacement of the optimal transport map, with the curvature of the target producing the factor 10 (Kolesnikov, 2010).
The Gaussian specialization is especially explicit. If 11 and 12, then the paper proves a dimension-free identity/estimate in which the Gaussian relative Fisher information dominates several nonnegative transport terms, including
13
and
14
This identifies the Gaussian log-Sobolev inequality and the Sobolev regularity of the transport map as consequences of a single transport-information structure (Kolesnikov, 2010).
5. Geometric and stochastic realizations
On compact 15-dimensional Riemannian manifolds with
16
mass transport yields a curvature-adapted transport inequality
17
where 18 is normalized volume, 19, and 20 is an explicit nonquadratic cost built from
21
Its linearization gives exactly the sharp Poincaré inequality
22
The paper emphasizes that the “naive” quadratic-cost transport approach would only recover the wrong constant 23; the modified cost and dimensional entropy restore the sharp spectral gap (Cordero-Erausquin, 2014). It also notes that a transport proof of the sharp log-Sobolev inequality on positively curved manifolds remains open.
For reflected diffusions in a convex domain 24, the law 25 on path space 26 satisfies a dimension-free Talagrand-type inequality
27
provided the drift obeys the one-sided Lipschitz condition
28
The constant is
29
For reflected Brownian motion with constant diffusion matrix 30, this simplifies to 31; in the standard case 32,
33
The proof hinges on the convexity observation
34
which makes the reflection term stabilizing rather than expansive (Pal et al., 2018).
A path-space analogue also holds for a nonlinear hyperbolic SPDE. For the stochastic wave equation in 35,
36
with Gaussian noise white in time and correlated in space, the law 37 on 38 satisfies
39
under the stated Lipschitz, covariance, and initial-data hypotheses. The proof uses Girsanov representation, a coupling estimate, and Gronwall’s inequality, and it yields exponential integrability and Hoeffding-type concentration for Lipschitz functionals of the solution path (Li et al., 2018).
These examples show that the transport-information paradigm is not confined to static Euclidean measures. It persists, with modified costs and norms, on compact manifolds, reflected path spaces, and SPDE path spaces.
6. Discrete, point-process, quantum, and information-constrained extensions
A common misconception is that quadratic 40-theory should transfer unchanged to discrete spaces. The discrete literature states the opposite: if a probability measure has support intersecting two sets at positive distance, then it cannot satisfy 41 for any 42. For Markov chains on countable spaces, the appropriate replacements are 43-transport-information and weak transport-information inequalities. Under 44,
45
and one also has the weak quadratic bound
46
Under the exponential curvature condition 47,
48
and under coarse Ricci curvature 49,
50
These inequalities lead further to 51-type transport-entropy bounds and to a discrete Bonnet–Myers theorem (Fathi et al., 2015).
On configuration spaces, transport inequalities lift from single-site laws to laws of whole point processes. If the base law 52 satisfies Talagrand’s inequality on 53, then the mixed binomial point-process law 54 satisfies a process-level inequality in which the cost of changing the point count appears as an additional entropy term: 55 For Poisson point processes with arbitrary 56-finite intensity, the paper proves a universal Marton-type transport-entropy inequality and derives concentration and modified logarithmic Sobolev consequences; taking 57 recovers a universal Marton inequality for Poisson processes (Gozlan et al., 2020). Although the right-hand side here is relative entropy rather than Donsker–Varadhan information, the mechanism is unmistakably transport-information in the broader geometric sense.
In noncommutative probability, a quantum HWI inequality provides the analogue of the classical transport-information bridge. For a primitive detailed-balance quantum Markov semigroup with invariant state 58, if the Carlen–Maas quantum Ricci lower bound satisfies 59, then
60
Here 61 is quantum relative entropy, 62 the Carlen–Maas quantum Wasserstein distance, and 63 quantum Fisher information. This yields modified logarithmic Sobolev and transport-cost inequalities as corollaries (Rouzé et al., 2017).
A further broadening replaces Fisher information by mutual information constraints on couplings. The information-constrained transport cost
64
interpolates between independent coupling at 65 and unconstrained optimal transport as 66. For Gaussian target 67, the paper proves
68
a strict sharpening of Talagrand’s Gaussian transport inequality when 69, and uses it in Marton-type concentration and in a converse for Cover’s relay-channel problem (Bai et al., 2020).
Transport-information inequalities thus form a family rather than a single statement. In the strict Markov-semigroup sense they compare transport cost with Fisher or Donsker–Varadhan information; in adjacent frameworks they compare transport with entropy, Sobolev norms of optimal maps, or even mutual information of couplings. Across these variants, the recurring structure is the same: geometry in Wasserstein-type space is constrained by a dissipation functional, and that constraint propagates to concentration, regularity, contraction, and functional inequalities.