Papers
Topics
Authors
Recent
Search
2000 character limit reached

Transport-Information Inequalities Overview

Updated 4 July 2026
  • Transport-information inequalities are defined as functional inequalities that link optimal transport costs with Fisher or Donsker–Varadhan information in Markov process settings.
  • They establish a hierarchy connecting quadratic, Talagrand, Poincaré, and log-Sobolev inequalities through techniques like Lyapunov drift conditions and variational principles.
  • These inequalities offer practical insights into regularity estimates for optimal maps and serve as a bridge between geometric, stochastic, and quantum analyses.

A transport-information inequality is a functional inequality that compares an optimal transport cost with an information functional. In the formulation singled out in the survey literature, one writes

a(Tc(ν,p))I(νp),a(T_c(\nu,p)) \le I(\nu\mid p),

where Tc(ν,p)T_c(\nu,p) is an optimal transport cost from ν\nu to a reference measure pp, aa is increasing with a(0)=0a(0)=0, and I(νp)I(\nu\mid p) is Fisher or Donsker–Varadhan information associated with a reversible Markov process (Gozlan et al., 2010). In the quadratic case this becomes the transportation-information inequality W2IW_2I, typically written as

W2(ν,μ)24CI(νμ),W_2(\nu,\mu)^2 \le 4C\,I(\nu\mid \mu),

or equivalently W2(ν,μ)2CI(νμ)W_2(\nu,\mu)\le 2\sqrt{C\,I(\nu\mid\mu)} (Liu, 2015). The subject lies at the intersection of optimal transport, Dirichlet forms, entropy dissipation, concentration of measure, curvature, and PDE regularity; related papers also use the same transport-geometric mechanism to control Sobolev norms of optimal maps and to interpolate between entropy, transport, and Fisher information in both classical and quantum settings (Kolesnikov, 2010, Rouzé et al., 2017).

1. General template and dynamical meaning

The general transport-information template is

Tc(ν,p)T_c(\nu,p)0

with Tc(ν,p)T_c(\nu,p)1 defined from a Dirichlet form. For a Tc(ν,p)T_c(\nu,p)2-reversible Markov semigroup Tc(ν,p)T_c(\nu,p)3 with Dirichlet form Tc(ν,p)T_c(\nu,p)4, the relevant information functional is

Tc(ν,p)T_c(\nu,p)5

In the standard diffusion setting Tc(ν,p)T_c(\nu,p)6, this becomes

Tc(ν,p)T_c(\nu,p)7

and it differs from the more common Tc(ν,p)T_c(\nu,p)8 by a factor Tc(ν,p)T_c(\nu,p)9 (Gozlan et al., 2010).

The dynamical meaning of this choice of information is central. If ν\nu0 is a stationary ergodic Markov process with invariant law ν\nu1, then the occupation measure

ν\nu2

has a large deviation principle with rate function ν\nu3. In this sense, transport-information inequalities are the Markov-process analogue of transport-entropy inequalities: relative entropy governs i.i.d. empirical measures, while Donsker–Varadhan information governs empirical occupation measures (Gozlan et al., 2010).

This viewpoint yields explicit deviation theory. The survey states that

ν\nu4

is equivalent to exponential Laplace bounds for time averages of 1-Lipschitz observables and to deviation inequalities of the form

ν\nu5

for centered 1-Lipschitz ν\nu6 (Gozlan et al., 2010). The transport side therefore encodes geometry, while the information side encodes dynamical rarity.

2. The quadratic inequality ν\nu7 and its position in the hierarchy

On a connected complete finite-dimensional Riemannian manifold ν\nu8, with

ν\nu9

the quadratic transportation-information inequality is

pp0

where, for pp1 with pp2,

pp3

and pp4 otherwise (Liu, 2015). This is the form usually denoted pp5.

The hierarchy recorded in the supplied literature is precise. It was already known that pp6 implies Talagrand’s quadratic transport-entropy inequality pp7, and under lower Bakry–Émery curvature bounds the HWI inequality relates pp8 and logarithmic Sobolev inequalities. The survey states the implications

pp9

and, under

aa0

it adds that aa1 implies

aa2

when aa3 (Gozlan et al., 2010). In the terminology of (Lacker et al., 2020), aa4 is stronger than Poincaré and weaker than log-Sobolev in diffusion settings.

A related refinement is the restricted quadratic transportation-information inequality. If aa5 is aa6-semi-convex and aa7, the restricted inequality takes the form

aa8

The paper proves that this restricted version is equivalent to Talagrand’s aa9, with explicit constant conversion: from a(0)=0a(0)=00 to a(0)=0a(0)=01, one may take a(0)=0a(0)=02, and conversely a(0)=0a(0)=03 (Liu, 2015). This identifies a sharp “restricted” bridge between entropy and Fisher-information transport control.

At the lower end of the hierarchy, the Poincaré inequality is tied to transportation-variance bounds. One paper proves that Poincaré is equivalent to several quadratic transportation-variance inequalities, including

a(0)=0a(0)=04

and uses the same argument to derive further characterizations and a direct route from Lyapunov conditions to a(0)=0a(0)=05 (Liu, 2019). This places transportation-information theory inside a broader ladder of variance, entropy, and Fisher-information inequalities.

3. Characterizations by Lyapunov drift, variational principles, and dimension-free concentration

A central structural result is the Lyapunov characterization of a(0)=0a(0)=06. On a complete Riemannian manifold, the paper (Liu, 2015) proves that the following are equivalent: first, a(0)=0a(0)=07 satisfies a(0)=0a(0)=08; second, there exists a Lyapunov function a(0)=0a(0)=09 with I(νp)I(\nu\mid p)0 locally bounded such that

I(νp)I(\nu\mid p)1

in the sense of distributions, for some I(νp)I(\nu\mid p)2, I(νp)I(\nu\mid p)3, and some point I(νp)I(\nu\mid p)4. This criterion removes curvature assumptions from the characterization and recasts transport-information control as a coercive quadratic drift condition for the generator.

The same paper proves a bounded-perturbation principle of Holley–Stroock type. If I(νp)I(\nu\mid p)5 is absolutely continuous with respect to I(νp)I(\nu\mid p)6 and

I(νp)I(\nu\mid p)7

for some I(νp)I(\nu\mid p)8, then I(νp)I(\nu\mid p)9 satisfying W2IW_2I0 implies that W2IW_2I1 also satisfies W2IW_2I2 (Liu, 2015). This transference principle is one of the main robustness properties of the inequality.

A complementary approach is variational. The paper (Fontbona et al., 2015) studies functionals of the form

W2IW_2I3

or in the quadratic setting W2IW_2I4. Transport inequalities become statements that W2IW_2I5 is the minimizer of W2IW_2I6. Nontrivial minimizers satisfy an Euler–Lagrange relation involving a Kantorovich potential W2IW_2I7: W2IW_2I8 From this framework the paper recovers the implication

W2IW_2I9

and records the constant comparison

W2(ν,μ)24CI(νμ),W_2(\nu,\mu)^2 \le 4C\,I(\nu\mid \mu),0

(Fontbona et al., 2015).

A further characterization identifies W2(ν,μ)24CI(νμ),W_2(\nu,\mu)^2 \le 4C\,I(\nu\mid \mu),1 with a dimension-free concentration property for product Markov processes. For an ergodic reversible Markov process with invariant law W2(ν,μ)24CI(νμ),W_2(\nu,\mu)^2 \le 4C\,I(\nu\mid \mu),2, the paper (Lacker et al., 2020) proves that W2(ν,μ)24CI(νμ),W_2(\nu,\mu)^2 \le 4C\,I(\nu\mid \mu),3 is equivalent to a family of product-space statements, including the Feynman–Kac semigroup bound

W2(ν,μ)24CI(νμ),W_2(\nu,\mu)^2 \le 4C\,I(\nu\mid \mu),4

for every W2(ν,μ)24CI(νμ),W_2(\nu,\mu)^2 \le 4C\,I(\nu\mid \mu),5, W2(ν,μ)24CI(νμ),W_2(\nu,\mu)^2 \le 4C\,I(\nu\mid \mu),6, W2(ν,μ)24CI(νμ),W_2(\nu,\mu)^2 \le 4C\,I(\nu\mid \mu),7, and every 1-Lipschitz W2(ν,μ)24CI(νμ),W_2(\nu,\mu)^2 \le 4C\,I(\nu\mid \mu),8, together with the dimension-free deviation estimate

W2(ν,μ)24CI(νμ),W_2(\nu,\mu)^2 \le 4C\,I(\nu\mid \mu),9

This is the Markov-process counterpart of Gozlan’s dimension-free characterization of W2(ν,μ)2CI(νμ)W_2(\nu,\mu)\le 2\sqrt{C\,I(\nu\mid\mu)}0.

4. Sobolev regularity of optimal transport as a transport-information inequality

A distinct but closely related line of work interprets transport-information inequalities as regularity estimates for the optimal map itself. In the global Euclidean setting

W2(ν,μ)2CI(νμ)W_2(\nu,\mu)\le 2\sqrt{C\,I(\nu\mid\mu)}1

with optimal map W2(ν,μ)2CI(νμ)W_2(\nu,\mu)\le 2\sqrt{C\,I(\nu\mid\mu)}2, the Monge–Ampère equation reads

W2(ν,μ)2CI(νμ)W_2(\nu,\mu)\le 2\sqrt{C\,I(\nu\mid\mu)}3

or equivalently

W2(ν,μ)2CI(νμ)W_2(\nu,\mu)\le 2\sqrt{C\,I(\nu\mid\mu)}4

Assuming uniform convexity of the target,

W2(ν,μ)2CI(νμ)W_2(\nu,\mu)\le 2\sqrt{C\,I(\nu\mid\mu)}5

Theorem 3.1 of (Kolesnikov, 2010) gives the global, dimension-free Hessian bound

W2(ν,μ)2CI(νμ)W_2(\nu,\mu)\le 2\sqrt{C\,I(\nu\mid\mu)}6

This estimate is explicitly presented as a transport-information inequality: the Fisher information of the source controls an W2(ν,μ)2CI(νμ)W_2(\nu,\mu)\le 2\sqrt{C\,I(\nu\mid\mu)}7-Sobolev norm of the transport map.

The same paper develops an W2(ν,μ)2CI(νμ)W_2(\nu,\mu)\le 2\sqrt{C\,I(\nu\mid\mu)}8-family of estimates for second derivatives of W2(ν,μ)2CI(νμ)W_2(\nu,\mu)\le 2\sqrt{C\,I(\nu\mid\mu)}9, with Tc(ν,p)T_c(\nu,p)00, and derives matrix-level bounds such as

Tc(ν,p)T_c(\nu,p)01

In the limiting Tc(ν,p)T_c(\nu,p)02 regime, the estimates recover Caffarelli’s contraction theorem. In particular, when Tc(ν,p)T_c(\nu,p)03 is Gaussian and Tc(ν,p)T_c(\nu,p)04, the optimal map from Tc(ν,p)T_c(\nu,p)05 to Tc(ν,p)T_c(\nu,p)06 is Tc(ν,p)T_c(\nu,p)07-Lipschitz (Kolesnikov, 2010).

A key transport-geometric comparison in the paper is

Tc(ν,p)T_c(\nu,p)08

valid for every vector Tc(ν,p)T_c(\nu,p)09. This is derived from a generalized Talagrand inequality and then differentiated to obtain the Hessian estimate. In this formulation, the cost of translating the source density is bounded below by squared displacement of the optimal transport map, with the curvature of the target producing the factor Tc(ν,p)T_c(\nu,p)10 (Kolesnikov, 2010).

The Gaussian specialization is especially explicit. If Tc(ν,p)T_c(\nu,p)11 and Tc(ν,p)T_c(\nu,p)12, then the paper proves a dimension-free identity/estimate in which the Gaussian relative Fisher information dominates several nonnegative transport terms, including

Tc(ν,p)T_c(\nu,p)13

and

Tc(ν,p)T_c(\nu,p)14

This identifies the Gaussian log-Sobolev inequality and the Sobolev regularity of the transport map as consequences of a single transport-information structure (Kolesnikov, 2010).

5. Geometric and stochastic realizations

On compact Tc(ν,p)T_c(\nu,p)15-dimensional Riemannian manifolds with

Tc(ν,p)T_c(\nu,p)16

mass transport yields a curvature-adapted transport inequality

Tc(ν,p)T_c(\nu,p)17

where Tc(ν,p)T_c(\nu,p)18 is normalized volume, Tc(ν,p)T_c(\nu,p)19, and Tc(ν,p)T_c(\nu,p)20 is an explicit nonquadratic cost built from

Tc(ν,p)T_c(\nu,p)21

Its linearization gives exactly the sharp Poincaré inequality

Tc(ν,p)T_c(\nu,p)22

The paper emphasizes that the “naive” quadratic-cost transport approach would only recover the wrong constant Tc(ν,p)T_c(\nu,p)23; the modified cost and dimensional entropy restore the sharp spectral gap (Cordero-Erausquin, 2014). It also notes that a transport proof of the sharp log-Sobolev inequality on positively curved manifolds remains open.

For reflected diffusions in a convex domain Tc(ν,p)T_c(\nu,p)24, the law Tc(ν,p)T_c(\nu,p)25 on path space Tc(ν,p)T_c(\nu,p)26 satisfies a dimension-free Talagrand-type inequality

Tc(ν,p)T_c(\nu,p)27

provided the drift obeys the one-sided Lipschitz condition

Tc(ν,p)T_c(\nu,p)28

The constant is

Tc(ν,p)T_c(\nu,p)29

For reflected Brownian motion with constant diffusion matrix Tc(ν,p)T_c(\nu,p)30, this simplifies to Tc(ν,p)T_c(\nu,p)31; in the standard case Tc(ν,p)T_c(\nu,p)32,

Tc(ν,p)T_c(\nu,p)33

The proof hinges on the convexity observation

Tc(ν,p)T_c(\nu,p)34

which makes the reflection term stabilizing rather than expansive (Pal et al., 2018).

A path-space analogue also holds for a nonlinear hyperbolic SPDE. For the stochastic wave equation in Tc(ν,p)T_c(\nu,p)35,

Tc(ν,p)T_c(\nu,p)36

with Gaussian noise white in time and correlated in space, the law Tc(ν,p)T_c(\nu,p)37 on Tc(ν,p)T_c(\nu,p)38 satisfies

Tc(ν,p)T_c(\nu,p)39

under the stated Lipschitz, covariance, and initial-data hypotheses. The proof uses Girsanov representation, a coupling estimate, and Gronwall’s inequality, and it yields exponential integrability and Hoeffding-type concentration for Lipschitz functionals of the solution path (Li et al., 2018).

These examples show that the transport-information paradigm is not confined to static Euclidean measures. It persists, with modified costs and norms, on compact manifolds, reflected path spaces, and SPDE path spaces.

6. Discrete, point-process, quantum, and information-constrained extensions

A common misconception is that quadratic Tc(ν,p)T_c(\nu,p)40-theory should transfer unchanged to discrete spaces. The discrete literature states the opposite: if a probability measure has support intersecting two sets at positive distance, then it cannot satisfy Tc(ν,p)T_c(\nu,p)41 for any Tc(ν,p)T_c(\nu,p)42. For Markov chains on countable spaces, the appropriate replacements are Tc(ν,p)T_c(\nu,p)43-transport-information and weak transport-information inequalities. Under Tc(ν,p)T_c(\nu,p)44,

Tc(ν,p)T_c(\nu,p)45

and one also has the weak quadratic bound

Tc(ν,p)T_c(\nu,p)46

Under the exponential curvature condition Tc(ν,p)T_c(\nu,p)47,

Tc(ν,p)T_c(\nu,p)48

and under coarse Ricci curvature Tc(ν,p)T_c(\nu,p)49,

Tc(ν,p)T_c(\nu,p)50

These inequalities lead further to Tc(ν,p)T_c(\nu,p)51-type transport-entropy bounds and to a discrete Bonnet–Myers theorem (Fathi et al., 2015).

On configuration spaces, transport inequalities lift from single-site laws to laws of whole point processes. If the base law Tc(ν,p)T_c(\nu,p)52 satisfies Talagrand’s inequality on Tc(ν,p)T_c(\nu,p)53, then the mixed binomial point-process law Tc(ν,p)T_c(\nu,p)54 satisfies a process-level inequality in which the cost of changing the point count appears as an additional entropy term: Tc(ν,p)T_c(\nu,p)55 For Poisson point processes with arbitrary Tc(ν,p)T_c(\nu,p)56-finite intensity, the paper proves a universal Marton-type transport-entropy inequality and derives concentration and modified logarithmic Sobolev consequences; taking Tc(ν,p)T_c(\nu,p)57 recovers a universal Marton inequality for Poisson processes (Gozlan et al., 2020). Although the right-hand side here is relative entropy rather than Donsker–Varadhan information, the mechanism is unmistakably transport-information in the broader geometric sense.

In noncommutative probability, a quantum HWI inequality provides the analogue of the classical transport-information bridge. For a primitive detailed-balance quantum Markov semigroup with invariant state Tc(ν,p)T_c(\nu,p)58, if the Carlen–Maas quantum Ricci lower bound satisfies Tc(ν,p)T_c(\nu,p)59, then

Tc(ν,p)T_c(\nu,p)60

Here Tc(ν,p)T_c(\nu,p)61 is quantum relative entropy, Tc(ν,p)T_c(\nu,p)62 the Carlen–Maas quantum Wasserstein distance, and Tc(ν,p)T_c(\nu,p)63 quantum Fisher information. This yields modified logarithmic Sobolev and transport-cost inequalities as corollaries (Rouzé et al., 2017).

A further broadening replaces Fisher information by mutual information constraints on couplings. The information-constrained transport cost

Tc(ν,p)T_c(\nu,p)64

interpolates between independent coupling at Tc(ν,p)T_c(\nu,p)65 and unconstrained optimal transport as Tc(ν,p)T_c(\nu,p)66. For Gaussian target Tc(ν,p)T_c(\nu,p)67, the paper proves

Tc(ν,p)T_c(\nu,p)68

a strict sharpening of Talagrand’s Gaussian transport inequality when Tc(ν,p)T_c(\nu,p)69, and uses it in Marton-type concentration and in a converse for Cover’s relay-channel problem (Bai et al., 2020).

Transport-information inequalities thus form a family rather than a single statement. In the strict Markov-semigroup sense they compare transport cost with Fisher or Donsker–Varadhan information; in adjacent frameworks they compare transport with entropy, Sobolev norms of optimal maps, or even mutual information of couplings. Across these variants, the recurring structure is the same: geometry in Wasserstein-type space is constrained by a dissipation functional, and that constraint propagates to concentration, regularity, contraction, and functional inequalities.

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Transport-Information Inequality.