Papers
Topics
Authors
Recent
Search
2000 character limit reached

Weighted Relative Entropy Contraction

Updated 11 January 2026
  • Weighted relative entropy contraction is a principle that quantifies the decay of relative entropy under system-specific dynamics using operator convexity and spatial weights.
  • It unifies classical and quantum frameworks by linking contraction rates to spectral gaps, curvature, and entropy-dissipation inequalities.
  • The approach enables explicit computation of decay rates in models such as unital qubit channels, block dynamics, and conservation laws.

A weighted relative entropy contraction is a general principle asserting that the (possibly weighted or generalized) relative entropy between two evolving states contracts under certain dynamics—stochastic, deterministic, classical, or quantum—often at a rate governed by underlying mixing, spectral gap, or curvature properties. This principle serves as a unifying theme across numerous domains: quantum channel theory, nonequilibrium statistical mechanics, interacting particle systems, conservation laws, and metrics on Markov chains. The weighted nature enters either through operator-convex functions (quantum setting), geometric block decompositions (spin systems), or through assignment of locality or shift weights (hyperbolic conservation laws).

1. Formalism of Weighted Relative Entropy

Weighted relative entropy generalizes the standard Kullback–Leibler divergence by incorporating weights either into the state space decomposition or through embedding via a weight function or operator.

  • Classical setting: Given measures μ\mu, ν\nu on a configuration space XX, the KL divergence is

D(μν)=Xlog(dμ/dν(x))  dμ(x)D(\mu \Vert \nu) = \int_X \log \bigl( d\mu/d\nu(x) \bigr)\; d\mu(x)

Weighting arises naturally when entropy is decomposed over regions or blocks, as in the weighted sum AαAD(μAνA)\sum_A \alpha_A D(\mu_A \Vert \nu_A).

  • Quantum setting: When g:(0,)Rg: (0, \infty) \to \mathbb{R} is operator-convex, the gg-divergence is

Dg(ρσ)=Tr[σ1/2  g(LρRσ1)σ1/2]D_g(\rho\Vert\sigma) = \mathrm{Tr}\left[ \sigma^{1/2}\; g\left(L_\rho R_{\sigma^{-1}}\right)\sigma^{1/2} \right]

where Lρ(X)=ρXL_\rho(X) = \rho X and Rσ1(X)=Xσ1R_{\sigma^{-1}}(X) = X \sigma^{-1}. Upon symmetrization, g(x)+xg(x1)=(x1)2K(x)g(x) + xg(x^{-1}) = (x-1)^2 K(x) for some normalized, operator-convex KK (Hiai et al., 2015).

  • Block-spin or spatial decompositions: In finite-lattice Gibbs setups, the entropy on a volume VV is bounded by a weighted block sum:

γ(α)  EntVτ(f)CAVαA E[EntAτ(f)]\gamma(\alpha)\;\mathrm{Ent}_V^\tau(f) \leq C \sum_{A\subset V} \alpha_A~\mathbb{E}\left[\mathrm{Ent}_A^\tau(f)\right]

with γ(α)=minxAxαA\gamma(\alpha) = \min_x\, \sum_{A \ni x} \alpha_A (Caputo et al., 2020).

  • Nonlocal weights (conservation laws): In Rm\mathbb{R}^m, the weighted pseudo-distance between a solution UU and a fixed shock SS,

D(U(t),S)=x<α(t)η(U(x)U)dx+ax>α(t)η(U(x)U+)dxD(U(t), S) = \int_{x < \alpha(t)} \eta(U(x)\Vert U_-) dx + a \int_{x > \alpha(t)} \eta(U(x)\Vert U_+) dx

depends on the weighting aa and shift α(t)\alpha(t) (Vasseur, 2013).

2. Contraction Principle and Differential Inequalities

Weighted relative entropy contracts under appropriate dynamics characterized by Markov generators, Lindblad semigroups, or block-update schemes. The contraction behavior is governed by entropy-dissipation inequalities of the form:

ddtD(μtνt)=I(μt,νt)\frac{d}{dt} D(\mu_t\Vert\nu_t) = -\mathcal{I}(\mu_t, \nu_t)

where I\mathcal{I} is a (weighted) Fisher information or Dirichlet form.

In the quantum unraveling of Lindblad dynamics, the entropy-dissipation law reads (Ortigueira et al., 28 Nov 2025):

ddtSBS(ρtσt)2IUNR(ρtσt)0\frac{d}{dt} S_{BS}(\rho_t\Vert \sigma_t) \leq -2\, I_{UNR}(\rho_t \Vert \sigma_t) \leq 0

Analogous results hold for classical block dynamics and Markov chains with nonnegative curvature (Caputo et al., 2020, Caputo et al., 2024):

ddtD(μtν)λD(μtν)    D(μtν)eλtD(μ0ν)\frac{d}{dt} D(\mu_t\Vert \nu) \leq -\lambda D(\mu_t\Vert \nu) \implies D(\mu_t\Vert \nu) \leq e^{-\lambda t} D(\mu_0\Vert \nu)

3. Spectral Gap, Curvature, and Explicit Contraction Rates

Quantitative control over the contraction rate is typically supplied by a spectral gap, Poincaré inequality, or curvature estimate.

  • Quantum semigroups: When the Fokker–Planck generator LL has spectral gap λ>0\lambda > 0, the Belavkin–Staszewski relative entropy decays exponentially with coefficient 2λ2\lambda (Ortigueira et al., 28 Nov 2025):

SBS(ρtσt)    e2λtSBS(ρ0σ0)S_{BS}(\rho_t\Vert\sigma_t)\;\leq\;e^{-2\lambda t}\,S_{BS}(\rho_0\Vert\sigma_0)

  • Quantum channels: The gg-divergence contraction coefficient for CPT maps Φ\Phi is

ηg(Φ)=supρσDg(Φ(ρ)Φ(σ))Dg(ρσ)\eta_g(\Phi) = \sup_{\rho\neq\sigma} \frac{D_g(\Phi(\rho)\Vert\Phi(\sigma))}{D_g(\rho\Vert\sigma)}

with 0ηg(Φ)10 \leq \eta_g(\Phi)\leq 1 and explicit evaluation possible for unital qubit channels, amplitude-damping, etc (Hiai et al., 2015).

  • Block-spin systems: The contraction for weighted block-heat-bath dynamics is dictated by γ(α)\gamma(\alpha):

D(μtν)eγ(α)t/CD(μ0ν)D(\mu_t\Vert\nu) \leq e^{-\gamma(\alpha)t/C} D(\mu_0\Vert\nu)

(Caputo et al., 2020).

  • Markov chains with curvature: If Wd(Pμ,Pν)(1κ)Wd(μ,ν)W_d(P\mu, P\nu) \leq (1-\kappa) W_d(\mu, \nu) for Wasserstein distance, then equivalently

D(PμPν)(1κ)D(μν)D(P\mu\Vert P\nu) \leq (1-\kappa) D(\mu\Vert\nu)

with κ\kappa the curvature modulus (Caputo et al., 2024).

4. Structure of Contraction Coefficients and Relationships

Weighted relative entropy contraction coefficients interact with other quantitative distances and monotone metrics:

Coefficient Inequality chain Attainment/Special Case
Geodesic \leq Riemannian Equates for all operator-convex KK
Riemannian \leq gg-divergence (weighted entropy) Saturated for unital qubit channels
gg-divergence 1\leq 1 Classical case: Dobrushin coeff.
Trace norm \leq sqrt(Riemannian) Bounding for all CPT maps

For generalized quantum divergences (e.g., BKM, WY, maximal metric), explicit closed-form contraction coefficients exist for certain channels, but extremal inequalities may be strictly loose outside special cases such as unital qubit or amplitude-damping channels. The conjecture that K(x)=x1/2K(x) = x^{-1/2} maximizes contraction is false; values must be computed case by case (Hiai et al., 2015).

5. Connections to Classical and Quantum Systems

Weighted entropy contraction unifies approaches across disparate models:

  • Quantum Lindblad master equations: The minimal KL divergence between unravellings of density matrices realizes the Belavkin–Staszewski entropy, which contracts under Lindblad flow precisely as dictated by classical functional inequalities transferred to the pure-state measure ensemble (Ortigueira et al., 28 Nov 2025).
  • Spin systems and interacting particles: Block decomposition with arbitrary αA\alpha_A weights generalizes log-Sobolev and Shearer inequalities, enabling explicit entropy/mixing control in high-temperature and spatially mixing spin systems (Caputo et al., 2020).
  • Conservation laws and shocks: Relative entropy pseudo-distances weighted by shift parameters yield stability and contraction properties for weak solutions, including in the presence of discontinuities or vacuum (Vasseur, 2013).
  • Markov chains in geometry: Sectional curvature governs both Wasserstein and entropy contraction, leading to sharp estimates in birth–death processes, zero-range dynamics, and Gibbs samplers. The time-varying MLSI captures scale-dependent curvature improvement (Caputo et al., 2024).

6. Extensions and Generalizations

The contraction principle extends to maximal ff-divergences by replacing KL with DfD_f and the BS entropy with DfmaxD_f^{\mathrm{max}}, with parallel contraction-by-gap reasoning holding throughout. For classical Markov semigroups and weighted block dynamics, the proof techniques enable dimension-free spectral estimates, modified log-Sobolev inequalities, and mixing time bounds. In all cases, the contraction coefficients—in particular, the dependence on spectral gap or curvature—can be leveraged for precise rates in hypothesis testing, quantum thermodynamics, and stability analysis.

7. Illustrative Examples and Computability

Explicit contraction rates are available in paradigmatic settings:

  • Unital qubit channels: All contraction coefficients for monotone metrics, divergences, and trace norms collapse to T2\|T\|^2 or T\|T\| depending on the metric, so optimal rates are directly computable (Hiai et al., 2015).
  • Block heat-bath dynamics: For finite-state spaces, total-variation mixing time and log-Sobolev constant are bounded in terms of γ(α)1\gamma(\alpha)^{-1}, which is determined by the local block structure and weights (Caputo et al., 2020).
  • Birth–death processes: Choosing a metric dd, the contraction follows from coupling and curvature computation, with the entropy decay rate linked to the maximal mean difference in expected position under the semigroup (Caputo et al., 2024).
  • Zero-range mean-field dynamics: Sharp MLSI constants are computable by examining the minimal rate increment, matching the exponential rate of weighted relative entropy contraction (Caputo et al., 2024).
  • Conservation laws with shocks: The L2L^2 stability and explicit bounds on the required spatial shift demonstrate the flexibility and robustness of weighted pseudo-distances for nonlinear hyperbolic problems (Vasseur, 2013).

Weighted relative entropy contraction is thus a central unifying concept relating functional inequalities, spectral theory, and stability under time-evolution, accommodating both classical and quantum dynamics, as well as both local and global weighting schemes across a spectrum of models (Ortigueira et al., 28 Nov 2025, Hiai et al., 2015, Caputo et al., 2020, Vasseur, 2013, Caputo et al., 2024).

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Weighted Relative Entropy Contraction.