Hypocoercivity in Discrete Time Markov Chains
- Hypocoercivity is characterized by uniform exponential convergence to equilibrium in non-reversible, discrete time Markov chains via operator splittings and projections onto mean-zero subspaces.
- The framework leverages spectral properties, similarity transforms, and, when necessary, Jordan block analysis to establish explicit mixing rates and exponential decay bounds.
- Practical implementations include non-reversible random walks and discrete kinetic schemes, where explicit hypocoercivity constants provide sharp non-asymptotic estimates for convergence and χ²-divergence.
The hypocoercivity phenomenon for discrete time Markov chains refers to uniform exponential convergence to equilibrium for a large class of non-reversible, discrete time stochastic processes, even in the absence of classical detailed-balance (i.e., non-selfadjointness of the transition operator). This concept generalizes spectral gap–based “coercivity” to operator splittings where neither part is coercive, originating in kinetic theory and extended here to finite Markov chains, time-discrete kinetic equations, and general linear dynamical systems. The foundational results are operator-theoretic, leveraging spectral properties, similarity transforms, and explicit control of non-reversible dynamics via projections onto mean-zero subspaces in weighted spaces.
1. Operator-Theoretic Framework and Mean-Zero Subspace
Let be the stochastic, irreducible, aperiodic transition matrix of a Markov chain on finite state space with stationary law . The natural Hilbert space is
with inner product .
The central tool is the orthogonal projection
which projects onto the mean-zero subspace . This leads to the projected transition operator , acting on , which governs convergence to equilibrium: Hence, the decay is controlled by the norm . This operator commutes with and is the focus for quantitative analysis.
2. Hypocoercivity in the Diagonalizable Case
Assume that is diagonalizable on . There exist and a diagonal , with , such that , where is the trivial eigenvalue and are nontrivial. Define the operator condition number .
The discrete hypocoercivity estimate asserts: where and is an explicit hypocoercivity constant.
Thus, for diagonalizable , one achieves a uniform, computable exponential bound on the convergence rate—exporting the exact concept of hypocoercivity from the continuous to discrete setting.
3. Non-Diagonalizable and General Hypocoercivity
If is not diagonalizable, the spectral theory must account for Jordan block structure. The Jordan decomposition (after a suitable similarity via )
involves block-diagonal with eigenvalues , algebraic multiplicities , geometric multiplicities , and .
Standard estimates yield, for : The polynomial prefactors in reflect algebraic non-diagonalizability. Additionally, in control of convergence, the factor (for smallest stationary probability) appears, increasing the worst-case bound.
4. Quantitative Mixing, χ²-Divergence, and Submultiplicativity
Mixing to equilibrium is quantified through the pointwise and global -divergence: The operator-theoretic approach establishes the submultiplicativity property: where is the second singular value. Globally,
Explicit checks on powers of —by spectral or numerical means—thus yield sharp, non-asymptotic mixing time estimates for any initial state, including non-reversible Markov chains.
5. Hypocontractivity, Discrete Indices, and Short-Time Plateau Phenomenon
The theory is supplemented and clarified by the concept of hypocontractivity from linear dynamical systems (Achleitner et al., 2022). For a (possibly non-symmetric) , after projecting onto mean-zero, the hypocoercivity/hypocontractivity index is the minimal such that the iterated dissipation
is positive definite, with . This implies that for , but , revealing a finite “plateau” of norm preservation followed by strict contraction and exponential decay. The drop is quantified by the smallest eigenvalue of as
Efficient computation is via rank tests on controllability matrices. Concrete examples for non-reversible 3-state chains demonstrate this effect; e.g., for certain , and first strict contraction occurs at .
6. Structure of Hypocoercivity for Discrete Kinetic Schemes
The explicit structure of hypocoercivity is illuminated in the context of discrete Fokker-Planck chains (Dujardin et al., 2018). Decompositions , with coercive (“collision”) and skew-adjoint (“transport”) and appropriate commutator control, enable the construction of modified discrete energies
with shift , such that for suitable and time step
with independent of discretization. The approach is robust: the same algebraic argument applies directly to general discrete Markov chains with analogous splitting and spectral gap properties, so long as a Poincaré-type inequality and relevant commutator identities are available.
7. Assumptions, Generalizations, and Limitations
The hypocoercivity framework for discrete time Markov chains applies to any finite, ergodic chain, regardless of reversibility. In the diagonalizable case, hypocoercivity constants and are explicit; in the general case, polynomial prefactors arise naturally from Jordan block structure. The theory has been shown to cover a wide class of non-reversible chains (notably, momentum-based samplers and non-reversible random walks), and produces the sharpest known mixing bounds without asymptotic approximations.
Limitations include the computational complexity of determining (condition number of the similarity matrix), which may be mitigated by bounding via graph structure or pseudospectral estimates. Extending to countable or continuous states requires semigroup analogues and spectral calculus for infinite dimensions. Nevertheless, the foundational operator-theoretic approach, involving mean-zero projections and dissipation control, enables a unified and general methodology for non-asymptotic mixing and exponential convergence for discrete-time, non-reversible Markov chains. The interplay of spectral gap, non-selfadjointness, and cross-effects captured by commutators is now understood as a purely algebraic phenomenon, paralleling and extending Villani’s continuous-time hypocoercivity to the discrete-time, discrete-state setting.