Papers
Topics
Authors
Recent
Search
2000 character limit reached

Discrete-Time Markov Chains

Updated 7 April 2026
  • Discrete-Time Markov Chain (DTMC) is a stochastic process defined on a countable state space where the next state depends solely on the current state.
  • DTMCs are widely used in spectral analysis, model checking, queuing theory, and infrastructure reliability across various research fields.
  • Key analyses involve state classification, ergodic properties, mixing time estimation, and empirical calibration for practical applications.

A discrete-time Markov chain (DTMC) is a system consisting of a countable state space and a sequence of random variables with the Markov property—future states depend only on the present state and not on the past trajectory. DTMCs are foundational in stochastic modeling, probabilistic verification, statistical physics, combinatorics, queuing theory, and infrastructure reliability. The core mathematical structure is characterized by a (sub)stochastic transition matrix PP, with applications ranging from spectral graph theory to model checking and large-scale infrastructure monitoring.

1. Formal Structure and Definitions

A DTMC is a sequence of SS-valued random variables X0,X1,…X_0, X_1, \ldots defined on a probability space (Ω,F,P)(\Omega, \mathcal{F}, P) such that for all nn, P(Xn+1=j∣Xn=i,Xn−1,...)=P(Xn+1=j∣Xn=i)=:PijP(X_{n+1}=j \mid X_n = i, X_{n-1}, ...) = P(X_{n+1}=j \mid X_n = i) =: P_{ij}. Here, SS is the state space (finite or countably infinite), and P=(Pij)P = (P_{ij}) is the transition matrix—row-stochastic if Pij≥0P_{ij}\geq0 and ∑jPij=1\sum_j P_{ij}=1 for each SS0; sub-stochastic if SS1. The initial distribution is denoted by SS2 where SS3 (Halidias, 2017).

Paths of length SS4 can be represented as SS5, with probability SS6 (Hartmanns et al., 2 Sep 2025). Absorbing and goal states may be specified for application contexts.

2. State Classification and Ergodic Properties

States in a DTMC are classified based on recurrence:

  • Transient: SS7
  • Null Recurrent: Recurrence probability SS8 but infinite expected return time
  • Positive Recurrent: Recurrence probability SS9 and finite expected return time X0,X1,…X_0, X_1, \ldots0

For each pair X0,X1,…X_0, X_1, \ldots1, X0,X1,…X_0, X_1, \ldots2 is the hitting probability. The ergodic theorem for occupancy states that X0,X1,…X_0, X_1, \ldots3 if X0,X1,…X_0, X_1, \ldots4 is positive recurrent and X0,X1,…X_0, X_1, \ldots5, otherwise X0,X1,…X_0, X_1, \ldots6 (Halidias, 2017). For arbitrary initial law X0,X1,…X_0, X_1, \ldots7, time-averaged occupancy probabilities are weighted by X0,X1,…X_0, X_1, \ldots8.

3. DTMCs in Graph-Theoretic and Structural Contexts

Constructing a DTMC from a graph X0,X1,…X_0, X_1, \ldots9 with adjacency (or weight) matrix (Ω,F,P)(\Omega, \mathcal{F}, P)0 and degree matrix (Ω,F,P)(\Omega, \mathcal{F}, P)1 is standard. For undirected graphs, (Ω,F,P)(\Omega, \mathcal{F}, P)2 yields a symmetric or doubly stochastic transition matrix (when (Ω,F,P)(\Omega, \mathcal{F}, P)3 is symmetric and unweighted). The stationary distribution satisfies (Ω,F,P)(\Omega, \mathcal{F}, P)4, specializing to (Ω,F,P)(\Omega, \mathcal{F}, P)5 for undirected weighted graphs, and (Ω,F,P)(\Omega, \mathcal{F}, P)6 for doubly stochastic (Ω,F,P)(\Omega, \mathcal{F}, P)7 (regular, unweighted, undirected) (Murthy, 2012). Graph structure determines chain properties, with connectedness and aperiodicity ensuring ergodicity.

Spectral decomposition applies when (Ω,F,P)(\Omega, \mathcal{F}, P)8 is symmetric: (Ω,F,P)(\Omega, \mathcal{F}, P)9, allowing exact analysis of convergence and mixing properties. Exponential mixing is controlled by nn0, the second-largest eigenvalue, with mixing time nn1 (Murthy, 2012). Shannon entropy nn2 increases monotonically for doubly stochastic nn3, reaching nn4 in the limit.

4. Reachability, Path Abstraction, and Model Checking

Probabilistic model checking in DTMCs requires computation of probabilities for reaching goal states nn5 from an initial state nn6. The reachability probability is nn7, summing over all non-zero probability paths from nn8 to nn9 (Hartmanns et al., 2 Sep 2025).

Path abstraction leverages the free monoid P(Xn+1=j∣Xn=i,Xn−1,...)=P(Xn+1=j∣Xn=i)=:PijP(X_{n+1}=j \mid X_n = i, X_{n-1}, ...) = P(X_{n+1}=j \mid X_n = i) =: P_{ij}0: by removing subsets P(Xn+1=j∣Xn=i,Xn−1,...)=P(Xn+1=j∣Xn=i)=:PijP(X_{n+1}=j \mid X_n = i, X_{n-1}, ...) = P(X_{n+1}=j \mid X_n = i) =: P_{ij}1 ("intermediate" states), one constructs a path-abstracted chain P(Xn+1=j∣Xn=i,Xn−1,...)=P(Xn+1=j∣Xn=i)=:PijP(X_{n+1}=j \mid X_n = i, X_{n-1}, ...) = P(X_{n+1}=j \mid X_n = i) =: P_{ij}2. Fundamental absorption theorems establish that abstracting over nested subsets P(Xn+1=j∣Xn=i,Xn−1,...)=P(Xn+1=j∣Xn=i)=:PijP(X_{n+1}=j \mid X_n = i, X_{n-1}, ...) = P(X_{n+1}=j \mid X_n = i) =: P_{ij}3 yields P(Xn+1=j∣Xn=i,Xn−1,...)=P(Xn+1=j∣Xn=i)=:PijP(X_{n+1}=j \mid X_n = i, X_{n-1}, ...) = P(X_{n+1}=j \mid X_n = i) =: P_{ij}4, guaranteeing invariance under the order of abstraction over non-goal sets. The free-monoid formalism yields equational proofs of these properties and circumvents reliance on graph-theoretic SCC decompositions.

This algebraic framework enables early counterexample refinement in model checking and decouples path abstraction from combinatorial explosion in classical state elimination (Hartmanns et al., 2 Sep 2025).

5. Statistical Inference and Parametric Modeling

DTMCs provide parametric structures for empirical modeling, exemplified in infrastructure deterioration studies. For sewer pipe degradation, state space P(Xn+1=j∣Xn=i,Xn−1,...)=P(Xn+1=j∣Xn=i)=:PijP(X_{n+1}=j \mid X_n = i, X_{n-1}, ...) = P(X_{n+1}=j \mid X_n = i) =: P_{ij}5 (discrete severity levels) is used, with homogeneous DTMC calibrated per cohort and failure type (Jimenez-Roa et al., 2023). Calibration employs weighted least-squares minimization of prediction error, under constraints enforcing Markov structure: row-stochasticity, structural zeros prohibiting "healing," and absorbing final states.

"Chain Multi" allows arbitrary non-improving transitions (P(Xn+1=j∣Xn=i,Xn−1,...)=P(Xn+1=j∣Xn=i)=:PijP(X_{n+1}=j \mid X_n = i, X_{n-1}, ...) = P(X_{n+1}=j \mid X_n = i) =: P_{ij}6 for P(Xn+1=j∣Xn=i,Xn−1,...)=P(Xn+1=j∣Xn=i)=:PijP(X_{n+1}=j \mid X_n = i, X_{n-1}, ...) = P(X_{n+1}=j \mid X_n = i) =: P_{ij}7), whereas "Chain Single" restricts transitions to self-loops and one-step worsening (P(Xn+1=j∣Xn=i,Xn−1,...)=P(Xn+1=j∣Xn=i)=:PijP(X_{n+1}=j \mid X_n = i, X_{n-1}, ...) = P(X_{n+1}=j \mid X_n = i) =: P_{ij}8). Model selection favors "Chain Single" for parsimony and improved numerical stability. Expected future severity P(Xn+1=j∣Xn=i,Xn−1,...)=P(Xn+1=j∣Xn=i)=:PijP(X_{n+1}=j \mid X_n = i, X_{n-1}, ...) = P(X_{n+1}=j \mid X_n = i) =: P_{ij}9 enables direct cohort comparison.

6. MCMC, Mixing Time, and Reversible Chains

In combinatorial optimization, reversible positive recurrent DTMCs are used for Markov Chain Monte Carlo (MCMC) sampling from complex distributions. The Metropolis chain used for Gibbs distribution sampling over perfect matchings is irreducible, aperiodic, and its stationary measure is given by SS0 (Moothedath et al., 2017). Conductance SS1 bounds and canonical paths yield explicit mixing time estimates, e.g., SS2 with SS3.

Mixing properties depend on spectral gaps (via SS4) and detailed balance, ensuring convergence to the target distribution. This analytic machinery underpins practical rapid mixing results central to probabilistic sampling and randomized algorithms involving DTMCs.

7. Applications in Queueing, Communication, and Optimization

DTMCs underpin operational models across engineering domains. For instance, buffer-occupancy in bidirectional Fano decoders is modeled as a DTMC on occupancy states SS5, with transition probabilities tied to the random decoding time and traffic parameters (Xu et al., 2010). Performance metrics—frame erasure, mean occupancy, throughput, and delay—are computed from the stationary distribution.

In system design, the trade-off between the number of parallel decoders and input buffer size can be optimized using DTMC-predicted performance. Minimal total area per throughput is achieved by maximizing SS6, where SS7 is the per-decoder input rate for buffer size SS8.


DTMC theory thus provides an analytically tractable foundation for modeling, estimation, performance analysis, and verification in diverse research and application contexts, with deep interplay between structure, spectral properties, and statistical inference (Halidias, 2017, Murthy, 2012, Hartmanns et al., 2 Sep 2025, Jimenez-Roa et al., 2023, Xu et al., 2010, Moothedath et al., 2017).

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Discrete-Time Markov Chain (DTMC).