2000 character limit reached
Optimistic Estimation of Convergence in Markov Chains with the Average-Mixing Time (2402.10506v2)
Published 16 Feb 2024 in math.ST, math.PR, and stat.TH
Abstract: The convergence rate of a Markov chain to its stationary distribution is typically assessed using the concept of total variation mixing time. However, this worst-case measure often yields pessimistic estimates and is challenging to infer from observations. In this paper, we advocate for the use of the average-mixing time as a more optimistic and demonstrably easier-to-estimate alternative. We further illustrate its applicability across a range of settings, from two-point to countable spaces, and discuss some practical implications.
- A. Agarwal and J. C. Duchi. The generalization ability of online algorithms for dependent data. IEEE Transactions on Information Theory, 59(1):573–587, 2012.
- Mixing and average mixing times for general Markov processes. Canadian Mathematical Bulletin, 64(3):541–552, 2021.
- Random walks on the random graph. The Annals of Probability, 46(1):456–490, 2018.
- S. Bernstein. Sur l’extension du théorème limite du calcul des probabilités aux sommes de quantités dépendantes. Mathematische Annalen, 97:1–59, 1927.
- R. C. Bradley. Basic properties of strong mixing conditions. a survey and some open questions. Probability Surveys, 2:107–144, 2005.
- Graphs of linear growth have bounded treewidth. arXiv preprint arXiv:2210.13720, 2022.
- S. Chib and E. Greenberg. Understanding the Metropolis-Hastings algorithm. The american statistician, 49(4):327–335, 1995.
- Chernoff-Hoeffding bounds for Markov chains: Generalized and simplified. In 29th International Symposium on Theoretical Aspects of Computer Science (STACS 2012). Schloss Dagstuhl-Leibniz-Zentrum fuer Informatik, 2012.
- On the Lambert W function. Advances in Computational mathematics, 5:329–359, 1996.
- Y. A. Davydov. Convergence of distributions generated by stationary stochastic processes. Theory of Probability & Its Applications, 13(4):691–696, 1968.
- Y. A. Davydov. Mixing conditions for markov chains. Theory of Probability & Its Applications, 18(2):312–328, 1974.
- Markov chains. Springer, 2018.
- P. Doukhan. Mixing: properties and examples, volume 85. Springer Science & Business Media, 2012.
- E. Eberlein. Weak convergence of partial sums of absolutely regular sequences. Statistics & probability letters, 2(5):291–293, 1984.
- Speeding up random walk mixing by starting from a uniform vertex. arXiv e-prints, pages arXiv–2208, 2022.
- W. Hastings. Monte carlo sampling methods using markov chains and their applications. Biometrika, pages 97–109, 1970.
- W. Hoeffding. Probability inequalities for sums of bounded random variables. Journal of the American Statistical Association, 58(301):13–30, 1963. doi: 10.1080/01621459.1963.10500830.
- Mixing time estimation in reversible Markov chains from a single sample path. Ann. Appl. Probab., 29(4):2439–2480, 08 2019. doi: 10.1214/18-AAP1457. URL https://doi.org/10.1214/18-AAP1457.
- I. A. Ibragimov. Some limit theorems for stationary processes. Theory of Probability & Its Applications, 7(4):349–382, 1962.
- Finite Markov chains: with a new appendix ”Generalization of a fundamental matrix”. Springer, 1983.
- A. Khaleghi and G. Lugosi. Inferring the mixing properties of an ergodic process. arXiv preprint arXiv:2106.07054, 2021.
- I. Kontoyiannis and S. P. Meyn. Geometric ergodicity and the spectral gap of non-reversible Markov chains. Probability Theory and Related Fields, 154(1-2):327–339, 2012.
- Markov chains and mixing times, second edition. American Mathematical Soc., 2009.
- L. Lovász and R. Kannan. Faster mixing via average conductance. In Proceedings of the thirty-first annual ACM symposium on Theory of computing, pages 282–287, 1999.
- Estimating beta-mixing coefficients via histograms. Electronic Journal of Statistics, 9(2):2855 – 2883, 2015. doi: 10.1214/15-EJS1094. URL https://doi.org/10.1214/15-EJS1094.
- Equation of state calculations by fast computing machines. The journal of chemical physics, 21(6):1087–1092, 1953.
- A. Y. Mitrophanov. Stability and exponential convergence of continuous-time markov chains. Journal of applied probability, 40(4):970–979, 2003.
- M. Mohri and A. Rostamizadeh. Stability bounds for stationary ϕitalic-ϕ\phiitalic_ϕ-mixing and β𝛽\betaitalic_β-mixing processes. Journal of Machine Learning Research, 11(26):789–814, 2010. URL http://jmlr.org/papers/v11/mohri10a.html.
- F. Münch and J. Salez. Mixing time and expansion of non-negatively curved markov chains. arXiv preprint arXiv:2206.08294, 2022.
- Split conformal prediction for dependent data. arXiv preprint arXiv:2203.15885, 2022.
- R. Ortner. Regret bounds for reinforcement learning via Markov chain concentration. Journal of Artificial Intelligence Research, 67:115–128, 2020.
- D. Paulin. Concentration inequalities for Markov chains by Marton couplings and spectral methods. Electron. J. Probab, 20(79):1–32, 2015.
- Y. Peres and P. Sousi. Mixing times are hitting times of large sets. Journal of Theoretical Probability, 28(2):488–519, 2015.
- E. Rio. Covariance inequalities for strongly mixing processes. In Annales de l’IHP Probabilités et statistiques, volume 29, pages 587–597, 1993.
- E. Rio. Théorie asymptotique des processus aléatoires faiblement dépendants, volume 31. Springer Science & Business Media, 1999.
- M. Rosenblatt. A central limit theorem and a strong mixing condition. Proceedings of the national Academy of Sciences, 42(1):43–47, 1956.
- A. Sinclair. Algorithms for random generation and counting: a Markov chain approach. Springer Science & Business Media, 2012.
- V. I. Trofimov. Graphs with polynomial growth. Mathematics of the USSR-Sbornik, 51(2):405, 1985.
- M. Vidyasagar. Learning and generalisation: with applications to neural networks. Springer Science & Business Media, 2013.
- V. Volkonskii and Y. A. Rozanov. Some limit theorems for random functions. i. Theory of Probability & Its Applications, 4(2):178–197, 1959.
- G. Wolfer. Empirical and instance-dependent estimation of Markov chain and mixing time. Scandinavian Journal of Statistics, 2023+. doi: 10.1111/sjos.12686.
- G. Wolfer and A. Kontorovich. Statistical estimation of ergodic Markov chain kernel over discrete state space. Bernoulli, 27(1):532–553, 02 2021. doi: 10.3150/20-BEJ1248. URL https://doi.org/10.3150/20-BEJ1248.
- G. Wolfer and A. Kontorovich. Improved estimation of relaxation time in nonreversible Markov chains. Ann. Appl. Probab., 34(1A):249–276, 2024. ISSN 1050-5164. doi: 10.1214/23-AAP1963.
- G. Wolfer and S. Watanabe. Geometric aspects of data-processing of Markov chains. arXiv:2203.04575, 2022.
- B. Yu. Density estimation in the l∞superscript𝑙l^{\infty}italic_l start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT norm for dependent data with applications to the gibbs sampler. The Annals of Statistics, pages 711–735, 1993.
- B. Yu. Rates of convergence for empirical processes of stationary mixing sequences. The Annals of Probability, pages 94–116, 1994.
- L. Zanetti and S. John. Personal communication, 2023.