2000 character limit reached
Rates of Convergence in the Central Limit Theorem for Markov Chains, with an Application to TD Learning (2401.15719v3)
Published 28 Jan 2024 in math.PR, cs.LG, cs.SY, eess.SY, and math.OC
Abstract: We prove a non-asymptotic central limit theorem for vector-valued martingale differences using Stein's method, and use Poisson's equation to extend the result to functions of Markov Chains. We then show that these results can be applied to establish a non-asymptotic central limit theorem for Temporal Difference (TD) learning with averaging.