Tight sample complexities for RMDPs under general uncertainty sets
Establish tight sample complexity characterizations for learning distributionally robust Markov decision processes across broad families of uncertainty sets beyond total variation distance, including divergences such as chi-squared, Lp, and KL.
Sponsor
References
It remains an interesting open question to establish tight sample complexities of RMDPs over broad families of uncertainty sets.
— Statistical and Algorithmic Foundations of Reinforcement Learning
(2507.14444 - Chi et al., 19 Jul 2025) in Section 7 (Distributionally robust RL), Discussion: other uncertainty sets