Approximability of discounted ℓ-step transition look-ahead planning
Determine whether polynomial-time approximation schemes (PTAS) exist for the discounted ℓ-step transition look-ahead planning problem in finite tabular Markov Decision Processes, or alternatively, establish inapproximability results proving that even constant-factor approximation cannot be achieved for this setting.
References
On the approximation side, it remains open whether polynomial-time approximation schemes (PTAS) exist for discounted \ell–look-ahead planning, or conversely, whether even constant-factor approximation is impossible.
— On the hardness of RL with Lookahead
(2510.19372 - Pla et al., 22 Oct 2025) in Section 6: Conclusion and future work