Provably efficient reinforcement learning with function approximation
Determine whether reinforcement learning algorithms that incorporate function approximation can be designed to be provably efficient in both runtime and sample complexity, with efficiency depending on an intrinsic complexity measure of the function class rather than the number of states, thereby addressing the exploration–exploitation tradeoff in large or infinite state spaces.
Sponsor
References
As a result, a core RL question remains open: how can we design provably efficient RL algorithms that incorporate function approximation? This question persists even in a basic setting with linear dynamics and linear rewards, for which only linear function approximation is needed.
— Provably Efficient Reinforcement Learning with Linear Function Approximation
(1907.05388 - Jin et al., 2019) in Abstract