Scalability of prior unsupervised RL approaches in complex, high-dimensional environments
Determine whether pure exploration-based unsupervised reinforcement learning methods (such as Random Network Distillation, APT, and Plan2Explore) and mutual information-based unsupervised skill discovery methods (such as DIAYN, DADS, and CIC) scale to complex environments with high intrinsic dimensionality, establishing conditions or evidence for their scalability or lack thereof.
References
While these approaches have been shown to be effective in several unsupervised RL benchmarks, it is not entirely clear whether such methods can indeed be scalable to complex environments with high intrinsic dimensionality.
— METRA: Scalable Unsupervised RL with Metric-Aware Abstraction
(2310.08887 - Park et al., 2023) in Section 1 (Introduction)