Cause of instability in individual goal performance on Frozen Lake
Determine whether the observed instability in individual goal success rates during training with autonomous goal conditioning in the Frozen Lake environment is caused by policy forgetting, by the stochastic transition dynamics of Frozen Lake, or by other factors.
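One way to begin separating these explanations is to check whether a drop in a goal's success rate between two training checkpoints is larger than evaluation noise alone would produce. The sketch below is an illustrative assumption, not a procedure from the paper: it takes success counts for one goal from two frozen checkpoints and compares their Wilson score intervals. The function names `wilson_interval` and `drop_exceeds_noise`, and the 95% level (`z=1.96`), are hypothetical choices made for this example.

```python
import math


def wilson_interval(successes, episodes, z=1.96):
    """Wilson score interval for a binomial success rate."""
    if episodes <= 0:
        raise ValueError("episodes must be positive")
    p = successes / episodes
    denom = 1 + z**2 / episodes
    centre = (p + z**2 / (2 * episodes)) / denom
    half = z * math.sqrt(
        p * (1 - p) / episodes + z**2 / (4 * episodes**2)
    ) / denom
    return centre - half, centre + half


def drop_exceeds_noise(early, late, z=1.96):
    """Compare one goal's results at two frozen checkpoints.

    early, late: (successes, episodes) tuples from evaluation rollouts.
    Returns True only if the late interval lies entirely below the
    early interval, a deliberately conservative criterion.
    """
    early_low, _ = wilson_interval(*early, z=z)
    _, late_high = wilson_interval(*late, z=z)
    return late_high < early_low
```

A `True` result only indicates that binomial sampling noise in the evaluation rollouts is unlikely to account for the drop. It does not by itself establish forgetting, since other factors could still be responsible.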
References
If this is due to forgetting, the stochastic nature of the environment, or something else, is yet to be determined.
— Environment Agnostic Goal-Conditioning, A Study of Reward-Free Autonomous Learning
(2511.04598 - Åström et al., 6 Nov 2025) in Section "Goal success rates" (Frozen Lake analysis)