Dice Question Streamline Icon: https://streamlinehq.com

General continuity and stability theory for eUDRL beyond special cases

Determine a general theory establishing continuity and stability properties of episodic Upside-Down Reinforcement Learning (eUDRL) under perturbations of the transition kernel, beyond the special conditions considered in the paper and without relying on regularization.

Information Square Streamline Icon: https://streamlinehq.com

Background

The article develops continuity and stability results for eUDRL under specific conditions and demonstrates full generality for a regularized eUDRL recursion. However, the unregularized algorithm’s global continuity and stability behavior remains unresolved.

This open problem asks for lifting the current special-case results to a general setting, providing continuity and stability guarantees for eUDRL under broad assumptions.

References

While the general discussion of continuity and stability of eUDRL remains an open problem, we have found that the theory developed in this article is sufficient to address the regularized eUDRL recursion.