Extending Transitive RL to general reward-based reinforcement learning
Establish whether Transitive RL (TRL), or other divide-and-conquer value-learning algorithms, can be generalized from goal-conditioned reinforcement learning to general reward-based reinforcement learning tasks.
References
Another open question is whether TRL (or any divide-and-conquer-style algorithm) can be extended to general reward-based RL tasks, beyond goal-conditioned RL.
— Transitive RL: Value Learning via Divide and Conquer
(Park et al., arXiv:2510.22512, 26 Oct 2025), Section 6: What's Next?