Adaptive $Q$-Aid for Conditional Supervised Learning in Offline Reinforcement Learning (2402.02017v2)

Published 3 Feb 2024 in cs.LG

Abstract: Offline reinforcement learning (RL) has progressed with return-conditioned supervised learning (RCSL), but its lack of stitching ability remains a limitation. We introduce $Q$-Aided Conditional Supervised Learning (QCS), which effectively combines the stability of RCSL with the stitching capability of $Q$-functions. By analyzing $Q$-function over-generalization, which impairs stable stitching, QCS adaptively integrates $Q$-aid into RCSL's loss function based on trajectory return. Empirical results show that QCS significantly outperforms RCSL and value-based methods, consistently achieving or exceeding the maximum trajectory returns across diverse offline RL benchmarks.
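Below is a minimal, hypothetical sketch of how the adaptive Q-aid described in the abstract could be blended into an RCSL objective. The policy and Q-network interfaces, the loss forms, and the return-based weighting schedule are illustrative assumptions for exposition, not the paper's exact formulation.

    # Hypothetical QCS-style training objective (illustration only, not the authors' code).
    # Assumptions: policy(states, returns_to_go) -> predicted actions,
    # q_net(states, actions) -> Q-values, and a weight that shrinks the Q-aid
    # for trajectories whose return is already near the dataset maximum.
    import torch
    import torch.nn.functional as F

    def qcs_loss(policy, q_net, states, actions, returns_to_go, traj_return, max_return):
        # RCSL term: imitate dataset actions conditioned on return-to-go.
        pred_actions = policy(states, returns_to_go)
        rcsl = F.mse_loss(pred_actions, actions)

        # Q-aid term: push the conditioned policy toward actions the Q-function rates highly.
        q_aid = -q_net(states, pred_actions).mean()

        # Adaptive coefficient: rely less on the Q-function when the trajectory
        # already achieves a high return (placeholder schedule, an assumption here).
        w = torch.clamp(1.0 - traj_return / max_return, min=0.0)

        return rcsl + w * q_aid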

Authors (4)
  1. Jeonghye Kim (5 papers)
  2. Suyoung Lee (13 papers)
  3. Woojun Kim (20 papers)
  4. Youngchul Sung (48 papers)
Citations (1)
