RIME: Robust Preference-based Reinforcement Learning with Noisy Preferences (2402.17257v4)
Abstract: Preference-based Reinforcement Learning (PbRL) circumvents the need for reward engineering by harnessing human preferences as the reward signal. However, current PbRL methods depend heavily on high-quality feedback from domain experts, which makes them brittle to noisy preference labels. In this paper, we present RIME, a robust PbRL algorithm for effective reward learning from noisy preferences. Our method uses a sample-selection-based discriminator to dynamically filter out noise and ensure robust training. To counteract the cumulative error arising from incorrect selection, we propose warm-starting the reward model, which also bridges the performance gap during the transition from pre-training to online training in PbRL. Experiments on robotic manipulation and locomotion tasks demonstrate that RIME significantly enhances the robustness of the state-of-the-art PbRL method. Code is available at https://github.com/CJReinforce/RIME_ICML2024.
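The core mechanism the abstract describes, filtering noisy preference labels before they reach the reward update, can be illustrated with a small-loss-style selection rule on the Bradley-Terry cross-entropy. The sketch below is a minimal, assumed implementation rather than RIME's released code: names such as `RewardModel`, `segment_return`, and the fixed threshold `tau` are illustrative, and RIME's actual discriminator uses a dynamically updated threshold rather than a constant cutoff.

```python
import torch
import torch.nn as nn

class RewardModel(nn.Module):
    """Illustrative MLP reward model r(s, a); a stand-in, not RIME's exact network."""
    def __init__(self, obs_dim, act_dim, hidden=256):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(obs_dim + act_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, hidden), nn.ReLU(),
            nn.Linear(hidden, 1),
        )

    def forward(self, obs, act):
        return self.net(torch.cat([obs, act], dim=-1))

def segment_return(model, obs, act):
    # Sum predicted rewards over a trajectory segment: (B, T, ...) -> (B,)
    return model(obs, act).squeeze(-1).sum(dim=1)

def filtered_preference_loss(model, seg0, seg1, labels, tau):
    """Bradley-Terry cross-entropy with small-loss sample selection.

    seg0, seg1: dicts with 'obs' (B, T, obs_dim) and 'act' (B, T, act_dim)
    labels:     (B,) in {0, 1}, possibly corrupted by annotator noise
    tau:        loss threshold (assumed fixed here); samples whose
                cross-entropy exceeds tau are treated as noisy and dropped
    """
    r0 = segment_return(model, seg0['obs'], seg0['act'])
    r1 = segment_return(model, seg1['obs'], seg1['act'])
    logits = torch.stack([r0, r1], dim=-1)            # (B, 2) segment returns
    per_sample = nn.functional.cross_entropy(
        logits, labels.long(), reduction='none')      # (B,) per-pair CE loss
    trusted = per_sample.detach() <= tau              # small-loss selection mask
    if trusted.sum() == 0:
        # No trusted pairs this batch: return a zero loss that keeps the graph.
        return per_sample.sum() * 0.0
    return per_sample[trusted].mean()
```

A fixed `tau` is the simplest possible selection rule; the warm start mentioned in the abstract matters precisely because a poorly initialized reward model makes early per-sample losses uninformative, so correct labels can be discarded and the selection error compounds over training.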
Authors: Jie Cheng, Gang Xiong, Xingyuan Dai, Qinghai Miao, Yisheng Lv, Fei-Yue Wang