Contrastive Variational Reinforcement Learning for Complex Observations (2008.02430v2)

Published 6 Aug 2020 in cs.LG and stat.ML

Abstract: Deep reinforcement learning (DRL) has achieved significant success in various robot tasks: manipulation, navigation, etc. However, complex visual observations in natural environments remains a major challenge. This paper presents Contrastive Variational Reinforcement Learning (CVRL), a model-based method that tackles complex visual observations in DRL. CVRL learns a contrastive variational model by maximizing the mutual information between latent states and observations discriminatively, through contrastive learning. It avoids modeling the complex observation space unnecessarily, as the commonly used generative observation model often does, and is significantly more robust. CVRL achieves comparable performance with state-of-the-art model-based DRL methods on standard Mujoco tasks. It significantly outperforms them on Natural Mujoco tasks and a robot box-pushing task with complex observations, e.g., dynamic shadows. The CVRL code is available publicly at https://github.com/Yusufma03/CVRL.

Authors (4)

Xiao Ma (169 papers)
Siwei Chen (20 papers)
David Hsu (73 papers)
Wee Sun Lee (60 papers)

Citations (23)

View on Semantic Scholar

Summary

We haven't generated a summary for this paper yet.

Summarize Now

GitHub

GitHub - Yusufma03/CVRL: code for CoRL 2020 paper "Contrastive Variational Model-Based Reinforcement Learning for Complex Observations" (24 stars)

Contrastive Variational Reinforcement Learning for Complex Observations (2008.02430v2)

Summary

Related Papers

GitHub