BYOL-Explore: Exploration by Bootstrapped Prediction (2206.08332v1)

Published 16 Jun 2022 in cs.LG, cs.AI, and stat.ML

Abstract: We present BYOL-Explore, a conceptually simple yet general approach for curiosity-driven exploration in visually-complex environments. BYOL-Explore learns a world representation, the world dynamics, and an exploration policy all-together by optimizing a single prediction loss in the latent space with no additional auxiliary objective. We show that BYOL-Explore is effective in DM-HARD-8, a challenging partially-observable continuous-action hard-exploration benchmark with visually-rich 3-D environments. On this benchmark, we solve the majority of the tasks purely through augmenting the extrinsic reward with BYOL-Explore s intrinsic reward, whereas prior work could only get off the ground with human demonstrations. As further evidence of the generality of BYOL-Explore, we show that it achieves superhuman performance on the ten hardest exploration games in Atari while having a much simpler design than other competitive agents.

Authors (14)

Zhaohan Daniel Guo (15 papers)
Shantanu Thakoor (15 papers)
Miruna Pîslar (10 papers)
Bernardo Avila Pires (21 papers)
Florent Altché (18 papers)
Corentin Tallec (16 papers)
Alaa Saade (19 papers)
Daniele Calandriello (34 papers)
Jean-Bastien Grill (13 papers)
Yunhao Tang (63 papers)
Michal Valko (91 papers)
Rémi Munos (121 papers)
Mohammad Gheshlaghi Azar (31 papers)
Bilal Piot (40 papers)

Citations (60)

View on Semantic Scholar

Summary

We haven't generated a summary for this paper yet.

Summarize Now

BYOL-Explore: Exploration by Bootstrapped Prediction (2206.08332v1)

Summary

Related Papers