Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

BYOL-Explore: Exploration by Bootstrapped Prediction (2206.08332v1)

Published 16 Jun 2022 in cs.LG, cs.AI, and stat.ML

Abstract: We present BYOL-Explore, a conceptually simple yet general approach for curiosity-driven exploration in visually-complex environments. BYOL-Explore learns a world representation, the world dynamics, and an exploration policy all-together by optimizing a single prediction loss in the latent space with no additional auxiliary objective. We show that BYOL-Explore is effective in DM-HARD-8, a challenging partially-observable continuous-action hard-exploration benchmark with visually-rich 3-D environments. On this benchmark, we solve the majority of the tasks purely through augmenting the extrinsic reward with BYOL-Explore s intrinsic reward, whereas prior work could only get off the ground with human demonstrations. As further evidence of the generality of BYOL-Explore, we show that it achieves superhuman performance on the ten hardest exploration games in Atari while having a much simpler design than other competitive agents.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (14)
  1. Zhaohan Daniel Guo (15 papers)
  2. Shantanu Thakoor (15 papers)
  3. Miruna Pîslar (10 papers)
  4. Bernardo Avila Pires (21 papers)
  5. Florent Altché (18 papers)
  6. Corentin Tallec (16 papers)
  7. Alaa Saade (19 papers)
  8. Daniele Calandriello (34 papers)
  9. Jean-Bastien Grill (13 papers)
  10. Yunhao Tang (63 papers)
  11. Michal Valko (91 papers)
  12. Rémi Munos (121 papers)
  13. Mohammad Gheshlaghi Azar (31 papers)
  14. Bilal Piot (40 papers)
Citations (60)

Summary

We haven't generated a summary for this paper yet.