Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Towards Solving Fuzzy Tasks with Human Feedback: A Retrospective of the MineRL BASALT 2022 Competition (2303.13512v1)

Published 23 Mar 2023 in cs.AI

Abstract: To facilitate research in the direction of fine-tuning foundation models from human feedback, we held the MineRL BASALT Competition on Fine-Tuning from Human Feedback at NeurIPS 2022. The BASALT challenge asks teams to compete to develop algorithms to solve tasks with hard-to-specify reward functions in Minecraft. Through this competition, we aimed to promote the development of algorithms that use human feedback as channels to learn the desired behavior. We describe the competition and provide an overview of the top solutions. We conclude by discussing the impact of the competition and future directions for improvement.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (30)
  1. Stephanie Milani (23 papers)
  2. Anssi Kanervisto (32 papers)
  3. Karolis Ramanauskas (6 papers)
  4. Sander Schulhoff (6 papers)
  5. Brandon Houghton (13 papers)
  6. Sharada Mohanty (13 papers)
  7. Byron Galbraith (1 paper)
  8. Ke Chen (241 papers)
  9. Yan Song (91 papers)
  10. Tianze Zhou (5 papers)
  11. Bingquan Yu (1 paper)
  12. He Liu (57 papers)
  13. Kai Guan (3 papers)
  14. Yujing Hu (28 papers)
  15. Tangjie Lv (35 papers)
  16. Federico Malato (7 papers)
  17. Florian Leopold (4 papers)
  18. Amogh Raut (4 papers)
  19. Andrew Melnik (33 papers)
  20. Shu Ishida (9 papers)
Citations (15)

Summary

We haven't generated a summary for this paper yet.

X Twitter Logo Streamline Icon: https://streamlinehq.com