Retrospective on the 2021 BASALT Competition on Learning from Human Feedback

04/14/2022
by Rohin Shah, et al.

We held the first-ever MineRL Benchmark for Agents that Solve Almost-Lifelike Tasks (MineRL BASALT) Competition at the Thirty-fifth Conference on Neural Information Processing Systems (NeurIPS 2021). The goal of the competition was to promote research towards agents that use learning from human feedback (LfHF) techniques to solve open-world tasks. Rather than mandating the use of LfHF techniques, we described four tasks in natural language to be accomplished in the video game Minecraft, and allowed participants to use any approach they wanted to build agents that could accomplish the tasks. Teams developed a diverse range of LfHF algorithms across a variety of possible human feedback types. The three winning teams implemented significantly different approaches while achieving similar performance. Interestingly, their approaches performed well on different tasks, validating our choice of tasks to include in the competition. While the outcomes validated the design of our competition, we did not get as many participants and submissions as our sister competition, MineRL Diamond. We speculate about the causes of this problem and suggest improvements for future iterations of the competition.


research
03/23/2023

Towards Solving Fuzzy Tasks with Human Feedback: A Retrospective of the MineRL BASALT 2022 Competition

To facilitate research in the direction of fine-tuning foundation models...
research
05/16/2023

ICDAR 2023 Competition on Hierarchical Text Detection and Recognition

We organize a competition on hierarchical text detection and recognition...
research
10/24/2014

Detecting Figures and Part Labels in Patents: Competition-Based Development of Image Processing Algorithms

We report the findings of a month-long online competition in which parti...
research
08/09/2023

AI4GCC – Track 3: Consumption and the Challenges of Multi-Agent RL

The AI4GCC competition presents a bold step forward in the direction of ...
research
07/05/2021

The MineRL BASALT Competition on Learning from Human Feedback

The last decade has seen a significant increase of interest in deep lear...
research
08/15/2023

The $10 Million ANA Avatar XPRIZE Competition Advanced Immersive Telepresence Systems

The $10M ANA Avatar XPRIZE aimed to create avatar systems that can transp...
research
08/06/2022

Improving Aircraft Localization: Experiences and Lessons Learned from an Open Competition

Knowledge about the exact positioning of aircraft is crucial in many set...
