Emergent Tool Use From Multi-Agent Autocurricula

09/17/2019
by   Bowen Baker, et al.

Through multi-agent competition, the simple objective of hide-and-seek, and standard reinforcement learning algorithms at scale, we find that agents create a self-supervised autocurriculum inducing multiple distinct rounds of emergent strategy, many of which require sophisticated tool use and coordination. We find clear evidence of six emergent phases in agent strategy in our environment, each of which creates a new pressure for the opposing team to adapt; for instance, agents learn to build multi-object shelters using moveable boxes which in turn leads to agents discovering that they can overcome obstacles using ramps. We further provide evidence that multi-agent competition may scale better with increasing environment complexity and leads to behavior that centers around far more human-relevant skills than other self-supervised reinforcement learning methods such as intrinsic motivation. Finally, we propose transfer and fine-tuning as a way to quantitatively evaluate targeted capabilities, and we compare hide-and-seek agents to both intrinsic motivation and random initialization baselines in a suite of domain-specific intelligence tests.

research
01/19/2023

Multi-Agent Interplay in a Competitive Survival Environment

Solving hard-exploration environments is an important challenge in Reinf...
research
10/10/2017

Emergent Complexity via Multi-Agent Competition

Reinforcement learning algorithms can train agents that solve problems i...
research
08/06/2021

Semantic Tracklets: An Object-Centric Representation for Visual Multi-Agent Reinforcement Learning

Solving complex real-world tasks, e.g., autonomous fleet control, often ...
research
02/19/2019

Emergent Coordination Through Competition

We study the emergence of cooperative behaviors in reinforcement learnin...
research
07/05/2022

Learning Task Embeddings for Teamwork Adaptation in Multi-Agent Reinforcement Learning

Successful deployment of multi-agent reinforcement learning often requir...
research
08/28/2020

Investigating Taxi and Uber competition in New York City: Multi-agent modeling by reinforcement-learning

The taxi business has been overly regulated for many decades. Regulation...
research
05/15/2018

Complexity Reduction in the Negotiation of New Lexical Conventions

In the process of collectively inventing new words for new concepts in...
