Deep Latent Competition: Learning to Race Using Visual Control Policies in Latent Space

02/19/2021
by   Wilko Schwarting, et al.
18

Learning competitive behaviors in multi-agent settings such as racing requires long-term reasoning about potential adversarial interactions. This paper presents Deep Latent Competition (DLC), a novel reinforcement learning algorithm that learns competitive visual control policies through self-play in imagination. The DLC agent imagines multi-agent interaction sequences in the compact latent space of a learned world model that combines a joint transition function with opponent viewpoint prediction. Imagined self-play reduces costly sample generation in the real world, while the latent representation enables planning to scale gracefully with observation dimensionality. We demonstrate the effectiveness of our algorithm in learning competitive behaviors on a novel multi-agent racing benchmark that requires planning from image observations. Code and videos available at https://sites.google.com/view/deep-latent-competition.

READ FULL TEXT

page 2

page 3

page 4

page 5

page 6

page 9

page 10

page 13

research
06/04/2019

Options as responses: Grounding behavioural hierarchies in multi-agent RL

We propose a novel hierarchical agent architecture for multi-agent reinf...
research
10/10/2017

Emergent Complexity via Multi-Agent Competition

Reinforcement learning algorithms can train agents that solve problems i...
research
10/05/2021

Influencing Towards Stable Multi-Agent Interactions

Learning in multi-agent environments is difficult due to the non-station...
research
04/21/2021

Contingencies from Observations: Tractable Contingency Planning with Learned Behavior Models

Humans have a remarkable ability to make decisions by accurately reasoni...
research
12/03/2019

Dream to Control: Learning Behaviors by Latent Imagination

Learned world models summarize an agent's experience to facilitate learn...
research
12/14/2022

Hierarchical Strategies for Cooperative Multi-Agent Reinforcement Learning

Adequate strategizing of agents behaviors is essential to solving cooper...
research
11/25/2020

TLeague: A Framework for Competitive Self-Play based Distributed Multi-Agent Reinforcement Learning

Competitive Self-Play (CSP) based Multi-Agent Reinforcement Learning (MA...

Please sign up or login with your details

Forgot password? Click here to reset