Adaptable Agent Populations via a Generative Model of Policies

07/15/2021
by   Kenneth Derek, et al.
0

In the natural world, life has found innumerable ways to survive and often thrive. Between and even within species, each individual is in some manner unique, and this diversity lends adaptability and robustness to life. In this work, we aim to learn a space of diverse and high-reward policies on any given environment. To this end, we introduce a generative model of policies, which maps a low-dimensional latent space to an agent policy space. Our method enables learning an entire population of agent policies, without requiring the use of separate policy parameters. Just as real world populations can adapt and evolve via natural selection, our method is able to adapt to changes in our environment solely by selecting for policies in latent space. We test our generative model's capabilities in a variety of environments, including an open-ended grid-world and a two-player soccer environment. Code, visualizations, and additional experiments can be found at https://kennyderek.github.io/adap/.

READ FULL TEXT
research
09/10/2018

VPE: Variational Policy Embedding for Transfer Reinforcement Learning

Reinforcement Learning methods are capable of solving complex problems, ...
research
05/16/2023

Dynamics of niche construction in adaptable populations evolving in diverse environments

In both natural and artificial studies, evolution is often seen as synon...
research
05/30/2023

Generating Behaviorally Diverse Policies with Latent Diffusion Models

Recent progress in Quality Diversity Reinforcement Learning (QD-RL) has ...
research
11/07/2018

Generative Adversarial Policy Networks for Behavioural Repertoire

Learning algorithms are enabling robots to solve increasingly challengin...
research
08/27/2018

BézierGAN: Automatic Generation of Smooth Curves from Interpretable Low-Dimensional Parameters

Many real-world objects are designed by smooth curves, especially in the...
research
09/04/2018

Recurrent World Models Facilitate Policy Evolution

A generative recurrent neural network is quickly trained in an unsupervi...
research
02/08/2013

Complexity distribution of agent policies

We analyse the complexity of environments according to the policies that...

Please sign up or login with your details

Forgot password? Click here to reset