Paired Open-Ended Trailblazer (POET): Endlessly Generating Increasingly Complex and Diverse Learning Environments and Their Solutions

01/07/2019
by   Rui Wang, et al.
8

While the history of machine learning so far encompasses a series of problems posed by researchers and algorithms that learn their solutions, an important question is whether the problems themselves can be generated by the algorithm at the same time as they are being solved. Such a process would in effect build its own diverse and expanding curricula, and the solutions to problems at various stages would become stepping stones towards solving even more challenging problems later in the process. The Paired Open-Ended Trailblazer (POET) algorithm introduced in this paper does just that: it pairs the generation of environmental challenges and the optimization of agents to solve those challenges. It simultaneously explores many different paths through the space of possible problems and solutions and, critically, allows these stepping-stone solutions to transfer between problems if better, catalyzing innovation. The term open-ended signifies the intriguing potential for algorithms like POET to continue to create novel and increasingly complex capabilities without bound. The results show that POET produces a diverse range of sophisticated behaviors that solve a wide range of environmental challenges, many of which cannot be solved by direct optimization alone, or even through a direct, single-path curriculum-based control algorithm introduced to highlight the critical role of open-endedness in solving ambitious challenges. The ability to transfer solutions from one environment to another proves essential to unlocking the full potential of the system as a whole, demonstrating the unpredictable nature of fortuitous stepping stones. We hope that POET will inspire a new push towards open-ended discovery across many domains, where algorithms like POET can blaze a trail through their interesting possible manifestations and solutions.

READ FULL TEXT

page 11

page 17

page 18

research
03/19/2020

Enhanced POET: Open-Ended Reinforcement Learning through Unbounded Invention of Learning Challenges and their Solutions

Creating open-ended algorithms, which generate their own never-ending st...
research
10/31/2014

A Comparison of learning algorithms on the Arcade Learning Environment

Reinforcement learning agents have traditionally been evaluated on small...
research
02/12/2023

MarioGPT: Open-Ended Text2Level Generation through Large Language Models

Procedural Content Generation (PCG) algorithms provide a technique to ge...
research
12/30/2019

Boldly Going Where No Prover Has Gone Before

I argue that the most interesting goal facing researchers in automated r...
research
05/31/2023

Space Net Optimization

Most metaheuristic algorithms rely on a few searched solutions to guide ...
research
02/17/2021

Automated Curriculum Learning for Embodied Agents: A Neuroevolutionary Approach

We demonstrate how an evolutionary algorithm can be extended with a curr...
research
05/17/2020

Tackling the DMN Challenges with cDMN: a Tight Integration of DMN and constraint reasoning

This paper describes an extension to the DMN standard, called cDMN. It a...

Please sign up or login with your details

Forgot password? Click here to reset