Learning with AMIGo: Adversarially Motivated Intrinsic Goals

06/22/2020
by Andres Campero, et al.

A key challenge for reinforcement learning (RL) consists of learning in environments with sparse extrinsic rewards. In contrast to current RL methods, humans are able to learn new skills with little or no reward by using various forms of intrinsic motivation. We propose AMIGo, a novel agent incorporating a goal-generating teacher that proposes Adversarially Motivated Intrinsic Goals to train a goal-conditioned "student" policy in the absence of (or alongside) environment reward. Specifically, through a simple but effective "constructively adversarial" objective, the teacher learns to propose increasingly challenging—yet achievable—goals that allow the student to learn general skills for acting in a new environment, independent of the task to be solved. We show that our method generates a natural curriculum of self-proposed goals which ultimately allows the agent to solve challenging procedurally-generated tasks where other forms of intrinsic motivation and state-of-the-art RL methods fail.
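
The abstract does not spell out the "constructively adversarial" objective. One plausible reading is sketched below: the teacher is rewarded when the student reaches a proposed goal but needs at least some threshold number of steps to do so, and penalized when the goal is either too easy or never reached. The function names, the `threshold` parameter, and the `alpha`/`beta` values are illustrative assumptions, not details taken from the text above.

```python
from typing import Optional


def teacher_reward(steps_to_goal: Optional[int], threshold: int,
                   alpha: float = 1.0, beta: float = 1.0) -> float:
    """Hypothetical teacher reward for one proposed goal.

    steps_to_goal is None when the student never reached the goal.
    """
    if steps_to_goal is None:
        return -beta   # assumed: unreachable goals are penalized
    if steps_to_goal >= threshold:
        return alpha   # assumed: achievable but hard enough
    return -beta       # assumed: goal was too easy


def student_intrinsic_reward(reached_goal: bool) -> float:
    """Hypothetical student reward: paid only for reaching the goal."""
    return 1.0 if reached_goal else 0.0


# Three outcomes for the same assumed difficulty threshold of 5 steps.
too_hard = teacher_reward(None, threshold=5)
too_easy = teacher_reward(3, threshold=5)
just_right = teacher_reward(7, threshold=5)
```

Under these assumptions, raising `threshold` as the student improves would push the teacher toward progressively harder goals, which is one way the "natural curriculum" described above could emerge.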

research
10/28/2022

Goal Exploration Augmentation via Pre-trained Skills for Sparse-Reward Long-Horizon Goal-Conditioned Reinforcement Learning

Reinforcement learning (RL) often struggles to accomplish a sparse-rewar...
research
12/17/2020

Intrinsically Motivated Goal-Conditioned Reinforcement Learning: a Short Survey

Building autonomous machines that can explore open-ended environments, d...
research
03/04/2022

AutoDIME: Automatic Design of Interesting Multi-Agent Environments

Designing a distribution of environments in which RL agents can learn in...
research
12/07/2021

Information is Power: Intrinsic Control via Information Capture

Humans and animals explore their environment and acquire useful skills e...
research
06/23/2019

Neural networks with motivation

Motivational salience is a mechanism that determines an organism's curre...
research
04/20/2016

Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation

Learning goal-directed behavior in environments with sparse feedback is ...
research
03/29/2019

Learning Good Representation via Continuous Attention

In this paper we present our scientific discovery that good representati...
