Skew-Fit: State-Covering Self-Supervised Reinforcement Learning

03/08/2019
by Vitchyr H. Pong, et al.

In standard reinforcement learning, each new skill requires a manually designed reward function, which takes considerable effort and engineering. Self-supervised goal setting has the potential to automate this process, enabling an agent to propose its own goals and acquire the skills needed to achieve them. However, such methods typically rely on manually designed goal distributions, or on heuristics that force the agent to explore a wide range of states. We propose a formal exploration objective for goal-reaching policies that maximizes state coverage. We show that this objective is equivalent to maximizing the entropy of the goal distribution together with goal-reaching performance, where goals correspond to entire states. We present an algorithm called Skew-Fit for learning such a maximum-entropy goal distribution, and show that, under certain regularity conditions, our method converges to a uniform distribution over the set of possible states, even when this set is not known beforehand. Skew-Fit enables self-supervised agents to autonomously choose and practice diverse goals. Our experiments show that it can learn a variety of manipulation tasks from images, including opening a door with a real robot, entirely from scratch and without any manually designed reward function.
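
The abstract does not spell out the update rule, so the following is only a toy sketch of how a goal distribution could be pushed toward uniform over its support. It assumes a small discrete state set, an exact refit step, and a reweighting of the form p(s)^alpha with alpha in [-1, 0); the function names `entropy` and `skew_and_refit` and the value of `alpha` are hypothetical and chosen for illustration. No policy, images, or learned density model are involved.

```python
import math


def entropy(p):
    """Shannon entropy (in nats) of a discrete distribution."""
    return -sum(x * math.log(x) for x in p if x > 0)


def skew_and_refit(p, alpha=-0.5):
    """One illustrative skew-then-refit step on a discrete distribution.

    Assumption: each state is reweighted by p(s) ** alpha with alpha in
    [-1, 0), so rarely visited states gain mass. Sampling from p and applying
    those weights gives mass proportional to p(s) ** (1 + alpha); here the
    "refit" is exact, i.e. a plain renormalization.
    """
    unnormalized = [x ** (1.0 + alpha) if x > 0 else 0.0 for x in p]
    total = sum(unnormalized)
    return [u / total for u in unnormalized]


# Start from a goal distribution concentrated on one state.
p0 = [0.7, 0.2, 0.05, 0.05]
history = [p0]
for _ in range(20):
    history.append(skew_and_refit(history[-1]))

entropies = [entropy(p) for p in history]
final = history[-1]
```

In this toy setting the entropy never decreases from one iteration to the next, and `final` approaches the uniform distribution over the four states. States with zero initial probability stay at zero, so the limit is uniform only over the support of the starting distribution.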

research
07/12/2018

Visual Reinforcement Learning with Imagined Goals

For an autonomous agent to fulfill a wide range of user-specified goals ...
research
06/17/2019

LPaintB: Learning to Paint from Self-Supervision

We present a novel reinforcement learning-based natural media painting a...
research
05/21/2020

LEAF: Latent Exploration Along the Frontier

Self-supervised goal proposal and reaching is a key component for explor...
research
05/21/2020

Dynamics-Aware Latent Space Reachability for Exploration in Temporally-Extended Tasks

Self-supervised goal proposal and reaching is a key component of efficie...
research
06/02/2021

Variational Empowerment as Representation Learning for Goal-Based Reinforcement Learning

Learning to reach goal states and learning diverse skills through mutual...
research
10/23/2019

Contextual Imagined Goals for Self-Supervised Robotic Learning

While reinforcement learning provides an appealing formalism for learnin...
research
09/30/2018

Few-Shot Goal Inference for Visuomotor Learning and Planning

Reinforcement learning and planning methods require an objective or rewa...
