Curriculum goal masking for continuous deep reinforcement learning

09/17/2018
by   Manfred Eppe, et al.
0

Deep reinforcement learning has recently gained a focus on problems where policy or value functions are independent of goals. Evidence exists that the sampling of goals has a strong effect on the learning performance, but there is a lack of general mechanisms that focus on optimizing the goal sampling process. In this work, we present a simple and general goal masking method that also allows us to estimate a goal's difficulty level and thus realize a curriculum learning approach for deep RL. Our results indicate that focusing on goals with a medium difficulty level is appropriate for deep deterministic policy gradient (DDPG) methods, while an "aim for the stars and reach the moon-strategy", where hard goals are sampled much more often than simple goals, leads to the best learning performance in cases where DDPG is combined with for hindsight experience replay (HER). We demonstrate that the approach significantly outperforms standard goal sampling for different robotic object manipulation problems.

READ FULL TEXT

page 2

page 4

page 5

page 6

research
06/14/2022

Stein Variational Goal Generation For Reinforcement Learning in Hard Exploration Problems

Multi-goal Reinforcement Learning has recently attracted a large amount ...
research
06/17/2020

Automatic Curriculum Learning through Value Disagreement

Continually solving new, unsolved tasks is the key to learning diverse b...
research
12/02/2021

SparRL: Graph Sparsification via Deep Reinforcement Learning

Graph sparsification concerns data reduction where an edge-reduced graph...
research
06/25/2018

Accuracy-based Curriculum Learning in Deep Reinforcement Learning

In this paper, we investigate a new form of automated curriculum learnin...
research
08/05/2020

Follow the Object: Curriculum Learning for Manipulation Tasks with Imagined Goals

Learning robot manipulation through deep reinforcement learning in envir...
research
03/09/2023

GOATS: Goal Sampling Adaptation for Scooping with Curriculum Reinforcement Learning

In this work, we first formulate the problem of goal-conditioned robotic...

Please sign up or login with your details

Forgot password? Click here to reset