Cluster-based Sampling in Hindsight Experience Replay for Robot Control

08/31/2022
by Taeyoung Kim, et al.

In multi-goal reinforcement learning, agents learn policies that achieve multiple goals from experiences gained by interacting with the environment. With a sparse binary reward, training agents is particularly challenging because successful experiences are scarce. To address this problem, hindsight experience replay (HER) generates successful experiences from unsuccessful ones. However, generating successful experiences without considering the properties of the achieved goals is less efficient. This paper proposes a novel cluster-based sampling strategy that exploits the properties of achieved goals. The proposed strategy groups episodes with different achieved goals and samples experiences in the manner of HER. The grouping uses the K-means clustering algorithm, and the cluster centroids are obtained from the distribution of failed goals, defined as the original goals that were not achieved. The method is validated experimentally on three robotic control tasks from OpenAI Gym. The results show that the proposed method significantly reduces the number of epochs required for convergence in two of the three tasks and marginally increases the success rate in the remaining one. The proposed method can also be combined with other sampling strategies for HER.
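The sampling idea summarized in the abstract can be illustrated with a rough sketch. This is not the authors' implementation: the abstract does not give the details, so the sketch assumes that centroids are fitted with plain K-means on the failed goals, that each episode is assigned to the nearest centroid by its final achieved goal, and that a batch is drawn by picking a cluster uniformly, then an episode, then relabeling with a "future"-style HER goal. The episode layout, the value of `k`, the success threshold of 0.05, and all function names are hypothetical.

```python
import math
import random

def nearest(point, centroids):
    """Index of the centroid closest to `point` (Euclidean distance)."""
    return min(range(len(centroids)), key=lambda i: math.dist(point, centroids[i]))

def kmeans(points, k, iters=20, seed=0):
    """Plain Lloyd's K-means on a list of equal-length tuples; returns k centroids."""
    rng = random.Random(seed)
    centroids = rng.sample(points, k)
    for _ in range(iters):
        groups = [[] for _ in range(k)]
        for p in points:
            groups[nearest(p, centroids)].append(p)
        # Keep the previous centroid if a cluster ends up empty.
        centroids = [
            tuple(sum(col) / len(g) for col in zip(*g)) if g else centroids[i]
            for i, g in enumerate(groups)
        ]
    return centroids

def group_episodes(episodes, centroids):
    """Map cluster index -> episode indices, keyed on each episode's final achieved goal (assumption)."""
    clusters = {}
    for idx, ep in enumerate(episodes):
        clusters.setdefault(nearest(ep["achieved"][-1], centroids), []).append(idx)
    return clusters

def sample_her(episodes, clusters, rng, threshold=0.05):
    """Pick a cluster uniformly, then an episode in it, then relabel with a later achieved goal."""
    cluster_id = rng.choice(sorted(clusters))
    ep = episodes[rng.choice(clusters[cluster_id])]
    t = rng.randrange(len(ep["achieved"]))
    future = rng.randrange(t, len(ep["achieved"]))
    new_goal = ep["achieved"][future]
    # Sparse binary reward: 0 on success, -1 otherwise (a common HER convention).
    reward = 0.0 if math.dist(ep["achieved"][t], new_goal) < threshold else -1.0
    return {"t": t, "goal": new_goal, "reward": reward}

# Toy 2-D data standing in for robot episodes (purely illustrative).
rng = random.Random(0)
episodes = []
for _ in range(30):
    x, y = rng.uniform(-1, 1), rng.uniform(-1, 1)
    achieved = [(x + 0.01 * step, y) for step in range(10)]
    desired = (rng.uniform(-1, 1), rng.uniform(-1, 1))
    episodes.append({"desired": desired, "achieved": achieved})

# Failed goals: original goals the episode did not reach.
failed_goals = [
    ep["desired"] for ep in episodes
    if math.dist(ep["achieved"][-1], ep["desired"]) >= 0.05
]
centroids = kmeans(failed_goals, k=3)
clusters = group_episodes(episodes, centroids)
batch = [sample_her(episodes, clusters, rng) for _ in range(8)]
```

Sampling the cluster before the episode is one way to spread a batch across regions of the goal space; whether the paper weights clusters uniformly is not stated in the abstract.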


