Self-Paced Contextual Reinforcement Learning

10/07/2019
by Pascal Klink, et al.

Generalization and adaptation of learned skills to novel situations are core requirements for intelligent autonomous robots. Although contextual reinforcement learning provides a principled framework for learning and generalizing behaviors across related tasks, it generally relies on uninformed sampling of environments from an unknown, uncontrolled context distribution, and so misses the benefits of structured, sequential learning. We introduce a novel relative entropy reinforcement learning algorithm that gives the agent the freedom to control the intermediate task distribution, allowing it to progress gradually towards the target context distribution. Empirical evaluation shows that the proposed curriculum learning scheme drastically improves sample efficiency and enables learning in scenarios with both broad and sharp target context distributions, in which classical approaches perform sub-optimally.

research
08/02/2020

Curriculum Learning with a Progression Function

Curriculum Learning for Reinforcement Learning is an increasingly popula...
research
04/29/2023

Meta-Reinforcement Learning Based on Self-Supervised Task Representation Learning

Meta-reinforcement learning enables artificial agents to learn from rela...
research
02/09/2022

Contextualize Me – The Case for Context in Reinforcement Learning

While Reinforcement Learning (RL) has made great strides towards solving...
research
10/19/2022

CLUTR: Curriculum Learning via Unsupervised Task Representation Learning

Reinforcement Learning (RL) algorithms are often known for sample ineffi...
research
12/29/2022

Backward Curriculum Reinforcement Learning

The current reinforcement learning algorithm uses forward-generated traj...
research
10/18/2022

Curriculum Reinforcement Learning using Optimal Transport via Gradual Domain Adaptation

Curriculum Reinforcement Learning (CRL) aims to create a sequence of tas...
research
08/18/2011

Feature Reinforcement Learning In Practice

Following a recent surge in using history-based methods for resolving pe...
