Reinforcement Learning with Success Induced Task Prioritization

12/30/2022
by   Maria Nesterova, et al.
0

Many challenging reinforcement learning (RL) problems require designing a distribution of tasks that can be applied to train effective policies. This distribution of tasks can be specified by the curriculum. A curriculum is meant to improve the results of learning and accelerate it. We introduce Success Induced Task Prioritization (SITP), a framework for automatic curriculum learning, where a task sequence is created based on the success rate of each task. In this setting, each task is an algorithmically created environment instance with a unique configuration. The algorithm selects the order of tasks that provide the fastest learning for agents. The probability of selecting any of the tasks for the next stage of learning is determined by evaluating its performance score in previous stages. Experiments were carried out in the Partially Observable Grid Environment for Multiple Agents (POGEMA) and Procgen benchmark. We demonstrate that SITP matches or surpasses the results of other curriculum design methods. Our method can be implemented with handful of minor modifications to any standard RL framework and provides useful prioritization with minimal computational overhead.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/31/2022

Teacher-student curriculum learning for reinforcement learning

Reinforcement learning (rl) is a popular paradigm for sequential decisio...
research
04/25/2023

Proximal Curriculum for Reinforcement Learning Agents

We consider the problem of curriculum design for reinforcement learning ...
research
07/11/2022

Grounding Aleatoric Uncertainty in Unsupervised Environment Design

Adaptive curricula in reinforcement learning (RL) have proven effective ...
research
06/05/2018

Mix&Match - Agent Curricula for Reinforcement Learning

We introduce Mix&Match (M&M) - a training framework designed to facilita...
research
10/19/2022

CLUTR: Curriculum Learning via Unsupervised Task Representation Learning

Reinforcement Learning (RL) algorithms are often known for sample ineffi...
research
02/25/2021

A Probabilistic Interpretation of Self-Paced Learning with Applications to Reinforcement Learning

Across machine learning, the use of curricula has shown strong empirical...
research
02/17/2021

Automated Curriculum Learning for Embodied Agents: A Neuroevolutionary Approach

We demonstrate how an evolutionary algorithm can be extended with a curr...

Please sign up or login with your details

Forgot password? Click here to reset