Understanding the Complexity Gains of Single-Task RL with a Curriculum

12/24/2022
by   Qiyang Li, et al.
0

Reinforcement learning (RL) problems can be challenging without well-shaped rewards. Prior work on provably efficient RL methods generally proposes to address this issue with dedicated exploration strategies. However, another way to tackle this challenge is to reformulate it as a multi-task RL problem, where the task space contains not only the challenging task of interest but also easier tasks that implicitly function as a curriculum. Such a reformulation opens up the possibility of running existing multi-task RL methods as a more efficient alternative to solving a single challenging task from scratch. In this work, we provide a theoretical framework that reformulates a single-task RL problem as a multi-task RL problem defined by a curriculum. Under mild regularity conditions on the curriculum, we show that sequentially solving each task in the multi-task RL problem is more computationally efficient than solving the original single-task problem, without any explicit exploration bonuses or other exploration strategies. We also show that our theoretical insights can be translated into an effective practical learning algorithm that can accelerate curriculum learning on simulated robotic tasks.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/27/2023

Outcome-directed Reinforcement Learning by Uncertainty Temporal Distance-Aware Curriculum Goal Generation

Current reinforcement learning (RL) often suffers when solving a challen...
research
11/07/2022

Curriculum-based Asymmetric Multi-task Reinforcement Learning

We introduce CAMRL, the first curriculum-based asymmetric multi-task lea...
research
04/25/2023

Proximal Curriculum for Reinforcement Learning Agents

We consider the problem of curriculum design for reinforcement learning ...
research
02/25/2021

A Probabilistic Interpretation of Self-Paced Learning with Applications to Reinforcement Learning

Across machine learning, the use of curricula has shown strong empirical...
research
04/27/2020

Maximum Entropy Multi-Task Inverse RL

Multi-task IRL allows for the possibility that the expert could be switc...
research
06/06/2022

Effects of Reward Shaping on Curriculum Learning in Goal Conditioned Tasks

Real-time control for robotics is a popular research area in the reinfor...
research
11/12/2020

Evaluating Curriculum Learning Strategies in Neural Combinatorial Optimization

Neural combinatorial optimization (NCO) aims at designing problem-indepe...

Please sign up or login with your details

Forgot password? Click here to reset