Automated curriculum generation for Policy Gradients from Demonstrations

12/01/2019
by Anirudh Srinivasan, et al.

In this paper, we present a technique that improves the training of a reinforcement learning (RL) agent for instruction following. We develop a training curriculum that uses a small number of expert demonstrations and trains the agent in a way that parallels one of the ways humans learn complex tasks, i.e., by starting from the goal and working backwards. We test our method on the BabyAI platform and show improved sample efficiency on some of its tasks compared to a proximal policy optimization (PPO) baseline.
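
The idea of starting from the goal and working backwards can be illustrated with a small scheduler. This is only a minimal sketch under stated assumptions, not the paper's implementation: the abstract does not specify how stages are advanced, so the class name BackwardCurriculum, the success-rate threshold, the evaluation window, and the step size are all hypothetical choices. The sketch assumes the agent is reset to the state at index start_index of an expert demonstration, and that the start is moved earlier in the demonstration once the agent succeeds often enough from the current start.

```python
from dataclasses import dataclass


@dataclass
class BackwardCurriculum:
    """Hypothetical scheduler: pick a start state along a demonstration,
    beginning near the goal and moving backwards as the agent improves."""

    demo_length: int                 # number of steps in the expert demonstration
    step_back: int = 1               # demo steps to move the start back per stage (assumed)
    success_threshold: float = 0.8   # promotion criterion (assumed)
    window: int = 20                 # episodes per evaluation window (assumed)

    def __post_init__(self) -> None:
        # First stage: start step_back steps before the end of the demonstration.
        self.start_index = max(self.demo_length - self.step_back, 0)
        self._results: list[bool] = []

    def record(self, success: bool) -> None:
        """Log one episode outcome; after a full window, decide whether to advance."""
        self._results.append(success)
        if len(self._results) < self.window:
            return
        rate = sum(self._results) / len(self._results)
        self._results.clear()
        if rate >= self.success_threshold and self.start_index > 0:
            # Move the start state earlier in the demonstration.
            self.start_index = max(self.start_index - self.step_back, 0)

    @property
    def done(self) -> bool:
        """True once episodes start from the demonstration's initial state."""
        return self.start_index == 0


# Toy usage: pretend the agent succeeds in every episode.
curriculum = BackwardCurriculum(demo_length=5, window=4)
while not curriculum.done:
    curriculum.record(True)
```

In an actual training loop, each episode would reset the environment to the demonstration state at curriculum.start_index, run the policy (for example with PPO updates), and call record with the episode outcome. How such a reset is performed on BabyAI is not described in the abstract and is left out here.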

research
06/16/2021

Automatic Curricula via Expert Demonstrations

We propose Automatic Curricula via Expert Demonstrations (ACED), a reinf...
research
10/20/2022

Task Phasing: Automated Curriculum Learning from Demonstrations

Applying reinforcement learning (RL) to sparse reward domains is notorio...
research
05/11/2022

Learning to Guide Multiple Heterogeneous Actors from a Single Human Demonstration via Automatic Curriculum Learning in StarCraft II

Traditionally, learning from human demonstrations via direct behavior cl...
research
02/23/2021

School of hard knocks: Curriculum analysis for Pommerman with a fixed computational budget

Pommerman is a hybrid cooperative/adversarial multi-agent environment, w...
research
08/31/2023

Curriculum Proximal Policy Optimization with Stage-Decaying Clipping for Self-Driving at Unsignalized Intersections

Unsignalized intersections are typically considered as one of the most r...
research
07/18/2018

Backplay: "Man muss immer umkehren"

A long-standing problem in model free reinforcement learning (RL) is tha...
research
10/21/2019

Self-Educated Language Agent With Hindsight Experience Replay For Instruction Following

Language creates a compact representation of the world and allows the de...
