DeepAI AI Chat
Log In Sign Up

Divide Conquer Imitation Learning

04/15/2022
by   Alexandre Chenu, et al.
UPMC
17

When cast into the Deep Reinforcement Learning framework, many robotics tasks require solving a long horizon and sparse reward problem, where learning algorithms struggle. In such context, Imitation Learning (IL) can be a powerful approach to bootstrap the learning process. However, most IL methods require several expert demonstrations which can be prohibitively difficult to acquire. Only a handful of IL algorithms have shown efficiency in the context of an extreme low expert data regime where a single expert demonstration is available. In this paper, we present a novel algorithm designed to imitate complex robotic tasks from the states of an expert trajectory. Based on a sequential inductive bias, our method divides the complex task into smaller skills. The skills are learned into a goal-conditioned policy that is able to solve each skill individually and chain skills to solve the entire task. We show that our method imitates a non-holonomic navigation task and scales to a complex simulated robotic manipulation task with very high sample efficiency.

READ FULL TEXT

page 1

page 6

02/28/2022

LISA: Learning Interpretable Skill Abstractions from Language

Learning policies that effectually utilize language instructions in comp...
11/09/2022

Leveraging Sequentiality in Reinforcement Learning from a Single Demonstration

Deep Reinforcement Learning has been successfully applied to learn robot...
03/25/2021

Self-Imitation Learning by Planning

Imitation learning (IL) enables robots to acquire skills quickly by tran...
10/25/2019

Relay Policy Learning: Solving Long-Horizon Tasks via Imitation and Reinforcement Learning

We present relay policy learning, a method for imitation and reinforceme...
02/25/2020

Scalable Multi-Task Imitation Learning with Autonomous Improvement

While robot learning has demonstrated promising results for enabling rob...
10/16/2022

Towards an Interpretable Hierarchical Agent Framework using Semantic Goals

Learning to solve long horizon temporally extended tasks with reinforcem...
11/09/2021

AW-Opt: Learning Robotic Skills with Imitation and Reinforcement at Scale

Robotic skills can be learned via imitation learning (IL) using user-pro...