Get Back Here: Robust Imitation by Return-to-Distribution Planning

05/02/2023
by   Geoffrey Cideron, et al.
0

We consider the Imitation Learning (IL) setup where expert data are not collected on the actual deployment environment but on a different version. To address the resulting distribution shift, we combine behavior cloning (BC) with a planner that is tasked to bring the agent back to states visited by the expert whenever the agent deviates from the demonstration distribution. The resulting algorithm, POIR, can be trained offline, and leverages online interactions to efficiently fine-tune its planner to improve performance over time. We test POIR on a variety of human-generated manipulation demonstrations in a realistic robotic manipulation simulator and show robustness of the learned policy to different initial state distributions and noisy dynamics.

READ FULL TEXT

page 5

page 21

page 22

research
11/18/2020

SAFARI: Safe and Active Robot Imitation Learning with Imagination

One of the main issues in Imitation Learning is the erroneous behavior o...
research
02/28/2022

LobsDICE: Offline Imitation Learning from Observation via Stationary Distribution Correction Estimation

We consider the problem of imitation from observation (IfO), in which th...
research
07/18/2022

Back to the Manifold: Recovering from Out-of-Distribution States

Learning from previously collected datasets of expert data offers the pr...
research
05/26/2021

What data do we need for training an AV motion planner?

We investigate what grade of sensor data is required for training an imi...
research
09/03/2021

Iterative Imitation Policy Improvement for Interactive Autonomous Driving

We propose an imitation learning system for autonomous driving in urban ...
research
06/04/2023

Data Quality in Imitation Learning

In supervised learning, the question of data quality and curation has be...
research
11/13/2020

Grasping with Chopsticks: Combating Covariate Shift in Model-free Imitation Learning for Fine Manipulation

Billions of people use chopsticks, a simple yet versatile tool, for fine...

Please sign up or login with your details

Forgot password? Click here to reset