Deep Imitative Models for Flexible Inference, Planning, and Control

10/15/2018
by   Nicholas Rhinehart, et al.
0

Imitation learning provides an appealing framework for autonomous control: in many tasks, demonstrations of preferred behavior can be readily obtained from human experts, removing the need for costly and potentially dangerous online data collection in the real world. However, policies learned with imitation learning have limited flexibility to accommodate varied goals at test time. Model-based reinforcement learning (MBRL) offers considerably more flexibility, since a predictive model learned from data can be used to achieve various goals at test time. However, MBRL suffers from two shortcomings. First, the predictive model does not help to choose desired or safe outcomes -- it reasons only about what is possible, not what is preferred. Second, MBRL typically requires additional online data collection to ensure that the model is accurate in those situations that are actually encountered when attempting to achieve test time goals. Collecting this data with a partially trained model can be dangerous and time-consuming. In this paper, we aim to combine the benefits of imitation learning and MBRL, and propose imitative models: probabilistic predictive models able to plan expert-like trajectories to achieve arbitrary goals. We find this method substantially outperforms both direct imitation and MBRL in a simulated autonomous driving task, and can be learned efficiently from a fixed set of expert demonstrations without additional online data collection. We also show our model can flexibly incorporate user-supplied costs as test-time, can plan to sequences of goals, and can even perform well with imprecise goals, including goals on the wrong side of the road.

READ FULL TEXT

page 2

page 4

page 7

page 9

page 10

research
04/07/2022

Imitating, Fast and Slow: Robust learning from demonstrations via decision-time planning

The goal of imitation learning is to mimic expert behavior from demonstr...
research
10/13/2017

Burn-In Demonstrations for Multi-Modal Imitation Learning

Recent work on imitation learning has generated policies that reproduce ...
research
03/27/2018

Safe end-to-end imitation learning for model predictive control

We propose the use of Bayesian networks, which provide both a mean value...
research
11/05/2020

HILONet: Hierarchical Imitation Learning from Non-Aligned Observations

It is challenging learning from demonstrated observation-only trajectori...
research
11/06/2022

Leveraging Haptic Feedback to Improve Data Quality and Quantity for Deep Imitation Learning Models

Learning from demonstration (LfD) is a proven technique to teach robots ...
research
03/29/2022

ReIL: A Framework for Reinforced Intervention-based Imitation Learning

Compared to traditional imitation learning methods such as DAgger and DA...
research
05/01/2023

Learning Flight Control Systems from Human Demonstrations and Real-Time Uncertainty-Informed Interventions

This paper describes a methodology for learning flight control systems f...

Please sign up or login with your details

Forgot password? Click here to reset