Safe end-to-end imitation learning for model predictive control

03/27/2018
by   Keuntaek Lee, et al.
0

We propose the use of Bayesian networks, which provide both a mean value and an uncertainty estimate as output, to enhance the safety of learned control policies under circumstances in which a test-time input differs significantly from the training set. Our algorithm combines reinforcement learning and end-to-end imitation learning to simultaneously learn a control policy as well as a threshold over the predictive uncertainty of the learned model, with no hand-tuning required. Corrective action, such as a return of control to the model predictive controller or human expert, is taken when the uncertainty threshold is exceeded. We validate our method on fully-observable and vision-based partially-observable systems using cart-pole and autonomous driving simulations using deep convolutional Bayesian neural networks. We demonstrate that our method is robust to uncertainty resulting from varying system dynamics as well as from partial state observability.

READ FULL TEXT

page 2

page 9

research
10/06/2017

End-to-end Driving via Conditional Imitation Learning

Deep networks trained on demonstrations of human driving have learned to...
research
04/17/2020

Approximate Inverse Reinforcement Learning from Vision-based Imitation Learning

In this work, we present a method for obtaining an implicit objective fu...
research
05/07/2019

Uncertainty-Aware Data Aggregation for Deep Imitation Learning

Estimating statistical uncertainties allows autonomous agents to communi...
research
05/01/2023

Learning Flight Control Systems from Human Demonstrations and Real-Time Uncertainty-Informed Interventions

This paper describes a methodology for learning flight control systems f...
research
04/26/2019

Perceptual Attention-based Predictive Control

In this paper, we present a novel information processing architecture fo...
research
10/15/2018

Deep Imitative Models for Flexible Inference, Planning, and Control

Imitation learning provides an appealing framework for autonomous contro...
research
12/09/2020

Neural Rate Control for Video Encoding using Imitation Learning

In modern video encoders, rate control is a critical component and has b...

Please sign up or login with your details

Forgot password? Click here to reset