TaSIL: Taylor Series Imitation Learning

05/30/2022 ∙ by Daniel Pfrommer, et al.

We propose Taylor Series Imitation Learning (TaSIL), a simple augmentation to standard behavior cloning losses in the context of continuous control. TaSIL penalizes deviations in the higher-order Taylor series terms between the learned and expert policies. We show that experts satisfying a notion of incremental input-to-state stability are easy to learn, in the sense that a small TaSIL-augmented imitation loss over expert trajectories guarantees a small imitation loss over trajectories generated by the learned policy. We provide sample-complexity bounds for TaSIL that scale as Õ(1/n) in the realizable setting, for n the number of expert demonstrations. Finally, we demonstrate experimentally the relationship between the robustness of the expert policy and the order of Taylor expansion required in TaSIL, and compare standard Behavior Cloning, DART, and DAgger with TaSIL-loss-augmented variants. In all cases, we show significant improvement over baselines across a variety of MuJoCo tasks.
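To make the idea of penalizing higher-order Taylor terms concrete, here is a minimal sketch of a first-order variant of such a loss. It is an illustration under stated assumptions, not the authors' implementation: the function names (`jacobian_fd`, `tasil_loss_first_order`), the Euclidean/Frobenius norms, the `weight` parameter, and the toy `expert` and `wiggly` policies are all hypothetical, and the Jacobians are approximated with central finite differences where a practical implementation would more likely use automatic differentiation.

```python
import math
from typing import Callable, List, Sequence

Policy = Callable[[Sequence[float]], List[float]]


def jacobian_fd(policy: Policy, x: Sequence[float], eps: float = 1e-5) -> List[List[float]]:
    """Central finite-difference Jacobian (rows: action dims, columns: state dims)."""
    x = list(x)
    columns = []
    for j in range(len(x)):
        hi, lo = x[:], x[:]
        hi[j] += eps
        lo[j] -= eps
        columns.append([(a - b) / (2 * eps) for a, b in zip(policy(hi), policy(lo))])
    return [list(row) for row in zip(*columns)]  # transpose columns into rows


def l2(values: Sequence[float]) -> float:
    """Euclidean norm of a flat sequence."""
    return math.sqrt(sum(v * v for v in values))


def tasil_loss_first_order(learned: Policy, expert: Policy,
                           states: Sequence[Sequence[float]],
                           weight: float = 1.0) -> float:
    """Average over states of the action mismatch plus the weighted Jacobian mismatch.

    With weight=0 this reduces to a plain behavior-cloning loss (assumed L2 form).
    """
    total = 0.0
    for x in states:
        zeroth = l2([a - b for a, b in zip(learned(x), expert(x))])
        jac_learned, jac_expert = jacobian_fd(learned, x), jacobian_fd(expert, x)
        first = l2([a - b
                    for row_l, row_e in zip(jac_learned, jac_expert)
                    for a, b in zip(row_l, row_e)])
        total += zeroth + weight * first
    return total / len(states)


# Hypothetical toy policies on a 2-D state with a 1-D action.
def expert(x: Sequence[float]) -> List[float]:
    return [-2.0 * x[0] - x[1]]


def wiggly(x: Sequence[float]) -> List[float]:
    # Agrees with the expert at x[0] = 0 but has a different slope there.
    return [-2.0 * x[0] - x[1] + 0.1 * math.sin(10.0 * x[0])]


states = [[0.0, 0.0]]
bc_only = tasil_loss_first_order(wiggly, expert, states, weight=0.0)
with_jacobian = tasil_loss_first_order(wiggly, expert, states, weight=1.0)
```

At the origin the `wiggly` policy outputs exactly the expert's action, so the zeroth-order (behavior cloning) term in `bc_only` is zero, while the Jacobian term in `with_jacobian` is close to 1 and exposes the mismatch in how the two policies respond to small state perturbations.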


research
∙ 08/12/2022

Causal Imitation Learning with Unobserved Confounders

One of the common ways children learn is by mimicking adults. Imitation ...
research
∙ 05/23/2022

Data augmentation for efficient learning from parametric experts

We present a simple, yet powerful data-augmentation technique to enable ...
research
∙ 07/27/2023

Imitating Complex Trajectories: Bridging Low-Level Stability and High-Level Behavior

We propose a theoretical framework for studying the imitation of stochas...
research
∙ 07/28/2020

Learning Stable Manoeuvres in Quadruped Robots from Expert Demonstrations

With the research into development of quadruped robots picking up pace, ...
research
∙ 03/25/2021

Adversarial Imitation Learning with Trajectorial Augmentation and Correction

Deep Imitation Learning requires a large number of expert demonstrations...
research
∙ 02/18/2021

Closing the Closed-Loop Distribution Shift in Safe Imitation Learning

Commonly used optimization-based control strategies such as model-predic...
research
∙ 12/09/2020

Neural Rate Control for Video Encoding using Imitation Learning

In modern video encoders, rate control is a critical component and has b...
