Skill Disentanglement for Imitation Learning from Suboptimal Demonstrations

06/13/2023
by   Tianxiang Zhao, et al.
0

Imitation learning has achieved great success in many sequential decision-making tasks, in which a neural agent is learned by imitating collected human demonstrations. However, existing algorithms typically require a large number of high-quality demonstrations that are difficult and expensive to collect. Usually, a trade-off needs to be made between demonstration quality and quantity in practice. Targeting this problem, in this work we consider the imitation of sub-optimal demonstrations, with both a small clean demonstration set and a large noisy set. Some pioneering works have been proposed, but they suffer from many limitations, e.g., assuming a demonstration to be of the same optimality throughout time steps and failing to provide any interpretation w.r.t knowledge learned from the noisy set. Addressing these problems, we propose by evaluating and imitating at the sub-demonstration level, encoding action primitives of varying quality into different skills. Concretely, consists of a high-level controller to discover skills and a skill-conditioned module to capture action-taking policies, and is trained following a two-phase pipeline by first discovering skills with all demonstrations and then adapting the controller to only the clean set. A mutual-information-based regularization and a dynamic sub-demonstration optimality estimator are designed to promote disentanglement in the skill space. Extensive experiments are conducted over two gym environments and a real-world healthcare dataset to demonstrate the superiority of in learning from sub-optimal demonstrations and its improved interpretability by examining learned skills.

READ FULL TEXT
research
03/10/2021

Learning from Imperfect Demonstrations from Agents with Varying Dynamics

Imitation learning enables robots to learn from demonstrations. Previous...
research
05/09/2022

Disturbance-Injected Robust Imitation Learning with Task Achievement

Robust imitation learning using disturbance injections overcomes issues ...
research
01/28/2022

Transfering Hierarchical Structure with Dual Meta Imitation Learning

Hierarchical Imitation Learning (HIL) is an effective way for robots to ...
research
02/19/2018

Learning High-level Representations from Demonstrations

Hierarchical learning (HL) is key to solving complex sequential decision...
research
03/01/2019

GRP Model for Sensorimotor Learning

Learning from complex demonstrations is challenging, especially when the...
research
01/19/2023

Keyframe Demonstration Seeded and Bayesian Optimized Policy Search

This paper introduces a novel Learning from Demonstration framework to l...
research
08/31/2022

Let Me Check the Examples: Enhancing Demonstration Learning via Explicit Imitation

Demonstration learning aims to guide the prompt prediction via providing...

Please sign up or login with your details

Forgot password? Click here to reset