Semi-Supervised Imitation Learning with Mixed Qualities of Demonstrations for Autonomous Driving

09/23/2021
by   Gunmin Lee, et al.
0

In this paper, we consider the problem of autonomous driving using imitation learning in a semi-supervised manner. In particular, both labeled and unlabeled demonstrations are leveraged during training by estimating the quality of each unlabeled demonstration. If the provided demonstrations are corrupted and have a low signal-to-noise ratio, the performance of the imitation learning agent can be degraded significantly. To mitigate this problem, we propose a method called semi-supervised imitation learning (SSIL). SSIL first learns how to discriminate and evaluate each state-action pair's reliability in unlabeled demonstrations by assigning higher reliability values to demonstrations similar to labeled expert demonstrations. This reliability value is called leverage. After this discrimination process, both labeled and unlabeled demonstrations with estimated leverage values are utilized while training the policy in a semi-supervised manner. The experimental results demonstrate the validity of the proposed algorithm using unlabeled trajectories with mixed qualities. Moreover, the hardware experiments using an RC car are conducted to show that the proposed method can be applied to real-world applications.

READ FULL TEXT

page 1

page 6

research
02/13/2023

Unlabeled Imperfect Demonstrations in Adversarial Imitation Learning

Adversarial imitation learning has become a widely used imitation learni...
research
05/05/2022

Semi-Supervised Imitation Learning of Team Policies from Suboptimal Demonstrations

We present Bayesian Team Imitation Learner (BTIL), an imitation learning...
research
09/15/2019

VILD: Variational Imitation Learning with Diverse-quality Demonstrations

The goal of imitation learning (IL) is to learn a good policy from high-...
research
08/03/2020

Concurrent Training Improves the Performance of Behavioral Cloning from Observation

Learning from demonstration is widely used as an efficient way for robot...
research
04/06/2023

End-to-end Manipulator Calligraphy Planning via Variational Imitation Learning

Planning from demonstrations has shown promising results with the advanc...
research
06/07/2023

Divide and Repair: Using Options to Improve Performance of Imitation Learning Against Adversarial Demonstrations

We consider the problem of learning to perform a task from demonstration...
research
09/27/2022

Follow The Rules: Online Signal Temporal Logic Tree Search for Guided Imitation Learning in Stochastic Domains

Seamlessly integrating rules in Learning-from-Demonstrations (LfD) polic...

Please sign up or login with your details

Forgot password? Click here to reset