Imitating Unknown Policies via Exploration

08/13/2020
by   Nathan Gavenski, et al.
10

Behavioral cloning is an imitation learning technique that teaches an agent how to behave through expert demonstrations. Recent approaches use self-supervision of fully-observable unlabeled snapshots of the states to decode state-pairs into actions. However, the iterative learning scheme from these techniques are prone to getting stuck into bad local minima. We address these limitations incorporating a two-phase model into the original framework, which learns from unlabeled observations via exploration, substantially improving traditional behavioral cloning by exploiting (i) a sampling mechanism to prevent bad local minima, (ii) a sampling mechanism to improve exploration, and (iii) self-attention modules to capture global features. The resulting technique outperforms the previous state-of-the-art in four different environments by a large margin.

READ FULL TEXT

page 8

page 14

research
04/28/2020

Augmented Behavioral Cloning from Observation

Imitation from observation is a computational technique that teaches an ...
research
04/21/2023

Self-Supervised Adversarial Imitation Learning

Behavioural cloning is an imitation learning technique that teaches an a...
research
07/05/2022

Planning with RL and episodic-memory behavioral priors

The practical application of learning agents requires sample efficient a...
research
12/30/2022

Learning from Guided Play: Improving Exploration for Adversarial Imitation Learning with Simple Auxiliary Tasks

Adversarial imitation learning (AIL) has become a popular alternative to...
research
05/27/2019

SQIL: Imitation Learning via Regularized Behavioral Cloning

Learning to imitate expert behavior given action demonstrations containi...
research
02/26/2023

Diffusion Model-Augmented Behavioral Cloning

Imitation learning addresses the challenge of learning by observing an e...

Please sign up or login with your details

Forgot password? Click here to reset