Augmented Behavioral Cloning from Observation

04/28/2020
by   Juarez Monteiro, et al.
0

Imitation from observation is a computational technique that teaches an agent on how to mimic the behavior of an expert by observing only the sequence of states from the expert demonstrations. Recent approaches learn the inverse dynamics of the environment and an imitation policy by interleaving epochs of both models while changing the demonstration data. However, such approaches often get stuck into sub-optimal solutions that are distant from the expert, limiting their imitation effectiveness. We address this problem with a novel approach that overcomes the problem of reaching bad local minima by exploring: (I) a self-attention mechanism that better captures global features of the states; and (ii) a sampling strategy that regulates the observations that are used for learning. We show empirically that our approach outperforms the state-of-the-art approaches in four different environments by a large margin.

READ FULL TEXT
research
08/13/2020

Imitating Unknown Policies via Exploration

Behavioral cloning is an imitation learning technique that teaches an ag...
research
05/22/2019

Imitation Learning from Video by Leveraging Proprioception

Classically, imitation learning algorithms have been developed for ideal...
research
06/18/2019

RIDM: Reinforced Inverse Dynamics Modeling for Learning from a Single Observed Demonstration

Imitation learning has long been an approach to alleviate the tractabili...
research
02/13/2023

Imitation from Observation With Bootstrapped Contrastive Learning

Imitation from observation (IfO) is a learning paradigm that consists of...
research
04/21/2023

Self-Supervised Adversarial Imitation Learning

Behavioural cloning is an imitation learning technique that teaches an a...
research
08/04/2020

An Imitation from Observation Approach to Sim-to-Real Transfer

The sim to real transfer problem deals with leveraging large amounts of ...
research
01/03/2023

DADAgger: Disagreement-Augmented Dataset Aggregation

DAgger is an imitation algorithm that aggregates its original datasets b...

Please sign up or login with your details

Forgot password? Click here to reset