Model-based Behavioral Cloning with Future Image Similarity Learning

10/08/2019
by Alan Wu, et al.

We present a visual imitation learning framework that learns robot action policies solely from expert samples, without any robot trials. Robot exploration and on-policy trials in a real-world environment are often expensive or dangerous. We address this problem by learning a future scene prediction model solely from a collection of expert trajectories consisting of unlabeled example videos and actions, and by enabling generalized action cloning using future image similarity. The robot learns to visually predict the consequences of taking an action, and obtains its policy by evaluating how similar the predicted future image is to an expert image. We develop a stochastic action-conditioned convolutional autoencoder and show how to take advantage of future images for robot learning. We conduct experiments in simulated and real-life environments using a ground mobility robot, with and without obstacles, and compare our models to multiple baseline methods.
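The action-selection idea in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: `toy_predict_future` stands in for the learned action-conditioned prediction model, `negative_l2` stands in for the future image similarity measure, and the names, the discrete candidate action set, and the availability of an expert image at decision time are all assumptions made for illustration.

```python
import numpy as np


def select_action(current_image, expert_image, candidate_actions,
                  predict_future, similarity):
    """Pick the candidate action whose predicted future image is most
    similar to the expert image. Returns (best_action, all_scores)."""
    scores = [
        similarity(predict_future(current_image, action), expert_image)
        for action in candidate_actions
    ]
    best_index = int(np.argmax(scores))
    return candidate_actions[best_index], scores


# Hypothetical stand-in for a learned prediction model: here an action
# simply shifts the image horizontally by `action` pixels.
def toy_predict_future(image, action):
    return np.roll(image, shift=action, axis=1)


# Hypothetical similarity measure: negative L2 distance (higher is closer).
def negative_l2(image_a, image_b):
    return -float(np.linalg.norm(image_a - image_b))


rng = np.random.default_rng(0)
current = rng.random((8, 8))
# Pretend the expert's next frame corresponds to a shift of 2 pixels.
expert_next = np.roll(current, shift=2, axis=1)
actions = [-2, -1, 0, 1, 2]

best_action, scores = select_action(
    current, expert_next, actions, toy_predict_future, negative_l2
)
print("selected action:", best_action)
```

In this toy setup the selected action is the one that reproduces the expert frame exactly. A learned predictor and similarity function would replace both stand-ins.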


