Learning from Observations Using a Single Video Demonstration and Human Feedback

09/29/2019
by   Sunil Gandhi, et al.
0

In this paper, we present a method for learning from video demonstrations by using human feedback to construct a mapping between the standard representation of the agent and the visual representation of the demonstration. In this way, we leverage the advantages of both these representations, i.e., we learn the policy using standard state representations, but are able to specify the expected behavior using video demonstration. We train an autonomous agent using a single video demonstration and use human feedback (using numerical similarity rating) to map the standard representation to the visual representation with a neural network. We show the effectiveness of our method by teaching a hopper agent in the MuJoCo to perform a backflip using a single video demonstration generated in MuJoCo as well as from a real-world YouTube video of a person performing a backflip. Additionally, we show that our method can transfer to new tasks, such as hopping, with very little human feedback.

READ FULL TEXT

page 5

page 6

research
09/22/2022

Minimizing Human Assistance: Augmenting a Single Demonstration for Deep Reinforcement Learning

The use of human demonstrations in reinforcement learning has proven to ...
research
07/06/2022

Transformers are Adaptable Task Planners

Every home is different, and every person likes things done in their par...
research
07/10/2018

Neural Task Graphs: Generalizing to Unseen Tasks from a Single Video Demonstration

Our goal is for a robot to execute a previously unseen task based on a s...
research
09/18/2023

One ACT Play: Single Demonstration Behavior Cloning with Action Chunking Transformers

Learning from human demonstrations (behavior cloning) is a cornerstone o...
research
09/28/2020

The EMPATHIC Framework for Task Learning from Implicit Human Feedback

Reactions such as gestures, facial expressions, and vocalizations are an...
research
05/19/2023

Characterizing tradeoffs between teaching via language and demonstrations in multi-agent systems

Humans teach others about the world through language and demonstration. ...
research
11/21/2019

Third-Person Visual Imitation Learning via Decoupled Hierarchical Controller

We study a generalized setup for learning from demonstration to build an...

Please sign up or login with your details

Forgot password? Click here to reset