Improving Robot-Centric Learning from Demonstration via Personalized Embeddings

10/07/2021
by   Mariah L. Schrum, et al.
0

Learning from demonstration (LfD) techniques seek to enable novice users to teach robots novel tasks in the real world. However, prior work has shown that robot-centric LfD approaches, such as Dataset Aggregation (DAgger), do not perform well with human teachers. DAgger requires a human demonstrator to provide corrective feedback to the learner either in real-time, which can result in degraded performance due to suboptimal human labels, or in a post hoc manner which is time intensive and often not feasible. To address this problem, we present Mutual Information-driven Meta-learning from Demonstration (MIND MELD), which meta-learns a mapping from poor quality human labels to predicted ground truth labels, thereby improving upon the performance of prior LfD approaches for DAgger-based training. The key to our approach for improving upon suboptimal feedback is mutual information maximization via variational inference. Our approach learns a meaningful, personalized embedding via variational inference which informs the mapping from human provided labels to predicted ground truth labels. We demonstrate our framework in a synthetic domain and in a human-subjects experiment, illustrating that our approach improves upon the corrective labels provided by a human demonstrator by 63

READ FULL TEXT
research
05/02/2023

Long-Tailed Recognition by Mutual Information Maximization between Latent Features and Ground-Truth Labels

Although contrastive learning methods have shown prevailing performance ...
research
10/17/2020

Learning from Suboptimal Demonstration via Self-Supervised Reward Regression

Learning from Demonstration (LfD) seeks to democratize robotics by enabl...
research
08/29/2021

Autonomous Curiosity for Real-Time Training Onboard Robotic Agents

Learning requires both study and curiosity. A good learner is not only g...
research
05/24/2022

First Contact: Unsupervised Human-Machine Co-Adaptation via Mutual Information Maximization

How can we train an assistive human-machine interface (e.g., an electrom...
research
06/08/2016

Exploring Implicit Human Responses to Robot Mistakes in a Learning from Demonstration Task

As robots enter human environments, they will be expected to accomplish ...
research
03/14/2019

Inferring Personalized Bayesian Embeddings for Learning from Heterogeneous Demonstration

For assistive robots and virtual agents to achieve ubiquity, machines wi...
research
01/27/2020

Heterogeneous Learning from Demonstration

The development of human-robot systems able to leverage the strengths of...

Please sign up or login with your details

Forgot password? Click here to reset