Efficient Model Learning for Human-Robot Collaborative Tasks

05/24/2014
by Stefanos Nikolaidis, et al.

We present a framework for learning human user models from joint-action demonstrations that enables the robot to compute a robust policy for a collaborative task with a human. The learning takes place completely automatically, without any human intervention. First, we describe the clustering of demonstrated action sequences into different human types using an unsupervised learning algorithm. The robot also uses these demonstrated sequences to learn a reward function that is representative of each type, using an inverse reinforcement learning algorithm. The learned model is then used as part of a Mixed Observability Markov Decision Process (MOMDP) formulation, wherein the human type is a partially observable variable. With this framework, we can infer, either offline or online, the human type of a new user who was not included in the training set, and can compute a policy for the robot that is aligned with the preference of this new user and is robust to deviations of the human actions from prior demonstrations. Finally, we validate the approach using data collected in human subject experiments, and conduct proof-of-concept demonstrations in which a person performs a collaborative task with a small industrial robot.
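
The online inference of a partially observable human type can be illustrated with a simple Bayesian belief update. The sketch below is not the authors' implementation: it assumes a fixed, state-independent probability of each human action given each type, whereas a Mixed Observability Markov Decision Process would condition on the task state and on the learned per-type model. The type names, action names, and probabilities are hypothetical values chosen for illustration.

```python
from typing import Dict, List

Belief = Dict[str, float]
ActionModel = Dict[str, Dict[str, float]]


def update_type_belief(belief: Belief, likelihoods: ActionModel, action: str) -> Belief:
    """One Bayes step: posterior(type) is proportional to prior(type) * P(action | type)."""
    unnormalized = {
        t: belief[t] * likelihoods[t].get(action, 0.0) for t in belief
    }
    total = sum(unnormalized.values())
    if total == 0.0:
        # The action is impossible under every type; keep the prior unchanged.
        return dict(belief)
    return {t: p / total for t, p in unnormalized.items()}


def infer_type(prior: Belief, likelihoods: ActionModel, actions: List[str]) -> Belief:
    """Fold a sequence of observed human actions into the belief over types."""
    belief = dict(prior)
    for action in actions:
        belief = update_type_belief(belief, likelihoods, action)
    return belief


# Hypothetical per-type action probabilities (assumed, not taken from the paper).
LIKELIHOODS: ActionModel = {
    "type_a": {"lead": 0.8, "follow": 0.2},
    "type_b": {"lead": 0.3, "follow": 0.7},
}
PRIOR: Belief = {"type_a": 0.5, "type_b": 0.5}

posterior = infer_type(PRIOR, LIKELIHOODS, ["lead", "lead", "follow"])
```

Because the type is assumed not to change during the task, the belief only sharpens as more actions are observed. A robot policy computed over this belief can then hedge across types while the belief is still uncertain.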


