A Framework for Learning from Demonstration with Minimal Human Effort

06/15/2023
by   Marc Rigter, et al.
8

We consider robot learning in the context of shared autonomy, where control of the system can switch between a human teleoperator and autonomous control. In this setting we address reinforcement learning, and learning from demonstration, where there is a cost associated with human time. This cost represents the human time required to teleoperate the robot, or recover the robot from failures. For each episode, the agent must choose between requesting human teleoperation, or using one of its autonomous controllers. In our approach, we learn to predict the success probability for each controller, given the initial state of an episode. This is used in a contextual multi-armed bandit algorithm to choose the controller for the episode. A controller is learnt online from demonstrations and reinforcement learning so that autonomous performance improves, and the system becomes less reliant on the teleoperator with more experience. We show that our approach to controller selection reduces the human cost to perform two simulated tasks and a single real-world task.

READ FULL TEXT
research
04/12/2021

Risk-Averse Biased Human Policies in Assistive Multi-Armed Bandit Settings

Assistive multi-armed bandit problems can be used to model team situatio...
research
04/20/2023

Aiding reinforcement learning for set point control

While reinforcement learning has made great improvements, state-of-the-a...
research
07/10/2021

Informing Real-time Corrections in Corrective Shared Autonomy Through Expert Demonstrations

Corrective Shared Autonomy is a method where human corrections are layer...
research
07/23/2020

Semi-supervised Learning From Demonstration Through Program Synthesis: An Inspection Robot Case Study

Semi-supervised learning improves the performance of supervised machine ...
research
03/09/2021

I am Robot: Neuromuscular Reinforcement Learning to Actuate Human Limbs through Functional Electrical Stimulation

Human movement disorders or paralysis lead to the loss of control of mus...
research
01/15/2021

Deep Reinforcement Learning for Haptic Shared Control in Unknown Tasks

Recent years have shown a growing interest in using haptic shared contro...
research
03/23/2021

Neural Network Controller for Autonomous Pile Loading Revised

We have recently proposed two pile loading controllers that learn from h...

Please sign up or login with your details

Forgot password? Click here to reset