Sample-efficient Deep Reinforcement Learning with Imaginary Rollouts for Human-Robot Interaction

08/15/2019
by   Mohammad Thabet, et al.
0

Deep reinforcement learning has proven to be a great success in allowing agents to learn complex tasks. However, its application to actual robots can be prohibitively expensive. Furthermore, the unpredictability of human behavior in human-robot interaction tasks can hinder convergence to a good policy. In this paper, we present an architecture that allows agents to learn models of stochastic environments and use them to accelerate learning. We descirbe how an environment model can be learned online and used to generate synthetic transitions, as well as how an agent can leverage these synthetic data to accelerate learning. We validate our approach using an experiment in which a robotic arm has to complete a task composed of a series of actions based on human gestures. Results show that our approach leads to significantly faster learning, requiring much less interaction with the environment. Furthermore, we demonstrate how learned models can be used by a robot to produce optimal plans in real world applications.

READ FULL TEXT

page 1

page 4

page 6

research
07/07/2020

Deep Reinforcement Learning with Interactive Feedback in a Human-Robot Environment

Robots are extending their presence in domestic environments every day, ...
research
09/13/2019

Petri Net Machines for Human-Agent Interaction

Smart speakers and robots become ever more prevalent in our daily lives....
research
07/22/2023

On-Robot Bayesian Reinforcement Learning for POMDPs

Robot learning is often difficult due to the expense of gathering data. ...
research
08/06/2019

Batch Recurrent Q-Learning for Backchannel Generation Towards Engaging Agents

The ability to generate appropriate verbal and non-verbal backchannels b...
research
09/11/2023

Physics-informed reinforcement learning via probabilistic co-adjustment functions

Reinforcement learning of real-world tasks is very data inefficient, and...
research
06/28/2022

DayDreamer: World Models for Physical Robot Learning

To solve tasks in complex environments, robots need to learn from experi...

Please sign up or login with your details

Forgot password? Click here to reset