Collective Robot Reinforcement Learning with Distributed Asynchronous Guided Policy Search

10/03/2016
by   Ali Yahya, et al.
0

In principle, reinforcement learning and policy search methods can enable robots to learn highly complex and general skills that may allow them to function amid the complexity and diversity of the real world. However, training a policy that generalizes well across a wide range of real-world conditions requires far greater quantity and diversity of experience than is practical to collect with a single robot. Fortunately, it is possible for multiple robots to share their experience with one another, and thereby, learn a policy collectively. In this work, we explore distributed and asynchronous policy learning as a means to achieve generalization and improved training times on challenging, real-world manipulation tasks. We propose a distributed and asynchronous version of Guided Policy Search and use it to demonstrate collective policy learning on a vision-based door opening task using four robots. We show that it achieves better generalization, utilization, and training times than the single robot alternative.

READ FULL TEXT

page 1

page 6

page 7

research
10/03/2016

Deep Reinforcement Learning for Robotic Manipulation with Asynchronous Off-Policy Updates

Reinforcement learning holds the promise of enabling autonomous robots t...
research
03/23/2022

Asynchronous Reinforcement Learning for Real-Time Control of Physical Robots

An oft-ignored challenge of real-world reinforcement learning is that th...
research
11/15/2018

Woulda, Coulda, Shoulda: Counterfactually-Guided Policy Search

Learning policies on data synthesized by models can in principle quench ...
research
04/02/2015

End-to-End Training of Deep Visuomotor Policies

Policy search methods can allow robots to learn control policies for a w...
research
10/26/2020

High Acceleration Reinforcement Learning for Real-World Juggling with Binary Rewards

Robots that can learn in the physical world will be important to en-able...
research
11/12/2020

Reinforcement Learning with Videos: Combining Offline Observations with Interaction

Reinforcement learning is a powerful framework for robots to acquire ski...
research
06/09/2022

Linear Delta Arrays for Compliant Dexterous Distributed Manipulation

This paper presents a new type of distributed dexterous manipulator: del...

Please sign up or login with your details

Forgot password? Click here to reset