High Acceleration Reinforcement Learning for Real-World Juggling with Binary Rewards

10/26/2020
by   Kai Ploeger, et al.
0

Robots that can learn in the physical world will be important to en-able robots to escape their stiff and pre-programmed movements. For dynamic high-acceleration tasks, such as juggling, learning in the real-world is particularly challenging as one must push the limits of the robot and its actuation without harming the system, amplifying the necessity of sample efficiency and safety for robot learning algorithms. In contrast to prior work which mainly focuses on the learning algorithm, we propose a learning system, that directly incorporates these requirements in the design of the policy representation, initialization, and optimization. We demonstrate that this system enables the high-speed Barrett WAM manipulator to learn juggling two balls from 56 minutes of experience with a binary reward signal. The final policy juggles continuously for up to 33 minutes or about 4500 repeated catches. The videos documenting the learning process and the evaluation can be found at https://sites.google.com/view/jugglingbot

READ FULL TEXT

page 2

page 6

research
06/28/2022

DayDreamer: World Models for Physical Robot Learning

To solve tasks in complex environments, robots need to learn from experi...
research
10/07/2022

GoalsEye: Learning High Speed Precision Table Tennis on a Physical Robot

Learning goal conditioned control in the real world is a challenging ope...
research
10/03/2016

Collective Robot Reinforcement Learning with Distributed Asynchronous Guided Policy Search

In principle, reinforcement learning and policy search methods can enabl...
research
04/17/2023

Continuous Versatile Jumping Using Learned Action Residuals

Jumping is essential for legged robots to traverse through difficult ter...
research
12/10/2019

AVID: Learning Multi-Stage Tasks via Pixel-Level Translation of Human Videos

Robotic reinforcement learning (RL) holds the promise of enabling robots...
research
10/09/2020

LaND: Learning to Navigate from Disengagements

Consistently testing autonomous mobile robots in real world scenarios is...
research
02/18/2019

DIViS: Domain Invariant Visual Servoing for Collision-Free Goal Reaching

Robots should understand both semantics and physics to be functional in ...

Please sign up or login with your details

Forgot password? Click here to reset