Solving Challenging Dexterous Manipulation Tasks With Trajectory Optimisation and Reinforcement Learning

09/09/2020
by   Henry Charlesworth, et al.
25

Training agents to autonomously learn how to use anthropomorphic robotic hands has the potential to lead to systems capable of performing a multitude of complex manipulation tasks in unstructured and uncertain environments. In this work, we first introduce a suite of challenging simulated manipulation tasks that current reinforcement learning and trajectory optimisation techniques find difficult. These include environments where two simulated hands have to pass or throw objects between each other, as well as an environment where the agent must learn to spin a long pen between its fingers. We then introduce a simple trajectory optimisation that performs significantly better than existing methods on these environments. Finally, on the challenging PenSpin task we combine sub-optimal demonstrations generated through trajectory optimisation with off-policy reinforcement learning, obtaining performance that far exceeds either of these approaches individually, effectively solving the environment.

READ FULL TEXT

page 3

page 6

research
09/15/2023

Sim-to-Real Brush Manipulation using Behavior Cloning and Reinforcement Learning

Developing proficient brush manipulation capabilities in real-world scen...
research
03/29/2023

Learning Excavation of Rigid Objects with Offline Reinforcement Learning

Autonomous excavation is a challenging task. The unknown contact dynamic...
research
09/09/2021

Self-supervised Reinforcement Learning with Independently Controllable Subgoals

To successfully tackle challenging manipulation tasks, autonomous agents...
research
11/04/2021

Learning to Manipulate Tools by Aligning Simulation to Video Demonstration

A seamless integration of robots into human environments requires robots...
research
09/14/2021

Few-shot Quality-Diversity Optimisation

In the past few years, a considerable amount of research has been dedica...
research
08/08/2023

Path Signatures for Diversity in Probabilistic Trajectory Optimisation

Motion planning can be cast as a trajectory optimisation problem where a...
research
10/03/2022

Hierarchical reinforcement learning for in-hand robotic manipulation using Davenport chained rotations

End-to-end reinforcement learning techniques are among the most successf...

Please sign up or login with your details

Forgot password? Click here to reset