End-to-End Training of Deep Visuomotor Policies

04/02/2015
by   Sergey Levine, et al.
0

Policy search methods can allow robots to learn control policies for a wide range of tasks, but practical applications of policy search often require hand-engineered components for perception, state estimation, and low-level control. In this paper, we aim to answer the following question: does training the perception and control systems jointly end-to-end provide better performance than training each component separately? To this end, we develop a method that can be used to learn policies that map raw image observations directly to torques at the robot's motors. The policies are represented by deep convolutional neural networks (CNNs) with 92,000 parameters, and are trained using a partially observed guided policy search method, which transforms policy search into supervised learning, with supervision provided by a simple trajectory-centric reinforcement learning method. We evaluate our method on a range of real-world manipulation tasks that require close coordination between vision and control, such as screwing a cap onto a bottle, and present simulated comparisons to a range of prior policy search methods.

READ FULL TEXT

page 2

page 7

page 8

page 14

page 18

page 20

page 22

page 24

research
09/15/2018

Learning Robust Manipulation Skills with Guided Policy Search via Generative Motor Reflexes

Guided Policy Search enables robots to learn control policies for comple...
research
09/20/2018

Zero-shot Sim-to-Real Transfer with Modular Priors

Current end-to-end Reinforcement Learning (RL) approaches are severely l...
research
10/03/2016

Collective Robot Reinforcement Learning with Distributed Asynchronous Guided Policy Search

In principle, reinforcement learning and policy search methods can enabl...
research
03/10/2019

Affordance Learning for End-to-End Visuomotor Robot Control

Training end-to-end deep robot policies requires a lot of domain-, task-...
research
01/18/2022

Programmatic Policy Extraction by Iterative Local Search

Reinforcement learning policies are often represented by neural networks...
research
11/13/2016

CAD2RL: Real Single-Image Flight without a Single Real Image

Deep reinforcement learning has emerged as a promising and powerful tech...
research
07/09/2021

ARC: Adversarially Robust Control Policies for Autonomous Vehicles

Deep neural networks have demonstrated their capability to learn control...

Please sign up or login with your details

Forgot password? Click here to reset