Hardware Conditioned Policies for Multi-Robot Transfer Learning

11/24/2018
by   Tao Chen, et al.
0

Deep reinforcement learning could be used to learn dexterous robotic policies but it is challenging to transfer them to new robots with vastly different hardware properties. It is also prohibitively expensive to learn a new policy from scratch for each robot hardware due to the high sample complexity of modern state-of-the-art algorithms. We propose a novel approach called Hardware Conditioned Policies where we train a universal policy conditioned on a vector representation of robot hardware. We considered robots in simulation with varied dynamics, kinematic structure, kinematic lengths and degrees-of-freedom. First, we use the kinematic structure directly as the hardware encoding and show great zero-shot transfer to completely novel robots not seen during training. For robots with lower zero-shot success rate, we also demonstrate that fine-tuning the policy network is significantly more sample-efficient than training a model from scratch. In tasks where knowing the agent dynamics is important for success, we learn an embedding for robot hardware and show that policies conditioned on the encoding of hardware tend to generalize and transfer well. Videos of experiments are available at: https://sites.google.com/view/robot-transfer-hcp.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/11/2019

Zero-shot generalization using cascaded system-representations

This paper proposes a new framework named CASNET to learn control polici...
research
01/29/2023

Zero-Shot Transfer of Haptics-Based Object Insertion Policies

Humans naturally exploit haptic feedback during contact-rich tasks like ...
research
07/19/2021

Know Thyself: Transferable Visuomotor Control Through Robot-Awareness

Training visuomotor robot controllers from scratch on a new robot typica...
research
07/31/2023

Discovering Adaptable Symbolic Algorithms from Scratch

Autonomous robots deployed in the real world will need control policies ...
research
12/14/2018

Simulation to scaled city: zero-shot policy transfer for traffic control via autonomous vehicles

Using deep reinforcement learning, we train control policies for autonom...
research
07/06/2020

Jump Operator Planning: Goal-Conditioned Policy Ensembles and Zero-Shot Transfer

In Hierarchical Control, compositionality, abstraction, and task-transfe...
research
03/10/2019

Affordance Learning for End-to-End Visuomotor Robot Control

Training end-to-end deep robot policies requires a lot of domain-, task-...

Please sign up or login with your details

Forgot password? Click here to reset