Zero-shot generalization using cascaded system-representations

12/11/2019
by Ashish Malik, et al.

This paper proposes a new framework named CASNET to learn control policies that generalize over similar robot types with different morphologies. The proposed framework leverages the structural similarities in robots to learn general-purpose system-representations. These representations can then be used with a learning algorithm of choice to learn policies that generalize over different robots. The learned policies can be used to design general-purpose robot-controllers that are applicable to a wide variety of robots. We demonstrate the effectiveness of the proposed framework by learning control policies for two separate domains: planar manipulation and legged locomotion. The policy learned for planar manipulation is capable of controlling planar manipulators with varying degrees of freedom and link-lengths. For legged locomotion, the learned policy generalizes over different morphologies of crawling robots. These policies perform on par with expert policies trained for individual robot models and achieve zero-shot generalization on models unseen during training, establishing that the final performance of the general policy is bottlenecked by the learning algorithm rather than the proposed framework.
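To make the idea of a morphology-independent representation concrete, here is a minimal sketch of how manipulators with different numbers of links could be mapped to a vector of the same size, so that a single policy input layer fits all of them. This is an illustration only and not the CASNET architecture, which the abstract does not specify: the recurrent fold, the per-link features (link length, joint angle), the names `encode_robot`, `REP_SIZE`, and `LINK_FEATURES`, and the random (untrained) weights are all assumptions made for the example.

```python
import math
import random

REP_SIZE = 4       # hypothetical size of the fixed-length representation
LINK_FEATURES = 2  # assumed per-link input: (link length, joint angle)


def make_weights(rows, cols, rng):
    """Return a rows x cols matrix of random weights (stand-in for learned ones)."""
    return [[rng.uniform(-1.0, 1.0) for _ in range(cols)] for _ in range(rows)]


def encode_robot(links, w_in, w_rec):
    """Fold a variable-length list of links into one fixed-size vector.

    The same weights are applied to every link, so the function accepts
    any number of links and always returns REP_SIZE values.
    """
    state = [0.0] * REP_SIZE
    for link in links:  # iterate from base to tip
        state = [
            math.tanh(
                sum(w_in[i][j] * link[j] for j in range(LINK_FEATURES))
                + sum(w_rec[i][k] * state[k] for k in range(REP_SIZE))
            )
            for i in range(REP_SIZE)
        ]
    return state


rng = random.Random(0)
W_IN = make_weights(REP_SIZE, LINK_FEATURES, rng)
W_REC = make_weights(REP_SIZE, REP_SIZE, rng)

# Two planar manipulators with different degrees of freedom and link lengths.
two_link = [(0.5, 0.1), (0.3, -0.4)]
five_link = [(0.2, 0.0), (0.2, 0.3), (0.4, -0.2), (0.1, 0.5), (0.3, 0.1)]

rep_a = encode_robot(two_link, W_IN, W_REC)
rep_b = encode_robot(five_link, W_IN, W_REC)
print(len(rep_a), len(rep_b))  # both representations have REP_SIZE entries
```

Because both robots yield a vector of the same length, a downstream policy network could take either as input without modification. In a real system the weights would be trained jointly with the policy rather than drawn at random.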


