Graph Policy Gradients for Large Scale Robot Control

07/08/2019
by   Arbaaz Khan, et al.
0

In this paper, we consider the problem of learning policies to control a large number of homogeneous robots. To this end, we propose a new algorithm we call Graph Policy Gradients (GPG) that exploits the underlying graph symmetry among the robots. The curse of dimensionality one encounters when working with a large number of robots is mitigated by employing a graph convolutional neural (GCN) network to parametrize policies for the robots. The GCN reduces the dimensionality of the problem by learning filters that aggregate information among robots locally, similar to how a convolutional neural network is able to learn local features in an image. Through experiments on formation flying, we show that our proposed method is able to scale better than existing reinforcement methods that employ fully connected networks. More importantly, we show that by using our locally learned filters we are able to zero-shot transfer policies trained on just three robots to over hundred robots.

READ FULL TEXT

page 8

page 12

research
09/24/2019

Graph Policy Gradients for Large Scale Unlabeled Motion Planning with Constraints

In this paper, we present a learning method to solve the unlabelled moti...
research
02/11/2021

Large Scale Distributed Collaborative Unlabeled Motion Planning with Graph Policy Gradients

In this paper, we present a learning method to solve the unlabelled moti...
research
12/11/2019

Zero-shot generalization using cascaded system-representations

This paper proposes a new framework named CASNET to learn control polici...
research
03/21/2018

Zero-shot Recognition via Semantic Embeddings and Knowledge Graphs

We consider the problem of zero-shot recognition: learning a visual clas...
research
03/10/2021

RMP2: A Structured Composable Policy Class for Robot Learning

We consider the problem of learning motion policies for acceleration-bas...
research
06/12/2020

Similarity-based transfer learning of decision policies

A problem of learning decision policy from past experience is considered...
research
07/31/2023

Discovering Adaptable Symbolic Algorithms from Scratch

Autonomous robots deployed in the real world will need control policies ...

Please sign up or login with your details

Forgot password? Click here to reset