Scalable and transferable learning of algorithms via graph embedding for multi-robot reward collection

05/29/2019
by Hyunwook Kang, et al.

Can the success of reinforcement learning methods for combinatorial optimization problems be extended to multi-robot scheduling problems in stochastic contexts? Three issues are particularly important in this context: the quality of the resulting decisions, scalability, and transferability. To address them, we generalize the concept of clique potential to stochastic clique potential. We extend a mean field inference fixed-point iteration with this new concept and use it to modify the structure2vec method. We then propose a new reinforcement learning framework that combines a graph representation of the problem with a consensus auction inspired by heuristics in the problem domain. This representation enables transferability in terms of the number of robots. Sequential encoding of information through multiple layers of our extended structure2vec results in 96% [...] While training tractability is inherited from single-robot methods in the literature, a multi-robot consensus auction-based relaxation of the maximum operation in the Bellman optimality equation allows for scalable selection of actions in the fitted Q-iteration. We apply our framework to multi-robot reward collection (MRRC) problems in stochastic environments with linear or non-linear rewards. In stochastic environments with non-linear rewards, the new method achieves 20% [...] popular sequential greedy assignment (SGA) algorithm. Linear scalability in terms of training is achieved and demonstrated. Transferability is demonstrated by a heuristic trained with three robots that continues to achieve 95% [...] robots. We also report results obtained when extending the approach to identical parallel machine scheduling (IPMS) problems.
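For readers unfamiliar with the sequential greedy assignment baseline named in the abstract, the following is a minimal sketch of one common form of it: repeatedly commit the highest-scoring free robot-task pair. It is not the authors' implementation, and it is not their consensus auction. The function name, the score matrix `q`, and the tie-breaking rule are assumptions made for illustration only.

```python
from typing import Dict, List


def sequential_greedy_assignment(q: List[List[float]]) -> Dict[int, int]:
    """Greedily match robots to tasks.

    q[r][t] is a hypothetical score (for example, an estimated Q-value)
    for giving task t to robot r. Returns a dict robot -> task.
    """
    free_robots = set(range(len(q)))
    free_tasks = set(range(len(q[0]))) if q else set()
    assignment: Dict[int, int] = {}

    # Each round commits the single best remaining pair, so there are at
    # most min(#robots, #tasks) rounds and no joint assignment is enumerated.
    while free_robots and free_tasks:
        r, t = max(
            ((r, t) for r in free_robots for t in free_tasks),
            # Assumed tie-break: prefer lower robot index, then lower task index.
            key=lambda pair: (q[pair[0]][pair[1]], -pair[0], -pair[1]),
        )
        assignment[r] = t
        free_robots.remove(r)
        free_tasks.remove(t)
    return assignment


if __name__ == "__main__":
    scores = [[5.0, 1.0], [4.0, 3.0]]
    print(sequential_greedy_assignment(scores))
```

The point of the sketch is the cost profile: each round scans only the remaining robot-task pairs rather than every joint assignment, which is the kind of tractable action selection the abstract contrasts with an exact maximum over joint actions.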


