Distributed Cooperative Multi-Agent Reinforcement Learning with Directed Coordination Graph

01/10/2022
by Gangshan Jing, et al.

Existing distributed cooperative multi-agent reinforcement learning (MARL) frameworks usually assume undirected coordination and communication graphs, and estimate a global reward via consensus algorithms for policy evaluation. Such frameworks may incur high communication costs and scale poorly because they require global consensus. In this work, we study MARL with directed coordination graphs and propose a distributed RL algorithm in which local policy evaluations are based on local value functions. Each agent obtains its local value function by communicating with its neighbors through a directed, learning-induced communication graph, without using any consensus algorithm. A zeroth-order optimization (ZOO) approach based on parameter perturbation is employed for gradient estimation. Through comparison with existing ZOO-based RL algorithms, we show that the proposed distributed RL algorithm guarantees high scalability. A distributed resource allocation example illustrates the effectiveness of the algorithm.
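To make the idea of gradient estimation by parameter perturbation concrete, the following is a minimal sketch of a generic two-point zeroth-order estimator with Gaussian perturbations. It is not the algorithm proposed in the paper: the abstract does not specify the estimator, and the names `zo_gradient`, `quadratic_cost`, `radius`, and `num_samples` are assumptions introduced only for illustration. The toy cost stands in for a value function that can be evaluated but not differentiated.

```python
import random


def zo_gradient(f, theta, radius=1e-2, num_samples=2000, seed=0):
    """Two-point zeroth-order estimate of the gradient of f at theta.

    Hypothetical helper for illustration: f is only queried for values,
    never differentiated.
    """
    rng = random.Random(seed)
    d = len(theta)
    grad = [0.0] * d
    for _ in range(num_samples):
        # Draw a random Gaussian perturbation direction.
        u = [rng.gauss(0.0, 1.0) for _ in range(d)]
        # Evaluate the cost at the two perturbed parameter vectors.
        plus = f([t + radius * ui for t, ui in zip(theta, u)])
        minus = f([t - radius * ui for t, ui in zip(theta, u)])
        # Finite difference along u, averaged over all samples.
        scale = (plus - minus) / (2.0 * radius)
        for i in range(d):
            grad[i] += scale * u[i] / num_samples
    return grad


def quadratic_cost(theta):
    """Toy cost (an assumption); its exact gradient is 2 * theta."""
    return sum(t * t for t in theta)


estimate = zo_gradient(quadratic_cost, [1.0, -2.0])
print(estimate)  # a noisy estimate of the exact gradient [2.0, -4.0]
```

The estimate is noisy, and its variance grows with the dimension of the perturbed parameter vector. That is one reason an estimator driven by local value functions, rather than a single global one, can be expected to scale better.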
