Communication-Efficient Actor-Critic Methods for Homogeneous Markov Games

02/18/2022
by   Dingyang Chen, et al.
0

Recent success in cooperative multi-agent reinforcement learning (MARL) relies on centralized training and policy sharing. Centralized training eliminates the issue of non-stationarity MARL yet induces large communication costs, and policy sharing is empirically crucial to efficient learning in certain tasks yet lacks theoretical justification. In this paper, we formally characterize a subclass of cooperative Markov games where agents exhibit a certain form of homogeneity such that policy sharing provably incurs no suboptimality. This enables us to develop the first consensus-based decentralized actor-critic method where the consensus update is applied to both the actors and the critics while ensuring convergence. We also develop practical algorithms based on our decentralized actor-critic method to reduce the communication cost during training, while still yielding policies comparable with centralized training.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/17/2020

F2A2: Flexible Fully-decentralized Approximate Actor-critic for Cooperative Multi-agent Reinforcement Learning

Traditional centralized multi-agent reinforcement learning (MARL) algori...
research
03/15/2019

Policy Distillation and Value Matching in Multiagent Reinforcement Learning

Multiagent reinforcement learning algorithms (MARL) have been demonstrat...
research
05/08/2019

Actor-Critic Algorithms for Constrained Multi-agent Reinforcement Learning

In cooperative stochastic games multiple agents work towards learning jo...
research
08/23/2023

E(3)-Equivariant Actor-Critic Methods for Cooperative Multi-Agent Reinforcement Learning

Identification and analysis of symmetrical patterns in the natural world...
research
09/08/2021

Sample and Communication-Efficient Decentralized Actor-Critic Algorithms with Finite-Time Analysis

Actor-critic (AC) algorithms have been widely adopted in decentralized m...
research
07/01/2020

Developing cooperative policies for multi-stage tasks

This paper proposes the Cooperative Soft Actor Critic (CSAC) method of e...
research
07/25/2022

Cooperative Actor-Critic via TD Error Aggregation

In decentralized cooperative multi-agent reinforcement learning, agents ...

Please sign up or login with your details

Forgot password? Click here to reset