Scalable Communication for Multi-Agent Reinforcement Learning via Transformer-Based Email Mechanism

01/05/2023
by Xudong Guo, et al.

Communication can substantially improve cooperation in multi-agent reinforcement learning (MARL), especially in partially observable tasks. However, existing methods either broadcast messages, which leads to information redundancy, or learn targeted communication by modeling all other agents as potential targets, which does not scale when the number of agents varies. To tackle the scalability problem of MARL communication in partially observable tasks, we propose a novel framework, the Transformer-based Email Mechanism (TEM). Agents communicate locally, sending messages only to agents they can observe, without modeling all agents. Inspired by how humans cooperate through email forwarding, we design message chains that forward information so that agents can cooperate with others outside their observation range. We use a Transformer to encode and decode the message chain and to selectively choose the next receiver. Empirically, TEM outperforms the baselines on multiple cooperative MARL benchmarks. When the number of agents varies, TEM maintains superior performance without further training.
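The forwarding idea in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the names `observable` and `forward_chain`, the 2D positions, the observation radius, and the hop limit are all hypothetical, and a simple distance heuristic stands in for the learned Transformer that TEM uses to choose the next receiver. It only shows how a message can reach an agent outside the sender's observation range when every hop is restricted to currently observable agents.

```python
import math


def observable(positions, i, radius):
    """Return indices of agents within `radius` of agent i (excluding i)."""
    xi, yi = positions[i]
    return [
        j
        for j, (x, y) in enumerate(positions)
        if j != i and math.hypot(x - xi, y - yi) <= radius
    ]


def forward_chain(positions, sender, goal, radius, max_hops=4):
    """Forward a message hop by hop; each hop may only pick an observable agent.

    Assumption: the next receiver is the candidate closest to `goal`.
    In TEM this choice is made by a learned Transformer, not by this heuristic.
    """
    chain = [sender]
    current = sender
    gx, gy = goal
    for _ in range(max_hops):
        # Local communication: only agents the current holder can observe,
        # and that have not already received the message.
        candidates = [
            j for j in observable(positions, current, radius) if j not in chain
        ]
        if not candidates:
            break  # nobody new in range, so the chain ends here
        current = min(
            candidates,
            key=lambda j: math.hypot(positions[j][0] - gx, positions[j][1] - gy),
        )
        chain.append(current)
    return chain


# Four agents on a line; each one observes only its immediate neighbours.
positions = [(0.0, 0.0), (1.0, 0.0), (2.0, 0.0), (3.0, 0.0)]
chain = forward_chain(positions, sender=0, goal=(3.0, 0.0), radius=1.5)
print(chain)
```

In this toy setup agent 0 cannot observe agent 3, yet the message reaches it through agents 1 and 2. No agent ever needs a model of the whole team, and this is the property the abstract ties to scalability.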


