Correcting Experience Replay for Multi-Agent Communication

10/02/2020
by   Sanjeevan Ahilan, et al.
0

We consider the problem of learning to communicate using multi-agent reinforcement learning (MARL). A common approach is to learn off-policy, using data sampled from a replay buffer. However, messages received in the past may not accurately reflect the current communication policy of each agent, and this complicates learning. We therefore introduce a 'communication correction' which accounts for the non-stationarity of observed communication induced by multi-agent learning. It works by relabelling the received message to make it likely under the communicator's current policy, and thus be a better reflection of the receiver's current environment. To account for cases in which agents are both senders and receivers, we introduce an ordered relabelling scheme. Our correction is computationally efficient and can be integrated with a range of off-policy algorithms. It substantially improves the ability of communicating MARL systems to learn across a variety of cooperative and competitive tasks.

READ FULL TEXT
research
02/21/2023

MAC-PO: Multi-Agent Experience Replay via Collective Priority Optimization

Experience replay is crucial for off-policy reinforcement learning (RL) ...
research
06/15/2021

Minimizing Communication while Maximizing Performance in Multi-Agent Reinforcement Learning

Inter-agent communication can significantly increase performance in mult...
research
06/20/2022

MASER: Multi-Agent Reinforcement Learning with Subgoals Generated from Experience Replay Buffer

In this paper, we consider cooperative multi-agent reinforcement learnin...
research
05/23/2023

Research on Multi-Agent Communication and Collaborative Decision-Making Based on Deep Reinforcement Learning

In a multi-agent environment, In order to overcome and alleviate the non...
research
01/05/2023

Scalable Communication for Multi-Agent Reinforcement Learning via Transformer-Based Email Mechanism

Communication can impressively improve cooperation in multi-agent reinfo...
research
12/23/2018

Learning when to Communicate at Scale in Multiagent Cooperative and Competitive Tasks

Learning when to communicate and doing that effectively is essential in ...
research
02/19/2023

Efficient Communication via Self-supervised Information Aggregation for Online and Offline Multi-agent Reinforcement Learning

Utilizing messages from teammates can improve coordination in cooperativ...

Please sign up or login with your details

Forgot password? Click here to reset