Coordinating Policies Among Multiple Agents via an Intelligent Communication Channel

05/21/2022
by   Dianbo Liu, et al.
41

In Multi-Agent Reinforcement Learning (MARL), specialized channels are often introduced that allow agents to communicate directly with one another. In this paper, we propose an alternative approach whereby agents communicate through an intelligent facilitator that learns to sift through and interpret signals provided by all agents to improve the agents' collective performance. To ensure that this facilitator does not become a centralized controller, agents are incentivized to reduce their dependence on the messages it conveys, and the messages can only influence the selection of a policy from a fixed set, not instantaneous actions given the policy. We demonstrate the strength of this architecture over existing baselines on several cooperative MARL environments.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/11/2023

Control as Probabilistic Inference as an Emergent Communication Mechanism in Multi-Agent Reinforcement Learning

This paper proposes a generative probabilistic model integrating emergen...
research
11/13/2019

Learning to Communicate in Multi-Agent Reinforcement Learning : A Review

We consider the issue of multiple agents learning to communicate through...
research
03/19/2023

Cheap Talk Discovery and Utilization in Multi-Agent Reinforcement Learning

By enabling agents to communicate, recent cooperative multi-agent reinfo...
research
03/12/2019

On the Pitfalls of Measuring Emergent Communication

How do we know if communication is emerging in a multi-agent system? The...
research
09/17/2021

APIA: An Architecture for Policy-Aware Intentional Agents

This paper introduces the APIA architecture for policy-aware intentional...
research
07/03/2023

Learning to Communicate using Contrastive Learning

Communication is a powerful tool for coordination in multi-agent RL. But...
research
02/21/2019

Policies for allocation of information in task-oriented groups: elitism and egalitarianism outperform welfarism

Communication or influence networks are probably the most controllable o...

Please sign up or login with your details

Forgot password? Click here to reset