Minimizing Return Gaps with Discrete Communications in Decentralized POMDP

08/07/2023
by   Jingdi Chen, et al.
0

Communication is crucial for solving cooperative Multi-Agent Reinforcement Learning tasks in Partially-Observable Markov Decision Processes. Existing works often rely on black-box methods to encode local information/features into messages shared with other agents. However, such black-box approaches are unable to provide any quantitative guarantees on the expected return and often lead to the generation of continuous messages with high communication overhead and poor interpretability. In this paper, we establish an upper bound on the return gap between an ideal policy with full observability and an optimal partially-observable policy with discrete communication. This result enables us to recast multi-agent communication into a novel online clustering problem over the local observations at each agent, with messages as cluster labels and the upper bound on the return gap as clustering loss. By minimizing the upper bound, we propose a surprisingly simple design of message generation functions in multi-agent communication and integrate it with reinforcement learning using a Regularized Information Maximization loss function. Evaluations show that the proposed discrete communication significantly outperforms state-of-the-art multi-agent communication baselines and can achieve nearly-optimal returns with few-bit messages that are naturally interpretable.

READ FULL TEXT
research
11/06/2020

Multi-Agent Decentralized Belief Propagation on Graphs

We consider the problem of interactive partially observable Markov decis...
research
01/28/2022

FCMNet: Full Communication Memory Net for Team-Level Cooperation in Multi-Agent Systems

Decentralized cooperation in partially-observable multi-agent systems re...
research
01/05/2023

Scalable Communication for Multi-Agent Reinforcement Learning via Transformer-Based Email Mechanism

Communication can impressively improve cooperation in multi-agent reinfo...
research
04/03/2020

Multi-agent Reinforcement Learning for Networked System Control

This paper considers multi-agent reinforcement learning (MARL) in networ...
research
04/02/2020

Information State Embedding in Partially Observable Cooperative Multi-Agent Reinforcement Learning

Multi-agent reinforcement learning (MARL) under partial observability ha...
research
03/18/2018

Detection under One-Bit Messaging over Adaptive Networks

This work studies the operation of multi-agent networks engaged in binar...
research
07/10/2020

MAPS: Multi-agent Reinforcement Learning-based Portfolio Management System

Generating an investment strategy using advanced deep learning methods i...

Please sign up or login with your details

Forgot password? Click here to reset