Decentralized Coordination in Partially Observable Queueing Networks

08/29/2022
by Jiekai Jia, et al.

We consider communication in a fully cooperative multi-agent system in which the agents have partial observations of the environment and must act jointly to maximize the overall reward. The setting is a discrete-time queueing network where agents route packets to queues based only on partial information about the current queue lengths. The queues have limited buffer capacity, so a packet is dropped when it is sent to a full queue. In this work, we implement a communication channel through which the agents share information in order to reduce the packet drop rate. For efficient information sharing, we use an attention-based communication model, called ATVC, to select informative messages from other agents. The agents then infer the state of the queues using a combination of a variational auto-encoder (VAE) and a product-of-experts (PoE) model. Ultimately, the agents learn what they need to communicate and with whom, instead of communicating with everyone all the time. We also show empirically that ATVC is able to infer the true state of the queues and leads to a policy that outperforms existing baselines.
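To make the setting concrete, the following is a minimal sketch of a discrete-time queueing network with finite buffers and packet drops. It is not the authors' environment: the arrival model (one packet per agent per time slot), the Bernoulli service model, and all names (`step`, `simulate`, `random_policy`, `serve_prob`) are assumptions made for illustration only.

```python
import random


def step(queues, capacity, choices, serve_prob, rng):
    """Advance the network by one time slot and return the number of drops.

    queues     -- list of current queue lengths (modified in place)
    capacity   -- buffer size shared by all queues (assumed identical)
    choices    -- queue index chosen by each agent for its packet
    serve_prob -- assumed probability that a non-empty queue serves one packet
    """
    dropped = 0
    # Routing phase: a packet sent to a full queue is dropped.
    for q in choices:
        if queues[q] >= capacity:
            dropped += 1
        else:
            queues[q] += 1
    # Service phase (assumption): each non-empty queue serves at most one packet.
    for i in range(len(queues)):
        if queues[i] > 0 and rng.random() < serve_prob:
            queues[i] -= 1
    return dropped


def random_policy(agent, rng, n_queues):
    """Hypothetical baseline: route without using any queue information."""
    return rng.randrange(n_queues)


def simulate(n_agents=4, n_queues=2, capacity=3, serve_prob=0.5,
             steps=1000, seed=0):
    """Run the random baseline and return the fraction of packets dropped."""
    rng = random.Random(seed)
    queues = [0] * n_queues
    dropped = 0
    for _ in range(steps):
        choices = [random_policy(a, rng, n_queues) for a in range(n_agents)]
        dropped += step(queues, capacity, choices, serve_prob, rng)
    return dropped / (steps * n_agents)


if __name__ == "__main__":
    print("drop rate under random routing:", simulate())
```

A learned policy would replace `random_policy` and choose a queue from each agent's partial observation together with the messages it receives; the drop rate returned by `simulate` is the quantity that communication is meant to reduce.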

research
02/22/2022

A Decentralized Communication Framework based on Dual-Level Recurrence for Multi-Agent Reinforcement Learning

We propose a model enabling decentralized multiple agents to share their...
research
03/03/2021

Inference-Based Deterministic Messaging For Multi-Agent Communication

Communication is essential for coordination among humans and animals. Th...
research
06/12/2020

Learning to Communicate Using Counterfactual Reasoning

This paper introduces a new approach for multi-agent communication learn...
research
02/16/2020

R-MADDPG for Partially Observable Environments and Limited Communication

There are several real-world tasks that would benefit from applying mul...
research
10/02/2018

Learning-Based Physical Layer Communications for Multi-agent Collaboration

Consider a collaborative task carried out by two autonomous agents that ...
research
07/31/2018

Incentives and Coordination in Bottleneck Models

We study a variant of Vickrey's classic bottleneck model. In our model t...
research
07/03/2023

Learning to Communicate using Contrastive Learning

Communication is a powerful tool for coordination in multi-agent RL. But...
