NVIF: Neighboring Variational Information Flow for Large-Scale Cooperative Multi-Agent Scenarios

07/03/2022
by   Jiajun Chai, et al.
0

Communication-based multi-agent reinforcement learning (MARL) provides information exchange between agents, which promotes the cooperation. However, existing methods cannot perform well in the large-scale multi-agent system. In this paper, we adopt neighboring communication and propose a Neighboring Variational Information Flow (NVIF) to provide efficient communication for agents. It employs variational auto-encoder to compress the shared information into a latent state. This communication protocol does not rely dependently on a specific task, so that it can be pre-trained to stabilize the MARL training. Besides. we combine NVIF with Proximal Policy Optimization (NVIF-PPO) and Deep Q Network (NVIF-DQN), and present a theoretical analysis to illustrate NVIF-PPO can promote cooperation. We evaluate the NVIF-PPO and NVIF-DQN on MAgent, a widely used large-scale multi-agent environment, by two tasks with different map sizes. Experiments show that our method outperforms other compared methods, and can learn effective and scalable cooperation strategies in the large-scale multi-agent system.

READ FULL TEXT

page 1

page 10

research
05/20/2018

Learning Attentional Communication for Multi-Agent Cooperation

Communication could potentially be an effective way for multi-agent coop...
research
05/23/2023

Research on Multi-Agent Communication and Collaborative Decision-Making Based on Deep Reinforcement Learning

In a multi-agent environment, In order to overcome and alleviate the non...
research
02/11/2020

Learning Structured Communication for Multi-agent Reinforcement Learning

This work explores the large-scale multi-agent communication mechanism u...
research
01/25/2021

Accumulating Risk Capital Through Investing in Cooperation

Recent work on promoting cooperation in multi-agent learning has resulte...
research
11/08/2020

Topology Inference for Multi-agent Cooperation under Unmeasurable Latent Input

Topology inference is a crucial problem for cooperative control in multi...
research
02/24/2023

Multi-Agent Reinforcement Learning with Common Policy for Antenna Tilt Optimization

This paper proposes a method for wireless network optimization applicabl...

Please sign up or login with your details

Forgot password? Click here to reset