Multi-Agent Intention Sharing via Leader-Follower Forest

12/02/2021
by   Zeyang Liu, et al.
0

Intention sharing is crucial for efficient cooperation under partially observable environments in multi-agent reinforcement learning (MARL). However, message deceiving, i.e., a mismatch between the propagated intentions and the final decisions, may happen when agents change strategies simultaneously according to received intentions. Message deceiving leads to potential miscoordination and difficulty for policy learning. This paper proposes the leader-follower forest (LFF) to learn the hierarchical relationship between agents based on interdependencies, achieving one-sided intention sharing in multi-agent communication. By limiting the flowings of intentions through directed edges, intention sharing via LFF (IS-LFF) can eliminate message deceiving effectively and achieve better coordination. In addition, a twostage learning algorithm is proposed to train the forest and the agent network. We evaluate IS-LFF on multiple partially observable MARL benchmarks, and the experimental results show that our method outperforms state-of-the-art communication algorithms.

READ FULL TEXT

page 2

page 6

page 7

research
01/02/2021

A Joint Learning and Communication Framework for Multi-Agent Reinforcement Learning over Noisy Channels

We propose a novel formulation of the "effectiveness problem" in communi...
research
04/19/2020

Intention Propagation for Multi-agent Reinforcement Learning

A hallmark of an AI agent is to mimic human beings to understand and int...
research
05/07/2023

Robust Multi-agent Communication via Multi-view Message Certification

Many multi-agent scenarios require message sharing among agents to promo...
research
08/16/2023

Partially Observable Multi-agent RL with (Quasi-)Efficiency: The Blessing of Information Sharing

We study provable multi-agent reinforcement learning (MARL) in the gener...
research
02/22/2022

A Decentralized Communication Framework based on Dual-Level Recurrence for Multi-Agent Reinforcement Learning

We propose a model enabling decentralized multiple agents to share their...
research
04/18/2013

Interactive POMDP Lite: Towards Practical Planning to Predict and Exploit Intentions for Interacting with Self-Interested Agents

A key challenge in non-cooperative multi-agent systems is that of develo...
research
05/15/2023

More Like Real World Game Challenge for Partially Observable Multi-Agent Cooperation

Some standardized environments have been designed for partially observab...

Please sign up or login with your details

Forgot password? Click here to reset