Learning to Collaborate by Grouping: a Consensus-oriented Strategy for Multi-agent Reinforcement Learning

07/28/2023
by Jingqing Ruan, et al.

Multi-agent systems require effective coordination between groups and individuals to achieve common goals. However, current multi-agent reinforcement learning (MARL) methods primarily focus on improving individual policies and do not adequately address group-level policies, which leads to weak cooperation. To address this issue, we propose a novel Consensus-oriented Strategy (CoS) that emphasizes group and individual policies simultaneously. Specifically, CoS comprises two main components: (a) the vector quantized group consensus module, which extracts discrete latent embeddings that represent the stable and discriminative group consensus, and (b) the group consensus-oriented strategy, which integrates the group policy using a hypernet and the individual policies using the group consensus, thereby promoting coordination at both the group and individual levels. Through empirical experiments on cooperative navigation tasks with both discrete and continuous spaces, as well as Google Research Football, we demonstrate that CoS outperforms state-of-the-art MARL algorithms and achieves better collaboration, providing a promising solution for effective coordination in multi-agent systems.
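
The abstract's "discrete latent embeddings" can be illustrated with a generic vector-quantization lookup: a continuous embedding is replaced by its nearest entry in a finite codebook, and the entry's index serves as a discrete code. The sketch below is only that generic step, not the CoS implementation. The function name `quantize`, the 2-D codebook, and the per-agent embeddings are all hypothetical, and in practice the codebook would be learned rather than fixed.

```python
from typing import List, Sequence, Tuple


def quantize(
    embedding: Sequence[float],
    codebook: Sequence[Sequence[float]],
) -> Tuple[int, List[float]]:
    """Return (index, code vector) of the codebook entry nearest to `embedding`.

    Nearness is squared Euclidean distance; ties go to the lowest index.
    """
    if not codebook:
        raise ValueError("codebook must not be empty")

    def sq_dist(a: Sequence[float], b: Sequence[float]) -> float:
        return sum((x - y) ** 2 for x, y in zip(a, b))

    index = min(range(len(codebook)), key=lambda k: sq_dist(embedding, codebook[k]))
    return index, list(codebook[index])


# Hypothetical codebook with three discrete "consensus" codes (assumed, not from the paper).
codebook = [[0.0, 0.0], [1.0, 1.0], [-1.0, 1.0]]

# Hypothetical continuous embeddings, one per agent, as some encoder might produce.
agent_embeddings = [[0.9, 1.2], [0.1, -0.2], [-0.8, 0.7]]

# Each agent's embedding is snapped to a discrete code index.
consensus_ids = [quantize(e, codebook)[0] for e in agent_embeddings]
print(consensus_ids)
```

Agents whose embeddings fall near the same codebook entry receive the same index, which is one way a discrete code could act as a shared group signal.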

research
09/13/2018

Coordination-driven learning in multi-agent problem spaces

We discuss the role of coordination as a direct learning objective in mu...
research
03/11/2021

Adversarial attacks in consensus-based multi-agent reinforcement learning

Recently, many cooperative distributed multi-agent reinforcement learnin...
research
03/02/2023

GHQ: Grouped Hybrid Q Learning for Heterogeneous Cooperative Multi-agent Reinforcement Learning

Previous deep multi-agent reinforcement learning (MARL) algorithms have ...
research
09/20/2022

Rethinking Individual Global Max in Cooperative Multi-Agent Reinforcement Learning

In cooperative multi-agent reinforcement learning, centralized training ...
research
02/09/2021

Structured Diversification Emergence via Reinforced Organization Control and Hierarchical Consensus Learning

When solving a complex task, humans will spontaneously form teams and to...
research
06/26/2018

Learning Existing Social Conventions in Markov Games

In order for artificial agents to coordinate effectively with people, th...
research
11/05/2019

Learning to flock through reinforcement

Flocks of birds, schools of fish, insect swarms are examples of coordin...
