Hierarchical Strategies for Cooperative Multi-Agent Reinforcement Learning

12/14/2022
by   Majd Ibrahim, et al.
0

Adequate strategizing of agents behaviors is essential to solving cooperative MARL problems. One intuitively beneficial yet uncommon method in this domain is predicting agents future behaviors and planning accordingly. Leveraging this point, we propose a two-level hierarchical architecture that combines a novel information-theoretic objective with a trajectory prediction model to learn a strategy. To this end, we introduce a latent policy that learns two types of latent strategies: individual z_A, and relational z_R using a modified Graph Attention Network module to extract interaction features. We encourage each agent to behave according to the strategy by conditioning its local Q functions on z_A, and we further equip agents with a shared Q function that conditions on z_R. Additionally, we introduce two regularizers to allow predicted trajectories to be accurate and rewarding. Empirical results on Google Research Football (GRF) and StarCraft (SC) II micromanagement tasks show that our method establishes a new state of the art being, to the best of our knowledge, the first MARL algorithm to solve all super hard SC II scenarios as well as the GRF full game with a win rate higher than 95%, thus outperforming all existing methods. Videos and brief overview of the methods and results are available at: https://sites.google.com/view/hier-strats-marl/home.

READ FULL TEXT

page 14

page 15

page 16

research
03/18/2020

Multi-Agent Reinforcement Learning with Emergent Roles

The role concept provides a useful tool to design and understand complex...
research
03/18/2020

ROMA: Multi-Agent Reinforcement Learning with Emergent Roles

The role concept provides a useful tool to design and understand complex...
research
08/20/2020

Multi-Agent Reinforcement Learning with Graph Clustering

In this paper, we introduce the group concept into multi-agent reinforce...
research
02/19/2021

Deep Latent Competition: Learning to Race Using Visual Control Policies in Latent Space

Learning competitive behaviors in multi-agent settings such as racing re...
research
03/28/2022

UNMAS: Multi-Agent Reinforcement Learning for Unshaped Cooperative Scenarios

Multi-agent reinforcement learning methods such as VDN, QMIX, and QTRAN ...
research
10/16/2019

MAVEN: Multi-Agent Variational Exploration

Centralised training with decentralised execution is an important settin...
research
12/16/2022

Hippocampus-Inspired Cognitive Architecture (HICA) for Operant Conditioning

The neural implementation of operant conditioning with few trials is unc...

Please sign up or login with your details

Forgot password? Click here to reset