Bi-level Actor-Critic for Multi-agent Coordination

09/08/2019
by   Haifeng Zhang, et al.
0

Coordination is one of the essential problems in multi-agent systems. Typically multi-agent reinforcement learning (MARL) methods treat agents equally and the goal is to solve the Markov game to an arbitrary Nash equilibrium (NE) when multiple equilibra exist, thus lacking a solution for NE selection. In this paper, we treat agents unequally and consider Stackelberg equilibrium as a potentially better convergence point than Nash equilibrium in terms of Pareto superiority, especially in cooperative environments. Under Markov games, we formally define the bi-level reinforcement learning problem in finding Stackelberg equilibrium. We propose a novel bi-level actor-critic learning method that allows agents to have different knowledge base (thus intelligent), while their actions still can be executed simultaneously and distributedly. The convergence proof is given, while the resulting learning algorithm is tested against the state of the arts. We found that the proposed bi-level actor-critic algorithm successfully converged to the Stackelberg equilibria in matrix games and find a asymmetric solution in a highway merge environment.

READ FULL TEXT

page 6

page 7

research
09/28/2022

Pareto Actor-Critic for Equilibrium Selection in Multi-Agent Reinforcement Learning

Equilibrium selection in multi-agent games refers to the problem of sele...
research
01/26/2019

Multi-Agent Generalized Recursive Reasoning

We propose a new reasoning protocol called generalized recursive reasoni...
research
12/08/2020

Resolving Implicit Coordination in Multi-Agent Deep Reinforcement Learning with Deep Q-Networks Game Theory

We address two major challenges of implicit coordination in multi-agent ...
research
06/26/2018

Multi-agent Inverse Reinforcement Learning for General-sum Stochastic Games

This paper addresses the problem of multi-agent inverse reinforcement le...
research
04/20/2023

Inducing Stackelberg Equilibrium through Spatio-Temporal Sequential Decision-Making in Multi-Agent Reinforcement Learning

In multi-agent reinforcement learning (MARL), self-interested agents att...
research
03/14/2023

Multi-agent Attention Actor-Critic Algorithm for Load Balancing in Cellular Networks

In cellular networks, User Equipment (UE) handoff from one Base Station ...
research
01/26/2019

Probabilistic Recursive Reasoning for Multi-Agent Reinforcement Learning

Humans are capable of attributing latent mental contents such as beliefs...

Please sign up or login with your details

Forgot password? Click here to reset