Non-cooperative Multi-agent Systems with Exploring Agents

05/25/2020
by   Jalal Etesami, et al.
0

Multi-agent learning is a challenging problem in machine learning that has applications in different domains such as distributed control, robotics, and economics. We develop a prescriptive model of multi-agent behavior using Markov games. Since in many multi-agent systems, agents do not necessary select their optimum strategies against other agents (e.g., multi-pedestrian interaction), we focus on models in which the agents play "exploration but near optimum strategies". We model such policies using the Boltzmann-Gibbs distribution. This leads to a set of coupled Bellman equations that describes the behavior of the agents. We introduce a set of conditions under which the set of equations admit a unique solution and propose two algorithms that provably provide the solution in finite and infinite time horizon scenarios. We also study a practical setting in which the interactions can be described using the occupancy measures and propose a simplified Markov game with less complexity. Furthermore, we establish the connection between the Markov games with exploration strategies and the principle of maximum causal entropy for multi-agent systems. Finally, we evaluate the performance of our algorithms via several well-known games from the literature and some games that are designed based on real world applications.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/18/2022

RPM: Generalizable Behaviors for Multi-Agent Reinforcement Learning

Despite the recent advancement in multi-agent reinforcement learning (MA...
research
04/07/2023

Markov Games with Decoupled Dynamics: Price of Anarchy and Sample Complexity

This paper studies the finite-time horizon Markov games where the agents...
research
11/21/2020

Multi-agent Deep FBSDE Representation For Large Scale Stochastic Differential Games

In this paper, we present a deep learning framework for solving large-sc...
research
04/29/2009

Continuous Strategy Replicator Dynamics for Multi--Agent Learning

The problem of multi-agent learning and adaptation has attracted a great...
research
11/22/2019

Thompson Sampling for Factored Multi-Agent Bandits

Multi-agent coordination is prevalent in many real-world applications. H...
research
06/13/2023

Provably Learning Nash Policies in Constrained Markov Potential Games

Multi-agent reinforcement learning (MARL) addresses sequential decision-...
research
12/20/2022

Automated Configuration and Usage of Strategy Portfolios for Bargaining

Bargaining can be used to resolve mixed-motive games in multi-agent syst...

Please sign up or login with your details

Forgot password? Click here to reset