Soft-Bellman Equilibrium in Affine Markov Games: Forward Solutions and Inverse Learning

03/31/2023
by   Shenghui Chen, et al.
0

Markov games model interactions among multiple players in a stochastic, dynamic environment. Each player in a Markov game maximizes its expected total discounted reward, which depends upon the policies of the other players. We formulate a class of Markov games, termed affine Markov games, where an affine reward function couples the players' actions. We introduce a novel solution concept, the soft-Bellman equilibrium, where each player is boundedly rational and chooses a soft-Bellman policy rather than a purely rational policy as in the well-known Nash equilibrium concept. We provide conditions for the existence and uniqueness of the soft-Bellman equilibrium and propose a nonlinear least squares algorithm to compute such an equilibrium in the forward problem. We then solve the inverse game problem of inferring the players' reward parameters from observed state-action trajectories via a projected gradient algorithm. Experiments in a predator-prey OpenAI Gym environment show that the reward parameters inferred by the proposed algorithm outperform those inferred by a baseline algorithm: they reduce the Kullback-Leibler divergence between the equilibrium policies and observed policies by at least two orders of magnitude.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/19/2020

Risk-Averse Equilibrium for Games

The term rational has become synonymous with maximizing expected payoff ...
research
01/28/2022

Learning Stationary Nash Equilibrium Policies in n-Player Stochastic Games with Independent Chains via Dual Mirror Descent

We consider a subclass of n-player stochastic games, in which players ha...
research
10/18/2021

Empirical Policy Optimization for n-Player Markov Games

In single-agent Markov decision processes, an agent can optimize its pol...
research
01/15/2019

Model and Algorithm for Time-Consistent Risk-Aware Markov Games

In this paper, we propose a model for non-cooperative Markov games with ...
research
08/15/2023

Active Inverse Learning in Stackelberg Trajectory Games

Game-theoretic inverse learning is the problem of inferring the players'...
research
09/03/2020

Bounded Risk-Sensitive Markov Game and Its Inverse Reward Learning Problem

Classical game-theoretic approaches for multi-agent systems in both the ...
research
02/24/2021

Using Inverse Optimization to Learn Cost Functions in Generalized Nash Games

As demonstrated by Ratliff et al. (2014), inverse optimization can be us...

Please sign up or login with your details

Forgot password? Click here to reset