Policy Optimization for Linear-Quadratic Zero-Sum Mean-Field Type Games

09/02/2020
by   René Carmona, et al.

In this paper, zero-sum mean-field type games (ZSMFTG) with linear dynamics and quadratic utility are studied under an infinite-horizon discounted utility function. ZSMFTG are a class of games in which two decision makers, whose utilities sum to zero, compete to influence a large population of agents. In particular, the paper investigates the case in which the transition and utility functions depend on the state, the actions of the controllers, and the means of the state and the actions. The game is analyzed and explicit expressions for the Nash equilibrium strategies are derived. Moreover, two policy optimization methods that rely on policy gradient are proposed, one for the model-based framework and one for the sample-based framework. In the first case, the gradients are computed exactly using the model, whereas in the second case they are estimated using Monte Carlo simulations. Numerical experiments show the convergence of the two players' controls, as well as of the utility function, when the two algorithms are used in different scenarios.
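
To make the distinction between exact (model-based) and Monte Carlo (sample-based) gradients concrete, the following is a minimal sketch on a scalar zero-sum linear-quadratic game. It is not the paper's algorithm: the mean-field terms are dropped entirely, the model coefficients, the linear feedback form u1 = -k1*x and u2 = k2*x, the sign convention (player 1 minimizes, player 2 maximizes), the two-point zeroth-order estimator, and the simultaneous descent-ascent update are all assumptions chosen for illustration, and every function name is hypothetical.

```python
import random

# Hypothetical scalar model; all coefficients are illustrative assumptions.
A, B1, B2 = 0.9, 0.5, 0.3   # dynamics: x' = A*x + B1*u1 + B2*u2
Q, R1, R2 = 1.0, 1.0, 4.0   # stage cost: Q*x^2 + R1*u1^2 - R2*u2^2
GAMMA = 0.9                 # discount factor


def exact_cost(k1, k2, x0_sq=1.0):
    """Closed-form discounted cost for u1 = -k1*x, u2 = k2*x, E[x0^2] = x0_sq."""
    m = A - B1 * k1 + B2 * k2  # closed-loop multiplier
    if GAMMA * m * m >= 1.0:
        raise ValueError("policy pair does not give a finite discounted cost")
    return (Q + R1 * k1 * k1 - R2 * k2 * k2) * x0_sq / (1.0 - GAMMA * m * m)


def rollout_cost(k1, k2, x0, horizon=100):
    """Truncated discounted cost of one simulated trajectory."""
    x, total, disc = x0, 0.0, 1.0
    for _ in range(horizon):
        u1, u2 = -k1 * x, k2 * x
        total += disc * (Q * x * x + R1 * u1 * u1 - R2 * u2 * u2)
        x = A * x + B1 * u1 + B2 * u2
        disc *= GAMMA
    return total


def exact_grad(k1, k2, eps=1e-6):
    """Model-based gradient: central differences on the closed-form cost."""
    g1 = (exact_cost(k1 + eps, k2) - exact_cost(k1 - eps, k2)) / (2 * eps)
    g2 = (exact_cost(k1, k2 + eps) - exact_cost(k1, k2 - eps)) / (2 * eps)
    return g1, g2


def sampled_grad(k1, k2, rng, n_samples=2000, radius=0.05):
    """Sample-based gradient: two-point zeroth-order Monte Carlo estimate."""
    g1 = g2 = 0.0
    for _ in range(n_samples):
        x0 = rng.gauss(0.0, 1.0)  # random initial state with E[x0^2] = 1
        d1, d2 = rng.choice((-1.0, 1.0)), rng.choice((-1.0, 1.0))
        # Same x0 for both perturbed rollouts (common random numbers).
        diff = (rollout_cost(k1 + radius * d1, k2 + radius * d2, x0)
                - rollout_cost(k1 - radius * d1, k2 - radius * d2, x0))
        g1 += diff * d1 / (2 * radius)
        g2 += diff * d2 / (2 * radius)
    return g1 / n_samples, g2 / n_samples


def descent_ascent(grad_fn, k1=0.0, k2=0.0, lr=0.02, steps=300):
    """Simultaneous update: player 1 descends, player 2 ascends."""
    for _ in range(steps):
        g1, g2 = grad_fn(k1, k2)
        k1, k2 = k1 - lr * g1, k2 + lr * g2
    return k1, k2


if __name__ == "__main__":
    print("model-based gains:", descent_ascent(exact_grad))
    rng = random.Random(0)
    print("sampled gradient at (0.3, 0.1):", sampled_grad(0.3, 0.1, rng))
    print("exact gradient at (0.3, 0.1):  ", exact_grad(0.3, 0.1))
```

The sample-based estimator can be passed to the same loop, for example `descent_ascent(lambda a, b: sampled_grad(a, b, rng), steps=50)`, at the price of noisier iterates. The step size and sample counts here are untuned guesses, and simultaneous descent-ascent is not guaranteed to converge for other coefficient choices.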


research
09/01/2020

Linear-Quadratic Zero-Sum Mean-Field Type Games: Optimality Conditions and Policy Optimization

In this paper, zero-sum mean-field type games (ZSMFTG) with linear dynam...
research
11/29/2020

Reinforcement Learning in Nonzero-sum Linear Quadratic Deep Structured Games: Global Convergence of Policy Optimization

We study model-based and model-free policy optimization in a class of no...
research
12/17/2018

Semi-Explicit Solutions to some Non-Linear Non-Quadratic Mean-Field-Type Games: A Direct Method

This article examines the solvability of mean-field-type game problems b...
research
08/16/2020

Global Convergence of Policy Gradient for Linear-Quadratic Mean-Field Control/Game in Continuous Time

Reinforcement learning is a powerful tool to learn the optimal policy of...
research
11/06/2017

Performance Analysis of Trial and Error Algorithms

Model-free decentralized optimizations and learning are receiving increa...
research
04/23/2019

Matrix-Valued Mean-Field-Type Games: Risk-Sensitive, Adversarial, and Risk-Neutral Linear-Quadratic Case

In this paper we study a class of matrix-valued linear-quadratic mean-fi...
research
06/10/2022

Dynamic mean field programming

A dynamic mean field theory is developed for model based Bayesian reinfo...
