Model Free Reinforcement Learning Algorithm for Stationary Mean field Equilibrium for Multiple Types of Agents

12/31/2020
by   Arnob Ghosh, et al.
0

We consider a multi-agent Markov strategic interaction over an infinite horizon where agents can be of multiple types. We model the strategic interaction as a mean-field game in the asymptotic limit when the number of agents of each type becomes infinite. Each agent has a private state; the state evolves depending on the distribution of the state of the agents of different types and the action of the agent. Each agent wants to maximize the discounted sum of rewards over the infinite horizon which depends on the state of the agent and the distribution of the state of the leaders and followers. We seek to characterize and compute a stationary multi-type Mean field equilibrium (MMFE) in the above game. We characterize the conditions under which a stationary MMFE exists. Finally, we propose Reinforcement learning (RL) based algorithm using policy gradient approach to find the stationary MMFE when the agents are unaware of the dynamics. We, numerically, evaluate how such kind of interaction can model the cyber attacks among defenders and adversaries, and show how RL based algorithm can converge to an equilibrium.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/24/2020

Unified Reinforcement Q-Learning for Mean Field Game and Control Problems

We present a Reinforcement Learning (RL) algorithm to solve infinite hor...
research
09/19/2023

Deep Reinforcement Learning for Infinite Horizon Mean Field Problems in Continuous Spaces

We present the development and analysis of a reinforcement learning (RL)...
research
06/21/2020

Learning Trembling Hand Perfect Mean Field Equilibrium for Dynamic Mean Field Games

Mean Field Games (MFG) are those in which each agent assumes that the st...
research
12/12/2017

Small-Scale Markets for Bilateral Resource Trading in the Sharing Economy

We consider a general small-scale market for agent-to-agent resource sha...
research
05/02/2019

Reputation-Based Information Design for Inducing Prosocial Behavior

We study the idea of information design for inducing prosocial behavior ...
research
05/10/2019

Signaling equilibria in mean-field games

In this paper, we consider both finite and infinite horizon discounted d...
research
07/25/2017

Mean Field Equilibria for Resource Competition in Spatial Settings

We study a model of competition among nomadic agents for time-varying an...

Please sign up or login with your details

Forgot password? Click here to reset