A General Framework for Learning Mean-Field Games

03/13/2020
by   Xin Guo, et al.
0

This paper presents a general mean-field game (GMFG) framework for simultaneous learning and decision-making in stochastic games with a large population. It first establishes the existence of a unique Nash Equilibrium to this GMFG, and demonstrates that naively combining Q-learning with the fixed-point approach in classical MFGs yields unstable algorithms. It then proposes value-based and policy-based reinforcement learning algorithms (GMF-P and GMF-P respectively) with smoothed policies, with analysis of convergence property and computational complexity. The experiments on repeated Ad auction problems demonstrate that GMF-V-Q, a specific GMF-V algorithm based on Q-learning, is efficient and robust in terms of convergence and learning accuracy. Moreover, its performance is superior in convergence, stability, and learning ability, when compared with existing algorithms for multi-agent reinforcement learning.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/24/2022

Oracle-free Reinforcement Learning in Mean-Field Games along a Single Sample Path

We consider online reinforcement learning in Mean-Field Games. In contra...
research
12/31/2019

Fitted Q-Learning in Mean-field Games

In the literature, existence of equilibria for discrete-time mean field ...
research
02/06/2020

Multi Type Mean Field Reinforcement Learning

Mean field theory provides an effective way of scaling multiagent reinfo...
research
02/02/2021

Approximately Solving Mean Field Games via Entropy-Regularized Deep Reinforcement Learning

The recent mean field game (MFG) formalism facilitates otherwise intract...
research
05/29/2019

Scalable and transferable learning of algorithms via graph embedding for multi-robot reward collection

Can the success of reinforcement learning methods for combinatorial opti...
research
09/30/2020

Entropy Regularization for Mean Field Games with Learning

Entropy regularization has been extensively adopted to improve the effic...
research
07/16/2023

MESOB: Balancing Equilibria Social Optimality

Motivated by bid recommendation in online ad auctions, this paper consid...

Please sign up or login with your details

Forgot password? Click here to reset