Oracle-free Reinforcement Learning in Mean-Field Games along a Single Sample Path

08/24/2022
by Muhammad Aneeq uz Zaman, et al.

We consider online reinforcement learning in Mean-Field Games. In contrast to existing work, we remove the need for a mean-field oracle by developing an algorithm that estimates both the mean-field and the optimal policy from a single sample path of the generic agent. We call this Sandbox Learning, as it can be used as a warm-start for any agent operating in a multi-agent non-cooperative setting. We adopt a two-timescale approach in which an online fixed-point recursion for the mean-field operates on a slower timescale, in tandem with a control policy update for the generic agent on a faster timescale. Under a sufficient exploration condition, we provide finite-sample guarantees for the convergence of the mean-field and the control policy to the mean-field equilibrium. The sample complexity of the Sandbox Learning algorithm is 𝒪(ϵ^-4). Finally, we empirically demonstrate the effectiveness of the Sandbox Learning algorithm in a congestion game.
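The two-timescale idea can be illustrated with a minimal sketch. This is not the paper's algorithm: the abstract does not specify the policy update, so tabular Q-learning with epsilon-greedy exploration is swapped in for the fast timescale, and the toy dynamics, the congestion reward, the step sizes, and the name `sandbox_sketch` are all assumptions made for illustration. The only points taken from the abstract are that a single sample path drives both updates, that the mean-field estimate moves on the slower timescale, and that the policy moves on the faster one.

```python
import random


def sandbox_sketch(n_states=3, steps=20000, alpha=0.1, beta=0.01,
                   gamma=0.9, eps=0.2, seed=0):
    """Toy two-timescale loop along one sample path (illustrative only).

    alpha: fast step size for the policy (Q-learning, an assumed stand-in).
    beta:  slow step size for the mean-field estimate (beta < alpha).
    """
    rng = random.Random(seed)
    # Q-table of the generic agent; action a means "try to move to state a".
    q = [[0.0] * n_states for _ in range(n_states)]
    # Mean-field estimate: a probability vector over states.
    mu = [1.0 / n_states] * n_states
    s = 0
    for _ in range(steps):
        # Epsilon-greedy action choice, standing in for sufficient exploration.
        if rng.random() < eps:
            a = rng.randrange(n_states)
        else:
            a = max(range(n_states), key=lambda b: q[s][b])
        # Assumed dynamics: the move succeeds with probability 0.9,
        # otherwise the agent lands in a uniformly random state.
        s_next = a if rng.random() < 0.9 else rng.randrange(n_states)
        # Assumed congestion reward: crowded destinations are penalized.
        r = -mu[s_next]
        # Fast timescale: Q-learning update for the generic agent.
        q[s][a] += alpha * (r + gamma * max(q[s_next]) - q[s][a])
        # Slow timescale: move the mean-field estimate toward the
        # indicator of the state just visited on the same sample path.
        mu = [(1.0 - beta) * m + beta * (1.0 if i == s_next else 0.0)
              for i, m in enumerate(mu)]
        s = s_next
    return q, mu
```

Calling `q, mu = sandbox_sketch()` returns the learned Q-table and the mean-field estimate. Because each slow update is a convex combination, `mu` remains a probability vector throughout; no oracle is queried for the mean-field at any point.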

research
10/08/2020

Provable Fictitious Play for General Mean-Field Games

We propose a reinforcement learning algorithm for stationary mean-field ...
research
03/13/2020

A General Framework for Learning Mean-Field Games

This paper presents a general mean-field game (GMFG) framework for simul...
research
06/05/2023

Networked Communication for Decentralised Agents in Mean-Field Games

We introduce networked communication to the mean-field game framework. I...
research
05/18/2023

On the Statistical Efficiency of Mean Field Reinforcement Learning with General Function Approximation

In this paper, we study the statistical efficiency of Reinforcement Lear...
research
09/13/2022

Independent Learning in Mean-Field Games: Satisficing Paths and Convergence to Subjective Equilibria

Independent learners are learning agents that naively employ single-agen...
research
02/10/2020

Q-Learning for Mean-Field Controls

Multi-agent reinforcement learning (MARL) has been applied to many chall...
research
02/07/2023

Population-size-Aware Policy Optimization for Mean-Field Games

In this work, we attempt to bridge the two fields of finite-agent and in...
