Stochastic Stability of Reinforcement Learning in Positive-Utility Games

09/18/2017
by   Georgios C. Chasparis, et al.
0

This paper considers a class of discrete-time reinforcement-learning dynamics and provides a stochastic-stability analysis in repeatedly played positive-utility (strategic-form) games. For this class of dynamics, convergence to pure Nash equilibria has been demonstrated only for the fine class of potential games. Prior work primarily provides convergence properties through stochastic approximations, where the asymptotic behavior can be associated with the limit points of an ordinary-differential equation (ODE). However, analyzing global convergence through an ODE-approximation requires the existence of a Lyapunov or a potential function, which naturally restricts the analysis to a fine class of games. To overcome these limitations, this paper introduces an alternative framework for analyzing convergence under reinforcement learning that is based upon an explicit characterization of the invariant probability measure of the induced Markov chain. We further provide a methodology for computing the invariant probability measure in positive-utility games, together with an illustration in the context of coordination games.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/18/2017

Stochastic Stability of Perturbed Learning Automata in Positive-Utility Games

This paper considers a class of reinforcement-based learning (namely, pe...
research
03/07/2018

Aspiration-based Perturbed Learning Automata

This paper introduces a novel payoff-based learning scheme for distribut...
research
06/29/2018

Learning with minimal information in continuous games

We introduce a stochastic learning process called the dampened gradient ...
research
04/04/2023

On the coordination efficiency of strategic multi-agent robotic teams

We study the problem of achieving decentralized coordination by a group ...
research
08/13/2018

On Passivity, Reinforcement Learning and Higher-Order Learning in Multi-Agent Finite Games

In this paper, we propose a passivity-based methodology for analysis and...
research
09/14/2022

Stability and bifurcations in transportation networks with heterogeneous users

A critical aspect in strategic modeling of transportation systems is use...
research
09/30/2012

On The Convergence of a Nash Seeking Algorithm with Stochastic State Dependent Payoff

Distributed strategic learning has been getting attention in recent year...

Please sign up or login with your details

Forgot password? Click here to reset