On Passivity, Reinforcement Learning and Higher-Order Learning in Multi-Agent Finite Games

08/13/2018
by   Bolin Gao, et al.
0

In this paper, we propose a passivity-based methodology for analysis and design of reinforcement learning in multi-agent finite games. Starting from a known exponentially-discounted reinforcement learning scheme, we show that convergence to a Nash distribution can be shown in the class of games characterized by the monotonicity property of their (negative) payoff. We further exploit passivity to propose a class of higher-order schemes that preserve convergence properties, can improve the speed of convergence and can even converge in cases whereby their first-order counterpart fail to converge. We demonstrate these properties through numerical simulations for several representative games.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/16/2023

Decentralized Multi-Agent Reinforcement Learning for Continuous-Space Stochastic Games

Stochastic games are a popular framework for studying multi-agent reinfo...
research
07/19/2017

On Best-Response Dynamics in Potential Games

The paper studies the convergence properties of (continuous) best-respon...
research
02/23/2020

Finite-Time Last-Iterate Convergence for Multi-Agent Learning in Games

We consider multi-agent learning via online gradient descent (OGD) in a ...
research
04/04/2023

Off-Policy Action Anticipation in Multi-Agent Reinforcement Learning

Learning anticipation in Multi-Agent Reinforcement Learning (MARL) is a ...
research
09/05/2020

PAC Reinforcement Learning Algorithm for General-Sum Markov Games

This paper presents a theoretical framework for probably approximately c...
research
09/18/2017

Stochastic Stability of Reinforcement Learning in Positive-Utility Games

This paper considers a class of discrete-time reinforcement-learning dyn...
research
03/16/2022

Backpropagation through Time and Space: Learning Numerical Methods with Multi-Agent Reinforcement Learning

We introduce Backpropagation Through Time and Space (BPTTS), a method fo...

Please sign up or login with your details

Forgot password? Click here to reset