Poincaré-Bendixson Limit Sets in Multi-Agent Learning

01/29/2021
by   Aleksander Czechowski, et al.
0

A key challenge of evolutionary game theory and multi-agent learning is to characterize the limit behaviour of game dynamics. Whereas convergence is often a property of learning algorithms in games satisfying a particular reward structure (e.g. zero-sum), it is well known, that for general payoffs even basic learning models, such as the replicator dynamics, are not guaranteed to converge. Worse yet, chaotic behavior is possible even in rather simple games, such as variants of Rock-Paper-Scissors games (Sato et al., 2002). Although chaotic behavior in learning dynamics can be precluded by the celebrated Poincaré-Bendixson theorem, it is only applicable to low-dimensional settings. Are there other characteristics of a game, which can force regularity in the limit sets of learning? In this paper, we show that behaviors consistent with the Poincaré-Bendixson theorem (limit cycles, but no chaotic attractor) follows purely based on the topological structure of the interaction graph, even for high-dimensional settings with arbitrary number of players and arbitrary payoff matrices. We prove our result for a wide class of follow-the-regularized leader (FoReL) dynamics, which generalize replicator dynamics, for games where each player has two strategies at disposal, and for interaction graphs where payoffs of each agent are only affected by one other agent (i.e. interaction graphs of indegree one). Since chaos has been observed in a game with only two players and three strategies, this class of non-chaotic games is in a sense maximal. Moreover, we provide simple conditions under which such behavior translates to social welfare guarantees, implying that FoReL learning achieves time average social welfare which is at least as good as that of a Nash equilibrium; and connecting the topology of the dynamics to the Price of Anarchy analysis.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/26/2023

Stability of Multi-Agent Learning: Convergence in Network Games with Many Players

The behaviour of multi-agent learning in many player games has been show...
research
10/03/2019

From Darwin to Poincaré and von Neumann: Recurrence and Cycles in Evolutionary and Algorithmic Game Theory

Replicator dynamics, the continuous-time analogue of Multiplicative Weig...
research
05/30/2023

Steering control of payoff-maximizing players in adaptive learning dynamics

Evolutionary game theory provides a mathematical foundation for cross-di...
research
01/23/2023

Asymptotic Convergence and Performance of Multi-Agent Q-Learning Dynamics

Achieving convergence of multiple learning agents in general N-player ga...
research
09/19/2018

Deterministic limit of temporal difference reinforcement learning for stochastic games

Reinforcement learning in multi-agent systems has been studied in the fi...
research
05/26/2020

On the Impossibility of Global Convergence in Multi-Loss Optimization

Under mild regularity conditions, gradient-based methods converge global...
research
08/06/2017

Empathy in Bimatrix Games

Although the definition of what empathetic preferences exactly are is st...

Please sign up or login with your details

Forgot password? Click here to reset