Beyond Strict Competition: Approximate Convergence of Multi Agent Q-Learning Dynamics

07/26/2023
by   Aamal Hussain, et al.
0

The behaviour of multi-agent learning in competitive settings is often considered under the restrictive assumption of a zero-sum game. Only under this strict requirement is the behaviour of learning well understood; beyond this, learning dynamics can often display non-convergent behaviours which prevent fixed-point analysis. Nonetheless, many relevant competitive games do not satisfy the zero-sum assumption. Motivated by this, we study a smooth variant of Q-Learning, a popular reinforcement learning dynamics which balances the agents' tendency to maximise their payoffs with their propensity to explore the state space. We examine this dynamic in games which are `close' to network zero-sum games and find that Q-Learning converges to a neighbourhood around a unique equilibrium. The size of the neighbourhood is determined by the `distance' to the zero-sum game, as well as the exploration rates of the agents. We complement these results by providing a method whereby, given an arbitrary network game, the `nearest' network zero-sum game can be found efficiently. As our experiments show, these guarantees are independent of whether the dynamics ultimately reach an equilibrium, or remain non-convergent.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/24/2021

Exploration-Exploitation in Multi-Agent Competition: Convergence with Bounded Rationality

The interplay between exploration and exploitation in competitive multi-...
research
07/26/2023

Stability of Multi-Agent Learning: Convergence in Network Games with Many Players

The behaviour of multi-agent learning in many player games has been show...
research
01/23/2023

Asymptotic Convergence and Performance of Multi-Agent Q-Learning Dynamics

Achieving convergence of multiple learning agents in general N-player ga...
research
11/20/2018

Stable Opponent Shaping in Differentiable Games

A growing number of learning methods are actually games which optimise m...
research
05/07/2018

What game are we playing? End-to-end learning in normal and extensive form games

Although recent work in AI has made great progress in solving large, zer...
research
11/05/2021

Online Learning in Periodic Zero-Sum Games

A seminal result in game theory is von Neumann's minmax theorem, which s...
research
09/08/2017

Cycles in adversarial regularized learning

Regularized learning is a fundamental technique in online optimization, ...

Please sign up or login with your details

Forgot password? Click here to reset