Instance-dependent Sample Complexity Bounds for Zero-sum Matrix Games

03/19/2023
by   Arnab Maiti, et al.
0

We study the sample complexity of identifying an approximate equilibrium for two-player zero-sum n× 2 matrix games. That is, in a sequence of repeated game plays, how many rounds must the two players play before reaching an approximate equilibrium (e.g., Nash)? We derive instance-dependent bounds that define an ordering over game matrices that captures the intuition that the dynamics of some games converge faster than others. Specifically, we consider a stochastic observation model such that when the two players choose actions i and j, respectively, they both observe each other's played actions and a stochastic observation X_ij such that 𝔼[ X_ij] = A_ij. To our knowledge, our work is the first case of instance-dependent lower bounds on the number of rounds the players must play before reaching an approximate equilibrium in the sense that the number of rounds depends on the specific properties of the game matrix A as well as the desired accuracy. We also prove a converse statement: there exist player strategies that achieve this lower bound.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/17/2019

Competing Against Equilibria in Zero-Sum Games with Evolving Payoffs

We study the problem of repeated play in a zero-sum game in which the pa...
research
10/12/2021

Intelligent Players in a Fictitious Play Framework

Fictitious play is a popular learning algorithm in which players that ut...
research
11/19/2019

Fictitious Play: Convergence, Smoothness, and Optimism

We consider the dynamics of two-player zero-sum games, with the goal of ...
research
03/03/2023

A Finite-Sample Analysis of Payoff-Based Independent Learning in Zero-Sum Stochastic Games

We study two-player zero-sum stochastic games, and propose a form of ind...
research
06/22/2020

Near-Optimal Reinforcement Learning with Self-Play

This paper considers the problem of designing optimal algorithms for rei...
research
12/23/2022

Adapting to game trees in zero-sum imperfect information games

Imperfect information games (IIG) are games in which each player only pa...
research
12/12/2021

On the Heterogeneity of Independent Learning Dynamics in Zero-sum Stochastic Games

We analyze the convergence properties of the two-timescale fictitious pl...

Please sign up or login with your details

Forgot password? Click here to reset