Steering No-Regret Learners to Optimal Equilibria

06/08/2023
by Brian Hu Zhang, et al.

We consider the problem of steering no-regret-learning agents to play desirable equilibria in extensive-form games via nonnegative payments. We show that steering is impossible if the total budget (across iterations) is finite. However, with average, realized payments converging to zero, we show that steering is possible. In the full-feedback setting, that is, when players' full strategies are observed at each timestep, it is possible with constant per-iteration payments. In the bandit-feedback setting, that is, when only trajectories through the game tree are observable, steering is impossible with constant per-iteration payments but possible if we allow the maximum per-iteration payment to grow with time, while maintaining the property that average, realized payments vanish. We supplement our theoretical positive results with experiments highlighting the efficacy of steering in large, extensive-form games, and show how our framework relates to optimal mechanism design and information design.
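The full-feedback claim can be illustrated with a toy simulation. The following is a minimal sketch and not the paper's construction: it assumes a 2x2 stag hunt (a normal-form game rather than an extensive-form one), two Hedge (multiplicative-weights) learners, and a hypothetical payment rule in which each player receives a nonnegative bonus for the target action that is bounded by `ALPHA` and shrinks as the opponent approaches the target. The names `PAYOFF`, `TARGET`, `ALPHA`, `hedge_strategy`, and `simulate` are all illustrative assumptions.

```python
import math

# Stag hunt payoffs for the row player (symmetric game); action 0 = Stag, 1 = Hare.
# (Stag, Stag) is the payoff-dominant equilibrium this sketch steers toward.
PAYOFF = [[4.0, 0.0],
          [3.0, 3.0]]
TARGET = 0   # hypothetical target action for both players
ALPHA = 4.0  # hypothetical payment scale; makes Stag dominant once payments are added


def hedge_strategy(cum_utils, eta):
    """Multiplicative-weights (Hedge) strategy from cumulative utilities."""
    m = max(cum_utils)  # subtract the max for numerical stability
    weights = [math.exp(eta * (u - m)) for u in cum_utils]
    total = sum(weights)
    return [w / total for w in weights]


def simulate(rounds=2000, eta=0.1, steer=True):
    """Return (last-round strategies, average total payment per round)."""
    # Both learners start with a head start for Hare, so unsteered play drifts to (Hare, Hare).
    cum = [[0.0, 5.0], [0.0, 5.0]]
    total_payment = 0.0
    strategies = None
    for _ in range(rounds):
        strategies = [hedge_strategy(c, eta) for c in cum]
        for i in (0, 1):
            opp = strategies[1 - i]
            # Expected game utility of each action against the opponent's mixed strategy.
            utils = [sum(PAYOFF[a][b] * opp[b] for b in (0, 1)) for a in (0, 1)]
            if steer:
                # Nonnegative bonus for the target action; it vanishes as the opponent reaches the target.
                bonus = ALPHA * (1.0 - opp[TARGET])
                utils[TARGET] += bonus
                total_payment += strategies[i][TARGET] * bonus
            for a in (0, 1):
                cum[i][a] += utils[a]
    return strategies, total_payment / rounds
```

Under these assumptions, calling `simulate(steer=False)` leaves both learners on Hare, while `simulate(steer=True)` moves them to Stag with per-round payments capped at `ALPHA` per player and an average payment that shrinks as `rounds` grows. The sketch does not cover the bandit-feedback setting or the finite-budget impossibility result.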


research
05/15/2022

Sample-Efficient Learning of Correlated Equilibria in Extensive-Form Games

Imperfect-Information Extensive-Form Games (IIEFGs) is a prevalent model...
research
06/08/2023

Computing Optimal Equilibria and Mechanisms via Learning in Zero-Sum Extensive-Form Games

We introduce a new approach for computing optimal equilibria via learnin...
research
07/11/2023

Polynomial-Time Linear-Swap Regret Minimization in Imperfect-Information Sequential Games

No-regret learners seek to minimize the difference between the loss they...
research
05/30/2022

Efficient Φ-Regret Minimization in Extensive-Form Games via Online Mirror Descent

A conceptually appealing approach for learning Extensive-Form Games (EFG...
research
10/05/2021

Stochastic Multiplicative Weights Updates in Zero-Sum Games

We study agents competing against each other in a repeated network zero-...
research
06/24/2022

Diegetic representation of feedback in open games

We improve the framework of open games with agency by showing how the pl...
research
02/01/2022

Kernelized Multiplicative Weights for 0/1-Polyhedral Games: Bridging the Gap Between Learning in Extensive-Form and Normal-Form Games

While extensive-form games (EFGs) can be converted into normal-form game...
