Kernelized Multiplicative Weights for 0/1-Polyhedral Games: Bridging the Gap Between Learning in Extensive-Form and Normal-Form Games

02/01/2022
by Gabriele Farina, et al.

While extensive-form games (EFGs) can be converted into normal-form games (NFGs), doing so comes at the cost of an exponential blowup of the strategy space. So, progress on NFGs and EFGs has historically followed separate tracks, with the EFG community often having to catch up with advances (e.g., last-iterate convergence and predictive regret bounds) from the larger NFG community. In this paper we show that the Optimistic Multiplicative Weights Update (OMWU) algorithm – the premier learning algorithm for NFGs – can be simulated on the normal-form equivalent of an EFG in linear time per iteration in the game tree size using a kernel trick. The resulting algorithm, Kernelized OMWU (KOMWU), applies more broadly to all convex games whose strategy space is a polytope with 0/1 integral vertices, as long as the kernel can be evaluated efficiently. In the particular case of EFGs, KOMWU closes several standing gaps between NFG and EFG learning, by enabling direct, black-box transfer to EFGs of desirable properties of learning dynamics that were so far known to be achievable only in NFGs. Specifically, KOMWU gives the first algorithm that guarantees at the same time last-iterate convergence, lower dependence on the size of the game tree than all prior algorithms, and 𝒪̃(1) regret when followed by all players.
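The kernel idea in the abstract can be illustrated on a toy 0/1 polytope. The sketch below is not the paper's implementation. It assumes the kernel is defined as the sum, over the polytope's 0/1 vertices v, of the product of x[k]*y[k] over coordinates with v[k] = 1, and it uses the hypercube {0,1}^d as the strategy set because that kernel has the closed form prod_k (1 + x[k]*y[k]). All function names are hypothetical. The code runs one OMWU step over the exponentially many vertices by brute force, then reproduces the same iterate from d + 1 kernel evaluations.

```python
import itertools
import math


def generic_kernel(x, y, vertices):
    """Assumed kernel: sum over vertices v of prod_{k: v[k]=1} x[k]*y[k]."""
    total = 0.0
    for v in vertices:
        term = 1.0
        for k, bit in enumerate(v):
            if bit:
                term *= x[k] * y[k]
        total += term
    return total


def hypercube_kernel(x, y):
    """Closed form of the kernel above when the vertex set is {0,1}^d."""
    prod = 1.0
    for xk, yk in zip(x, y):
        prod *= 1.0 + xk * yk
    return prod


def omwu_bruteforce(cum_loss, last_loss, eta, vertices):
    """OMWU over the vertices: weight(v) ~ exp(-eta * <cum_loss + last_loss, v>).

    Returns the expected vertex, i.e. the induced point in the polytope.
    Cost is proportional to the number of vertices (exponential in d here).
    """
    d = len(cum_loss)
    optimistic = [c + m for c, m in zip(cum_loss, last_loss)]
    weights = [
        math.exp(-eta * sum(o * bit for o, bit in zip(optimistic, v)))
        for v in vertices
    ]
    z = sum(weights)
    return [sum(w * v[k] for w, v in zip(weights, vertices)) / z for k in range(d)]


def omwu_kernelized(cum_loss, last_loss, eta, kernel_fn):
    """Same iterate from d + 1 kernel evaluations.

    With b[k] = exp(-eta * (cum_loss[k] + last_loss[k])), coordinate k equals
    1 - K(b, ones with a 0 at k) / K(b, ones).
    """
    d = len(cum_loss)
    b = [math.exp(-eta * (c + m)) for c, m in zip(cum_loss, last_loss)]
    ones = [1.0] * d
    denom = kernel_fn(b, ones)
    iterate = []
    for k in range(d):
        e_bar = ones.copy()
        e_bar[k] = 0.0  # drops every vertex that uses coordinate k
        iterate.append(1.0 - kernel_fn(b, e_bar) / denom)
    return iterate


# Illustrative data only: d = 3, arbitrary losses and step size.
vertices = list(itertools.product([0, 1], repeat=3))
cum_loss = [0.3, -1.2, 0.7]
last_loss = [0.1, 0.0, -0.4]
eta = 0.5

slow = omwu_bruteforce(cum_loss, last_loss, eta, vertices)
fast = omwu_kernelized(cum_loss, last_loss, eta, hypercube_kernel)
print(slow)
print(fast)
```

This naive version calls the kernel d + 1 times and so costs O(d^2) on the hypercube. The linear per-iteration time claimed for EFGs relies on sharing work across those evaluations, which this sketch does not attempt.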


research
06/17/2022

Near-Optimal No-Regret Learning for General Convex Games

A recent line of work has established uncoupled learning dynamics such t...
research
09/15/2020

Finding and Certifying (Near-)Optimal Strategies in Black-Box Extensive-Form Games

Often – for example in war games, strategy video games, and financial si...
research
04/04/2021

Simple Uncoupled No-Regret Learning Dynamics for Extensive-Form Correlated Equilibrium

The existence of simple uncoupled no-regret learning dynamics that conve...
research
11/29/2021

Optimal No-Regret Learning in General Games: Bounded Regret with Unbounded Step-Sizes via Clairvoyant MWU

In this paper we solve the problem of no-regret learning in general game...
research
05/24/2023

Regret Matching+: (In)Stability and Fast Convergence in Games

Regret Matching+ (RM+) and its variants are important algorithms for sol...
research
04/11/2022

Equilibrium Finding in Normal-Form Games Via Greedy Regret Minimization

We extend the classic regret minimization framework for approximating eq...
research
06/08/2023

Steering No-Regret Learners to Optimal Equilibria

We consider the problem of steering no-regret-learning agents to play de...
