Learning to Play Sequential Games versus Unknown Opponents

07/10/2020
by   Pier Giuseppe Sessa, et al.
0

We consider a repeated sequential game between a learner, who plays first, and an opponent who responds to the chosen action. We seek to design strategies for the learner to successfully interact with the opponent. While most previous approaches consider known opponent models, we focus on the setting in which the opponent's model is unknown. To this end, we use kernel-based regularity assumptions to capture and exploit the structure in the opponent's response. We propose a novel algorithm for the learner when playing against an adversarial sequence of opponents. The algorithm combines ideas from bilevel optimization and online learning to effectively balance between exploration (learning about the opponent's model) and exploitation (selecting highly rewarding actions for the learner). Our results include algorithm's regret guarantees that depend on the regularity of the opponent's response and scale sublinearly with the number of game rounds. Moreover, we specialize our approach to repeated Stackelberg games, and empirically demonstrate its effectiveness in a traffic routing and wildlife conservation task

READ FULL TEXT
research
02/10/2020

Playing to Learn Better: Repeated Games for Adversarial Learning with Multiple Classifiers

We consider the problem of prediction by a machine learning algorithm, c...
research
09/18/2019

No-Regret Learning in Unknown Games with Correlated Payoffs

We consider the problem of learning to play a repeated multi-agent game ...
research
05/17/2022

Strategizing against Learners in Bayesian Games

We study repeated two-player games where one of the players, the learner...
research
07/13/2021

Contextual Games: Multi-Agent Learning with Side Information

We formulate the novel class of contextual games, a type of repeated gam...
research
07/28/2021

Efficient Episodic Learning of Nonstationary and Unknown Zero-Sum Games Using Expert Game Ensembles

Game theory provides essential analysis in many applications of strategi...
research
07/03/2003

BL-WoLF: A Framework For Loss-Bounded Learnability In Zero-Sum Games

We present BL-WoLF, a framework for learnability in repeated zero-sum ga...
research
11/19/2019

Path Planning Problems with Side Observations-When Colonels Play Hide-and-Seek

Resource allocation games such as the famous Colonel Blotto (CB) and Hid...

Please sign up or login with your details

Forgot password? Click here to reset