Learning in time-varying games

09/10/2018
by   Benoit Duvocelle, et al.
0

In this paper, we examine the long-term behavior of regret-minimizing agents in time-varying games with continuous action spaces. In its most basic form, (external) regret minimization guarantees that an agent's cumulative payoff is no worse in the long run than that of the agent's best fixed action in hindsight. Going beyond this worst-case guarantee, we consider a dynamic regret variant that compares the agent's accrued rewards to those of any sequence of play. Specializing to a wide class of no-regret strategies based on mirror descent, we derive explicit rates of regret minimization relying only on imperfect gradient obvservations. We then leverage these results to show that players are able to stay close to Nash equilibrium in time-varying monotone games - and even converge to Nash equilibrium if the sequence of stage games admits a limit.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/26/2021

Adaptive Learning in Continuous Games: Optimal Regret Bounds and Convergence to Nash Equilibrium

In game-theoretic learning, several agents are simultaneously following ...
research
09/01/2023

Convergence Analysis of the Best Response Algorithm for Time-Varying Games

This paper studies a class of strongly monotone games involving non-coop...
research
11/28/2014

Solving Games with Functional Regret Estimation

We propose a novel online learning method for minimizing regret in large...
research
06/19/2020

Gradient-free Online Learning in Games with Delayed Rewards

Motivated by applications to online advertising and recommender systems,...
research
01/30/2022

No-Regret Learning in Time-Varying Zero-Sum Games

Learning from repeated play in a fixed two-player zero-sum game is a cla...
research
12/30/2021

From Behavioral Theories to Econometrics: Inferring Preferences of Human Agents from Data on Repeated Interactions

We consider the problem of estimating preferences of human agents from d...
research
12/06/2022

Regret Minimization with Dynamic Benchmarks in Repeated Games

In repeated games, strategies are often evaluated by their ability to gu...

Please sign up or login with your details

Forgot password? Click here to reset