Taming the Exponential Action Set: Sublinear Regret and Fast Convergence to Nash Equilibrium in Online Congestion Games

06/19/2023
by   Jing Dong, et al.
0

The congestion game is a powerful model that encompasses a range of engineering systems such as traffic networks and resource allocation. It describes the behavior of a group of agents who share a common set of F facilities and take actions as subsets with k facilities. In this work, we study the online formulation of congestion games, where agents participate in the game repeatedly and observe feedback with randomness. We propose CongestEXP, a decentralized algorithm that applies the classic exponential weights method. By maintaining weights on the facility level, the regret bound of CongestEXP avoids the exponential dependence on the size of possible facility sets, i.e., Fk≈ F^k, and scales only linearly with F. Specifically, we show that CongestEXP attains a regret upper bound of O(kF√(T)) for every individual player, where T is the time horizon. On the other hand, exploiting the exponential growth of weights enables CongestEXP to achieve a fast convergence rate. If a strict Nash equilibrium exists, we show that CongestEXP can converge to the strict Nash policy almost exponentially fast in O(Fexp(-t^1-α)), where t is the number of iterations and α∈ (1/2, 1).

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/27/2023

Semi Bandit Dynamics in Congestion Games: Convergence to Nash Equilibrium and No-Regret Guarantees

In this work, we introduce a new variant of online gradient descent, whi...
research
10/19/2020

No-regret learning and mixed Nash equilibria: They do not mix

Understanding the behavior of no-regret dynamics in general N-player gam...
research
06/15/2019

Learning in Cournot Games with Limited Information Feedback

In this work, we study the interaction of strategic players in continuou...
research
02/15/2021

Almost Optimal Algorithms for Two-player Markov Games with Linear Function Approximation

We study reinforcement learning for two-player zero-sum Markov games wit...
research
10/21/2020

Coordinated Online Learning for Multi-Agent Systems with Coupled Constraints and Perturbed Utility Observations

Competitive non-cooperative online decision-making agents whose actions ...
research
01/26/2022

Unpredictable dynamics in congestion games: memory loss can prevent chaos

We study the dynamics of simple congestion games with two resources wher...
research
10/24/2022

Offline congestion games: How feedback type affects data coverage requirement

This paper investigates when one can efficiently recover an approximate ...

Please sign up or login with your details

Forgot password? Click here to reset