Learning in Congestion Games with Bandit Feedback

06/04/2022
by   Qiwen Cui, et al.
0

Learning Nash equilibria is a central problem in multi-agent systems. In this paper, we investigate congestion games, a class of games with benign theoretical structure and broad real-world applications. We first propose a centralized algorithm based on the optimism in the face of uncertainty principle for congestion games with (semi-)bandit feedback, and obtain finite-sample guarantees. Then we propose a decentralized algorithm via a novel combination of the Frank-Wolfe method and G-optimal design. By exploiting the structure of the congestion game, we show the sample complexity of both algorithms depends only polynomially on the number of players and the number of facilities, but not the size of the action set, which can be exponentially large in terms of the number of facilities. We further define a new problem class, Markov congestion games, which allows us to model the non-stationarity in congestion games. We propose a centralized algorithm for Markov congestion games, whose sample complexity again has only polynomial dependence on all relevant problem parameters, but not the size of the action set.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/27/2023

Semi Bandit Dynamics in Congestion Games: Convergence to Nash Equilibrium and No-Regret Guarantees

In this work, we introduce a new variant of online gradient descent, whi...
research
10/24/2022

Offline congestion games: How feedback type affects data coverage requirement

This paper investigates when one can efficiently recover an approximate ...
research
02/07/2023

Breaking the Curse of Multiagents in a Large State Space: RL in Markov Games with Independent Linear Function Approximation

We propose a new model, independent linear Markov game, for multi-agent ...
research
04/01/2020

Provable Sample Complexity Guarantees for Learning of Continuous-Action Graphical Games with Nonparametric Utilities

In this paper, we study the problem of learning the exact structure of c...
research
11/12/2021

Learning Pure Nash Equilibrium in Smart Charging Games

Reinforcement Learning Algorithms (RLA) are useful machine learning tool...
research
09/09/2019

Sensitivity Analysis for Markov Decision Process Congestion Games

We consider a non-atomic congestion game where each decision maker perfo...
research
01/19/2021

Improving DCTCP/Prague Congestion Control Responsiveness

This report explains how DCTCP introduces 1–2 rounds of unnecessary lag,...

Please sign up or login with your details

Forgot password? Click here to reset