Competing Bandits in Time Varying Matching Markets

10/21/2022
by   Deepan Muthirayan, et al.
0

We study the problem of online learning in two-sided non-stationary matching markets, where the objective is to converge to a stable match. In particular, we consider the setting where one side of the market, the arms, has fixed known set of preferences over the other side, the players. While this problem has been studied when the players have fixed but unknown preferences, in this work we study the problem of how to learn when the preferences of the players are time varying and unknown. Our contribution is a methodology that can handle any type of preference structure and variation scenario. We show that, with the proposed algorithm, each player receives a uniform sub-linear regret of 𝒪(L^1/2_TT^1/2) up to the number of changes in the underlying preferences of the agents, L_T. Therefore, we show that the optimal rates for single-agent learning can be achieved in spite of the competition up to a difference of a constant factor. We also discuss extensions of this algorithm to the case where the number of changes need not be known a priori.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/14/2020

Bandit Learning in Decentralized Matching Markets

We study two-sided matching markets in which one side of the market (the...
research
06/06/2022

Decentralized, Communication- and Coordination-free Learning in Structured Matching Markets

We study the problem of online learning in competitive settings in the c...
research
02/13/2023

Converging to Stability in Two-Sided Bandits: The Case of Unknown Preferences on Both Sides of a Matching Market

We study the problem of repeated two-sided matching with uncertain prefe...
research
05/31/2022

Decentralized Competing Bandits in Non-Stationary Matching Markets

Understanding complex dynamics of two-sided online matching markets, whe...
research
06/12/2019

Competing Bandits in Matching Markets

Stable matching, a classical model for two-sided markets, has long been ...
research
06/06/2023

Online Learning under Adversarial Nonlinear Constraints

In many applications, learning systems are required to process continuou...
research
07/20/2023

Player-optimal Stable Regret for Bandit Learning in Matching Markets

The problem of matching markets has been studied for a long time in the ...

Please sign up or login with your details

Forgot password? Click here to reset