Bandit Learning in Decentralized Matching Markets

12/14/2020
by   Lydia T. Liu, et al.
13

We study two-sided matching markets in which one side of the market (the players) does not have a priori knowledge about its preferences for the other side (the arms) and is required to learn its preferences from experience. Also, we assume the players have no direct means of communication. This model extends the standard stochastic multi-armed bandit framework to a decentralized multiple player setting with competition. We introduce a new algorithm for this setting that, over a time horizon T, attains 𝒪(log(T)) stable regret when preferences of the arms over players are shared, and 𝒪(log(T)^2) regret when there are no assumptions on the preferences on either side. Moreover, in the setting where a single player may deviate, we show that the algorithm is incentive compatible whenever the arms' preferences are shared, but not necessarily so when preferences are fully general.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/12/2019

Competing Bandits in Matching Markets

Stable matching, a classical model for two-sided markets, has long been ...
research
07/20/2023

Player-optimal Stable Regret for Bandit Learning in Matching Markets

The problem of matching markets has been studied for a long time in the ...
research
02/11/2021

Regret, stability, and fairness in matching markets with bandit learners

We consider the two-sided matching market with bandit learners. In the s...
research
10/21/2022

Competing Bandits in Time Varying Matching Markets

We study the problem of online learning in two-sided non-stationary matc...
research
03/12/2021

Beyond log^2(T) Regret for Decentralized Bandits in Matching Markets

We design decentralized algorithms for regret minimization in the two-si...
research
06/06/2022

Decentralized, Communication- and Coordination-free Learning in Structured Matching Markets

We study the problem of online learning in competitive settings in the c...
research
02/13/2023

Converging to Stability in Two-Sided Bandits: The Case of Unknown Preferences on Both Sides of a Matching Market

We study the problem of repeated two-sided matching with uncertain prefe...

Please sign up or login with your details

Forgot password? Click here to reset