Regret, stability, and fairness in matching markets with bandit learners

02/11/2021
by   Sarah H. Cen, et al.
3

We consider the two-sided matching market with bandit learners. In the standard matching problem, users and providers are matched to ensure incentive compatibility via the notion of stability. However, contrary to the core assumption of the matching problem, users and providers do not know their true preferences a priori and must learn them. To address this assumption, recent works propose to blend the matching and multi-armed bandit problems. They establish that it is possible to assign matchings that are stable (i.e., incentive-compatible) at every time step while also allowing agents to learn enough so that the system converges to matchings that are stable under the agents' true preferences. However, while some agents may incur low regret under these matchings, others can incur high regret – specifically, Ω(T) optimal regret where T is the time horizon. In this work, we incorporate costs and transfers in the two-sided matching market with bandit learners in order to faithfully model competition between agents. We prove that, under our framework, it is possible to simultaneously guarantee four desiderata: (1) incentive compatibility, i.e., stability, (2) low regret, i.e., O(log(T)) optimal regret, (3) fairness in the distribution of regret among agents, and (4) high social welfare.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/14/2020

Bandit Learning in Decentralized Matching Markets

We study two-sided matching markets in which one side of the market (the...
research
08/19/2021

Learning Equilibria in Matching Markets from Bandit Feedback

Large-scale, two-sided matching platforms must find market outcomes that...
research
01/24/2023

Double Matching Under Complementary Preferences

In this paper, we propose a new algorithm for addressing the problem of ...
research
03/07/2022

Learn to Match with No Regret: Reinforcement Learning in Markov Matching Markets

We study a Markov matching market involving a planner and a set of strat...
research
11/23/2022

Incentive-Aware Recommender Systems in Two-Sided Markets

Online platforms in the Internet Economy commonly incorporate recommende...
research
05/07/2022

Rate-Optimal Contextual Online Matching Bandit

Two-sided online matching platforms have been employed in various market...
research
09/20/2020

Almost Envy-free Repeated Matching in Two-sided Markets

A two-sided market consists of two sets of agents, each of whom have pre...

Please sign up or login with your details

Forgot password? Click here to reset