Learning Equilibria in Matching Markets from Bandit Feedback

08/19/2021
by   Meena Jagadeesan, et al.
2

Large-scale, two-sided matching platforms must find market outcomes that align with user preferences while simultaneously learning these preferences from data. However, since preferences are inherently uncertain during learning, the classical notion of stability (Gale and Shapley, 1962; Shapley and Shubik, 1971) is unattainable in these settings. To bridge this gap, we develop a framework and algorithms for learning stable market outcomes under uncertainty. Our primary setting is matching with transferable utilities, where the platform both matches agents and sets monetary transfers between them. We design an incentive-aware learning objective that captures the distance of a market outcome from equilibrium. Using this objective, we analyze the complexity of learning as a function of preference structure, casting learning as a stochastic multi-armed bandit problem. Algorithmically, we show that "optimism in the face of uncertainty," the principle underlying many bandit algorithms, applies to a primal-dual formulation of matching with transfers and leads to near-optimal regret bounds. Our work takes a first step toward elucidating when and how stable matchings arise in large, data-driven marketplaces.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/11/2021

Regret, stability, and fairness in matching markets with bandit learners

We consider the two-sided matching market with bandit learners. In the s...
research
08/30/2022

Competition, Alignment, and Equilibria in Digital Marketplaces

Competition between traditional platforms is known to improve user utili...
research
05/07/2022

Rate-Optimal Contextual Online Matching Bandit

Two-sided online matching platforms have been employed in various market...
research
01/24/2023

Double Matching Under Complementary Preferences

In this paper, we propose a new algorithm for addressing the problem of ...
research
07/23/2017

Preference Reasoning in Matching Procedures: Application to the Admission Post-Baccalaureat Platform

Because preferences naturally arise and play an important role in many r...
research
03/07/2022

Learn to Match with No Regret: Reinforcement Learning in Markov Matching Markets

We study a Markov matching market involving a planner and a set of strat...
research
06/11/2018

The Instability of Stable Matchings: The Influence of One Strategic Agent on The Matching Market

Consider a matching problem with n men and n women, with preferences dra...

Please sign up or login with your details

Forgot password? Click here to reset