Matching While Learning

03/15/2016
by   Ramesh Johari, et al.
0

We consider the problem faced by a service platform that needs to match supply with demand, but also to learn attributes of new arrivals in order to match them better in the future. We introduce a benchmark model with heterogeneous workers and jobs that arrive over time. Job types are known to the platform, but worker types are unknown and must be learned by observing match outcomes. Workers depart after performing a certain number of jobs. The payoff from a match depends on the pair of types and the goal is to maximize the steady-state rate of accumulation of payoff. Our main contribution is a complete characterization of the structure of the optimal policy in the limit that each worker performs many jobs. The platform faces a trade-off for each worker between myopically maximizing payoffs (exploitation) and learning the type of the worker (exploration). This creates a multitude of multi-armed bandit problems, one for each worker, coupled together by the constraint on the availability of jobs of different types (capacity constraints). We find that the platform should estimate a shadow price for each job type, and use the payoffs adjusted by these prices, first, to determine its learning goals and then, for each worker, (i) to balance learning with payoffs during the "exploration phase", and (ii) to myopically match after it has achieved its learning goals during the "exploitation phase."

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/21/2020

Optimal Growth in Repeated Matching Platforms: Options versus Adoption

We study the design of a decentralized platform in which workers and job...
research
09/18/2018

Exploration vs. Exploitation in Team Formation

An online labor platform faces an online learning problem in matching wo...
research
06/08/2021

Job Dispatching Policies for Queueing Systems with Unknown Service Rates

In multi-server queueing systems where there is no central queue holding...
research
02/04/2021

Matching Impatient and Heterogeneous Demand and Supply

Service platforms must determine rules for matching heterogeneous demand...
research
05/05/2018

DISPATCH: An Optimal Algorithm for Online Perfect Bipartite Matching with i.i.d. Arrivals

This work presents the first algorithm for the problem of weighted onlin...
research
03/19/2018

On Optimal Pricing of Services in On-demand Labor Platforms

I consider the optimal pricing problem faced by a freelance worker on an...
research
01/17/2023

A Semi-supervised Sensing Rate Learning based CMAB Scheme to Combat COVID-19 by Trustful Data Collection in the Crowd

Mobile CrowdSensing (MCS), through employing considerable workers to sen...

Please sign up or login with your details

Forgot password? Click here to reset