The Multisecretary problem with many types

05/18/2022
by   Omar Besbes, et al.
0

We study the multisecretary problem with capacity to hire up to B out of T candidates, and values drawn i.i.d. from a distribution F on [0,1]. We investigate achievable regret performance, where the latter is defined as the difference between the performance of an oracle with perfect information of future types (values) and an online policy. While the case of distributions over a few discrete types is well understood, very little is known when there are many types, with the exception of the special case of a uniform distribution of types. In this work we consider a larger class of distributions which includes the few discrete types as a special case. We first establish the insufficiency of the common certainty equivalent heuristic for distributions with many types and "gaps" (intervals) of absent types; even for simple deviations from the uniform distribution, it leads to regret Θ(√(T)), as large as that of a non-adaptive algorithm. We introduce a new algorithmic principle which we call "conservativeness with respect to gaps" (CwG), and use it to design an algorithm that applies to any distribution. We establish that the proposed algorithm yields optimal regret scaling of Θ̃(T^1/2 - 1/(2(β + 1))) for a broad class of distributions with gaps, where β quantifies the mass accumulation of types around gaps. We recover constant regret scaling for the special case of a bounded number of types (β=0 in this case). In most practical network revenue management problems, the number of types is large and the current certainty equivalent heuristics scale poorly with the number of types. The new algorithmic principle called Conservatism w.r.t Gaps (CwG) that we developed, can pave the way for progress on handling many types for the broader class of network revenue management problems like order fulfillment and online matching.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/16/2019

Does the Multisecretary Problem Always Have Bounded Regret?

Arlotto and Gurvich (2019) showed that the regret in the multisecretary ...
research
11/04/2019

Network Revenue Management with Limited Switches: Known and Unknown Demand Distributions

This work is motivated by a practical concern from our retail partner. W...
research
10/14/2022

Degeneracy is OK: Logarithmic Regret for Network Revenue Management with Indiscrete Distributions

We study the classical Network Revenue Management (NRM) problem with acc...
research
01/23/2021

Optimistic and Adaptive Lagrangian Hedging

In online learning an algorithm plays against an environment with losses...
research
12/14/2015

Fighting Bandits with a New Kind of Smoothness

We define a novel family of algorithms for the adversarial multi-armed b...
research
10/11/2018

Fighting Contextual Bandits with Stochastic Smoothing

We introduce a new stochastic smoothing perspective to study adversarial...
research
10/18/2016

Dynamic Assortment Personalization in High Dimensions

We study the problem of dynamic assortment personalization with large, h...

Please sign up or login with your details

Forgot password? Click here to reset