The Bayesian Prophet: A Low-Regret Framework for Online Decision Making

01/15/2019
by   Alberto Vera, et al.
0

Motivated by the success of using black-box predictive algorithms as subroutines for online decision-making, we develop a new framework for designing online policies given access to an oracle providing statistical information about an offline benchmark. Having access to such prediction oracles enables simple and natural Bayesian selection policies, and raises the question as to how these policies perform in different settings. Our work makes two important contributions towards tackling this question: First, we develop a general technique we call *compensated coupling* which can be used to derive bounds on the expected regret (i.e., additive loss with respect to a benchmark) for any online policy and offline benchmark; Second, using this technique, we show that the Bayes Selector has constant expected regret (i.e., independent of the number of arrivals and resource levels) in any online packing and matching problem with a finite type-space. Our results generalize and simplify many existing results for online packing and matching problems, and suggest a promising pathway for obtaining oracle-driven policies for other online decision-making settings.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/14/2019

Online Allocation and Pricing: Constant Regret via Bellman Inequalities

We develop a framework for designing tractable heuristics for Markov Dec...
research
02/10/2023

Oracle-Efficient Smoothed Online Learning for Piecewise Continuous Decision Making

Smoothed online learning has emerged as a popular framework to mitigate ...
research
02/18/2021

Online Learning via Offline Greedy Algorithms: Applications in Market Design and Optimization

Motivated by online decision-making in time-varying combinatorial enviro...
research
12/08/2021

Greedy Algorithm for Multiway Matching with Bounded Regret

In this paper we prove the efficacy of a simple greedy algorithm for a f...
research
11/27/2021

Scheduling Appointments Online: The Power of Deferred Decision-Making

The recently introduced online Minimum Peak Appointment Scheduling (MPAS...
research
06/14/2012

On Local Regret

Online learning aims to perform nearly as well as the best hypothesis in...
research
02/16/2023

Quality vs. Quantity of Data in Contextual Decision-Making: Exact Analysis under Newsvendor Loss

When building datasets, one needs to invest time, money and energy to ei...

Please sign up or login with your details

Forgot password? Click here to reset