Degeneracy is OK: Logarithmic Regret for Network Revenue Management with Indiscrete Distributions

10/14/2022
by   Jiashuo Jiang, et al.
0

We study the classical Network Revenue Management (NRM) problem with accept/reject decisions and T IID arrivals. We consider a distributional form where each arrival must fall under a finite number of possible categories, each with a deterministic resource consumption vector, but a random value distributed continuously over an interval. We develop an online algorithm that achieves O(log^2 T) regret under this model, with no further assumptions. We develop another online algorithm that achieves an improved O(log T) regret, with only a second-order growth assumption. To our knowledge, these are the first results achieving logarithmic-level regret in a continuous-distribution NRM model without further “non-degeneracy” assumptions. Our results are achieved via new techniques including: a new method of bounding myopic regret, a “semi-fluid” relaxation of the offline allocation, and an improved bound on the “dual convergence”.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/27/2012

An Adaptive Algorithm for Finite Stochastic Partial Monitoring

We present a new anytime algorithm that achieves near-optimal regret for...
research
02/26/2015

A Chaining Algorithm for Online Nonparametric Regression

We consider the problem of online nonparametric regression with arbitrar...
research
07/22/2022

Fairness-aware Network Revenue Management with Demand Learning

In addition to maximizing the total revenue, decision-makers in lots of ...
research
05/18/2022

The Multisecretary problem with many types

We study the multisecretary problem with capacity to hire up to B out of...
research
02/15/2022

Damped Online Newton Step for Portfolio Selection

We revisit the classic online portfolio selection problem, where at each...
research
10/27/2021

Fairer LP-based Online Allocation

In this paper, we consider a Linear Program (LP)-based online resource a...
research
04/20/2021

Joint Online Learning and Decision-making via Dual Mirror Descent

We consider an online revenue maximization problem over a finite time ho...

Please sign up or login with your details

Forgot password? Click here to reset