No-Regret Stateful Posted Pricing

05/04/2020
by   Yuval Emek, et al.

In this paper, a rather general online problem called dynamic resource allocation with capacity constraints (DRACC) is introduced and studied in the realm of posted price mechanisms. This problem subsumes several applications of stateful pricing, including but not limited to posted prices for online job scheduling. As the existing online learning techniques do not yield no-regret mechanisms for this problem, we develop a new online learning framework defined over deterministic Markov decision processes with dynamic state transition and reward functions. We then prove that if the Markov decision process is guaranteed to admit a dominant state in each round and there exists an oracle that can switch the internal state with bounded loss, a condition that is satisfied in the DRACC problem, then the online learning problem can be solved with vanishing regret. Our proof technique is based on a reduction to full information online learning with switching cost (Kalai and Vempala, 2005), in which an online decision maker incurs an extra cost every time she switches from one arm to another. We demonstrate this connection formally, and further show how DRACC can be used in our proposed applications of stateful pricing.
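To make the target of the reduction concrete, the following is a minimal sketch of full-information online learning with switching cost. It is not the mechanism from the paper and not the Kalai and Vempala algorithm; it swaps in a simpler, standard idea (Hedge with the choice resampled only at block boundaries) to show how a learner can limit the number of switches while still tracking the best fixed choice. The function name `blocked_hedge` and the parameters `block_len`, `eta`, and `switch_cost` are assumptions made for this illustration.

```python
import math
import random


def blocked_hedge(losses, block_len, eta, switch_cost=1.0, seed=0):
    """Illustrative blocked Hedge (hypothetical helper, not from the paper).

    losses[t][i] is the loss in [0, 1] of option i in round t. All losses of
    a round are revealed after the round (full information). The learner may
    change its option only at the start of a block of `block_len` rounds, so
    it switches at most ceil(T / block_len) - 1 times.
    Returns (total loss plus switching cost, number of switches).
    """
    rng = random.Random(seed)
    n = len(losses[0])
    cum = [0.0] * n       # cumulative loss of every option seen so far
    current = None        # option currently being played
    total_loss = 0.0
    switches = 0

    for t, round_losses in enumerate(losses):
        if t % block_len == 0:
            # Resample from exponential weights; subtracting the minimum
            # keeps the exponentials numerically stable.
            m = min(cum)
            weights = [math.exp(-eta * (c - m)) for c in cum]
            choice = rng.choices(range(n), weights=weights)[0]
            if current is not None and choice != current:
                switches += 1
            current = choice

        # Pay the loss of the held option, then observe all losses.
        total_loss += round_losses[current]
        for i in range(n):
            cum[i] += round_losses[i]

    return total_loss + switch_cost * switches, switches


if __name__ == "__main__":
    # Toy instance: option 0 is always better than option 1.
    toy_losses = [[0.0, 1.0] for _ in range(1000)]
    cost, switches = blocked_hedge(toy_losses, block_len=10, eta=0.5)
    print("total cost:", cost, "switches:", switches)
```

Longer blocks reduce the switching cost but delay the learner's reaction to new information; balancing the two is the usual trade-off in this setting.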


research
05/04/2020

Stateful Posted Pricing with Vanishing Regret via Dynamic Deterministic Markov Decision Processes

In this paper, a rather general online problem called dynamic resource a...
research
06/08/2021

Efficient Online Learning for Dynamic k-Clustering

We study dynamic clustering problems from the perspective of online lear...
research
02/20/2021

Logarithmic Regret in Feature-based Dynamic Pricing

Feature-based dynamic pricing is an increasingly popular model of settin...
research
06/14/2019

Online Allocation and Pricing: Constant Regret via Bellman Inequalities

We develop a framework for designing tractable heuristics for Markov Dec...
research
09/07/2020

An online learning approach to dynamic pricing and capacity sizing in service systems

We study a dynamic pricing and capacity sizing problem in a GI/GI/1 queu...
research
02/09/2022

Online Learning to Transport via the Minimal Selection Principle

Motivated by robust dynamic resource allocation in operations research, ...
research
09/25/2018

Fully Implicit Online Learning

Regularized online learning is widely used in machine learning. In this ...
