Logarithmic Regret in Feature-based Dynamic Pricing

02/20/2021
by   Jianyu Xu, et al.
0

Feature-based dynamic pricing is an increasingly popular model of setting prices for highly differentiated products with applications in digital marketing, online sales, real estate and so on. The problem was formally studied as an online learning problem (Cohen et al., 2016; Javanmard Nazerzadeh, 2019) where a seller needs to propose prices on the fly for a sequence of T products based on their features x while having a small regret relative to the best – "omniscient" – pricing strategy she could have come up with in hindsight. We revisit this problem and provide two algorithms (EMLP and ONSP) for stochastic and adversarial feature settings, respectively, and prove the optimal O(dlogT) regret bounds for both. In comparison, the best existing results are O(min{1/λ_min^2logT, √(T)}) and O(T^2/3) respectively, with λ_min being the smallest eigenvalue of 𝔼[xx^T] that could be arbitrarily close to 0. We also prove an Ω(√(T)) information-theoretic lower bound for a slightly more general setting, which demonstrates that "knowing-the-demand-curve" leads to an exponential improvement in feature-based dynamic pricing.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/27/2022

Towards Agnostic Feature-based Dynamic Pricing: Linear Policies vs Linear Valuation with Unknown Noise

In feature-based dynamic pricing, a seller sets appropriate prices for a...
research
11/17/2022

Dynamic Pricing with Volume Discounts in Online Settings

According to the main international reports, more pervasive industrial a...
research
09/23/2022

Doubly Fair Dynamic Pricing

We study the problem of online dynamic pricing with two types of fairnes...
research
05/04/2020

No-Regret Stateful Posted Pricing

In this paper, a rather general online problem called dynamic resource a...
research
05/04/2020

Stateful Posted Pricing with Vanishing Regret via Dynamic Deterministic Markov Decision Processes

In this paper, a rather general online problem called dynamic resource a...
research
11/03/2022

Phase Transitions in Learning and Earning under Price Protection Guarantee

Motivated by the prevalence of “price protection guarantee", which allow...
research
02/28/2019

Meta Dynamic Pricing: Learning Across Experiments

We study the problem of learning across a sequence of price experiments ...

Please sign up or login with your details

Forgot password? Click here to reset