Continuous Assortment Optimization with Logit Choice Probabilities under Incomplete Information

07/17/2018
by   Yannik Peeters, et al.
4

We consider assortment optimization in relation to a product for which a particular attribute can be continuously adjusted. Examples include the duration of a loan (where each duration corresponds to a specific interest rate) and the data limit for a cell phone subscription. The question to be addressed is: how should a retailer determine what to offer to maximize profit? Representing the assortment as a union of subintervals, the choice of a customer is modelled as a continuous logit choice model; a capacity constraint is imposed on the assortment. The problem can be phrased as a multi-armed bandit, i.e., the objective is to estimate demand over time by sequentially offering different assortments to incoming costumers. Kernel density estimation is applied to the observed purchases. We present an explore-then-exploit policy, which endures at most a regret of order T^2/3 (neglecting logarithmic factors). Also, by showing that any policy in the worst case must endure at least a regret of order T^2/3, we conclude that our policy can be regarded as asymptotically optimal.

READ FULL TEXT
research
07/22/2011

Robustness of Anytime Bandit Policies

This paper studies the deviations of the regret in a stochastic multi-ar...
research
12/28/2020

Lifelong Learning in Multi-Armed Bandits

Continuously learning and leveraging the knowledge accumulated from prio...
research
06/07/2022

A Simple and Optimal Policy Design with Safety against Heavy-tailed Risk for Multi-armed Bandits

We design new policies that ensure both worst-case optimality for expect...
research
04/10/2023

Regret Distribution in Stochastic Bandits: Optimal Trade-off between Expectation and Tail Risk

We study the trade-off between expectation and tail risk for regret dist...
research
06/11/2020

Grooming a Single Bandit Arm

The stochastic multi-armed bandit problem captures the fundamental explo...
research
10/16/2017

On the Hardness of Inventory Management with Censored Demand Data

We consider a repeated newsvendor problem where the inventory manager ha...
research
06/09/2020

Determination and estimation of optimal quarantine duration for infectious diseases with application to data analysis of COVID-19

Quarantine measure is a commonly used non-pharmaceutical intervention du...

Please sign up or login with your details

Forgot password? Click here to reset