Dynamic Assortment Selection under the Nested Logit Models

06/27/2018
by   Xi Chen, et al.
0

We study a stylized dynamic assortment planning problem during a selling season of finite length T, by considering a nested multinomial logit model with M nests and N items per nest. Our policy simultaneously learns customers' choice behavior and makes dynamic decisions on assortments based on the current knowledge. It achieves the regret at the order of Õ(√(MNT)+MN^2), where M is the number of nests and N is the number of products in each nest. We further provide a lower bound result of Ω(√(MT)), which shows the optimality of the upper bound when T>M and N is small. However, the N^2 term in the upper bound is not ideal for applications where N is large as compared to T. To address this issue, we further generalize our first policy by introducing a discretization technique, which leads to a regret of Õ(√(M)T^2/3+MNT^1/3) with a specific choice of discretization granularity. It improves the previous regret bound whenever N>T^1/3. We provide numerical results to demonstrate the empirical performance of both proposed policies.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/08/2018

Proximal Online Gradient is Optimum for Dynamic Regret

In online learning, the dynamic regret metric chooses the reference (opt...
research
06/26/2019

A Tractable Algorithm For Finite-Horizon Continuous Reinforcement Learning

We consider the finite horizon continuous reinforcement learning problem...
research
07/12/2022

Simultaneously Learning Stochastic and Adversarial Bandits under the Position-Based Model

Online learning to rank (OLTR) interactively learns to choose lists of i...
research
10/09/2019

Robust Dynamic Assortment Optimization in the Presence of Outlier Customers

We consider the dynamic assortment optimization problem under the multin...
research
04/04/2023

Online Joint Assortment-Inventory Optimization under MNL Choices

We study an online joint assortment-inventory optimization problem, in w...
research
10/31/2018

Dynamic Assortment Optimization with Changing Contextual Information

In this paper, we study the dynamic assortment optimization problem unde...
research
10/22/2015

Inventory Control Involving Unknown Demand of Discrete Nonperishable Items - Analysis of a Newsvendor-based Policy

Inventory control with unknown demand distribution is considered, with e...

Please sign up or login with your details

Forgot password? Click here to reset