MNL-Bandit with Knapsacks

06/02/2021
by   Abdellah Aznag, et al.
0

We consider a dynamic assortment selection problem where a seller has a fixed inventory of N substitutable products and faces an unknown demand that arrives sequentially over T periods. In each period, the seller needs to decide on the assortment of products (of cardinality at most K) to offer to the customers. The customer's response follows an unknown multinomial logit model (MNL) with parameters v. The goal of the seller is to maximize the total expected revenue given the fixed initial inventory of N products. We give a policy that achieves a regret of Õ(K √(K N T)(1 + √(v_max)/q_minOPT) ) under a mild assumption on the model parameters. In particular, our policy achieves a near-optimal Õ(√(T)) regret in the large inventory setting. Our policy builds upon the UCB-based approach for MNL-bandit without inventory constraints in [1] and addresses the inventory constraints through an exponentially sized LP for which we present a tractable approximation while keeping the Õ(√(T)) regret bound.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/28/2020

Improved Optimistic Algorithm For The Multinomial Logit Contextual Bandit

We consider a dynamic assortment selection problem where the goal is to ...
research
10/18/2016

Dynamic Assortment Personalization in High Dimensions

We study the problem of dynamic assortment personalization with large, h...
research
05/10/2023

Constant Approximation for Network Revenue Management with Markovian-Correlated Customer Arrivals

The Network Revenue Management (NRM) problem is a well-known challenge i...
research
03/19/2019

Conservative Exploration for Semi-Bandits with Linear Generalization: A Product Selection Problem for Urban Warehouses

The recent rising popularity of ultra-fast delivery services on retail p...
research
11/01/2021

Dynamic Pricing and Demand Learning on a Large Network of Products: A PAC-Bayesian Approach

We consider a seller offering a large network of N products over a time ...
research
10/16/2017

On the Hardness of Inventory Management with Censored Demand Data

We consider a repeated newsvendor problem where the inventory manager ha...
research
10/31/2018

Dynamic Assortment Optimization with Changing Contextual Information

In this paper, we study the dynamic assortment optimization problem unde...

Please sign up or login with your details

Forgot password? Click here to reset