Learning to Rank under Multinomial Logit Choice

09/07/2020
by James A. Grant, et al.

Learning the optimal ordering of content is an important challenge in website design. The learning to rank (LTR) framework models this as a sequential problem of selecting lists of content and observing where users decide to click. Most previous work on LTR assumes that the user considers each item in the list in isolation and makes a binary choice to click or not on each. We introduce a multinomial logit (MNL) choice model to the LTR framework, which captures the behaviour of users who consider the ordered list of items as a whole and make a single choice among all the items and a no-click option. Under the MNL model, the user favours items which are either inherently more attractive or placed in a preferable position within the list. We propose upper confidence bound algorithms to minimise regret in two settings: where the position-dependent parameters are known, and where they are unknown. We present theoretical analysis leading to an Ω(√T) lower bound for the problem and an Õ(√T) upper bound on regret for the known-parameter version. Our analyses are based on tight new concentration results for Geometric random variables and novel functional inequalities for maximum likelihood estimators computed on discrete data.
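As a rough illustration of the single-choice behaviour described in the abstract, the Python sketch below computes click probabilities for one ordered list. It assumes a multiplicative form in which the weight of an item is its attractiveness times the weight of its position, with the no-click option given weight 1. That form, and the names `mnl_click_probabilities`, `simulate_click`, `attractiveness` and `position_weights`, are assumptions made for this example and are not taken from the abstract.

```python
import random


def mnl_click_probabilities(attractiveness, position_weights):
    """Return (no_click_prob, click_probs) for one ordered list.

    attractiveness[k] is the (assumed) attractiveness of the item shown in
    position k; position_weights[k] is the (assumed) weight of position k.
    The no-click option has weight 1, as in a standard MNL model.
    """
    if len(attractiveness) != len(position_weights):
        raise ValueError("one position weight is needed per displayed item")
    # Assumed multiplicative form: weight = attractiveness * position weight.
    weights = [a * w for a, w in zip(attractiveness, position_weights)]
    denom = 1.0 + sum(weights)
    return 1.0 / denom, [w / denom for w in weights]


def simulate_click(attractiveness, position_weights, rng=random):
    """Sample one user choice: a position index, or None for no click."""
    no_click, clicks = mnl_click_probabilities(attractiveness, position_weights)
    outcomes = [None] + list(range(len(clicks)))
    return rng.choices(outcomes, weights=[no_click] + clicks, k=1)[0]


# Two items with attractiveness 0.5 and 1.0, two positions with weights
# 1.0 (top) and 0.5 (bottom). Compare the two possible orderings.
weak_first = mnl_click_probabilities([0.5, 1.0], [1.0, 0.5])
strong_first = mnl_click_probabilities([1.0, 0.5], [1.0, 0.5])
```

Under these assumed numbers, placing the more attractive item in the top position lowers the no-click probability, which is the kind of ordering effect a ranking algorithm would try to learn from click feedback.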

research
03/25/2021

Multinomial Logit Contextual Bandits: Provable Optimality and Practicality

We consider a sequential assortment selection problem where the user cho...
research
02/10/2015

Cascading Bandits: Learning to Rank in the Cascade Model

A search engine usually outputs a list of K web pages. The user examines...
research
06/30/2022

Ranking in Contextual Multi-Armed Bandits

We study a ranking problem in the contextual multi-armed bandit setting....
research
02/09/2016

DCM Bandits: Learning to Rank with Multiple Clicks

A search engine recommends to the user a list of web pages. The user exa...
research
03/01/2022

Ordered Submodularity and its Applications to Diversifying Recommendations

A fundamental task underlying many important optimization problems, from...
research
02/17/2016

Choice by Elimination via Deep Neural Networks

We introduce Neural Choice by Elimination, a new framework that integrat...
research
07/15/2011

From Small-World Networks to Comparison-Based Search

The problem of content search through comparisons has recently received ...