Online Diverse Learning to Rank from Partial-Click Feedback

11/01/2018
by Prakhar Gupta, et al.

Learning to rank is an important problem in machine learning and recommender systems. In a recommender system, a user is typically recommended a list of items. Since the user is unlikely to examine the entire recommended list, partial feedback arises naturally. At the same time, diverse recommendations are important because it is challenging to model all tastes of the user in practice. In this paper, we propose the first algorithm for online learning to rank diverse items from partial-click feedback. We assume that the user examines the recommended items in order until one of them attracts the user, clicks on that item, and does not examine the remaining items. This model of user behavior is known as the cascade model. We propose an online learning algorithm, CascadeLSB, for solving our problem. The algorithm actively explores the tastes of the user with the objective of learning to recommend the optimal diverse list. We analyze the algorithm and prove a gap-free upper bound on its n-step regret. We evaluate CascadeLSB on both synthetic and real-world datasets, compare it to various baselines, and show that it learns even when our modeling assumptions do not hold exactly.
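
The partial feedback produced by the cascade model can be illustrated with a minimal sketch. It is not the paper's algorithm: it only simulates one user pass over a list, and it assumes each item attracts the user independently with a fixed probability, which is a simplification introduced here for illustration. The names `cascade_click`, `observed_feedback`, and `attraction_probs` are hypothetical.

```python
import random


def cascade_click(attraction_probs, rng):
    """Simulate one pass over a ranked list under the cascade model.

    attraction_probs[k] is the (assumed, hypothetical) probability that the
    item at position k attracts the user. Returns the 0-based position of
    the click, or None if the user examines the whole list without clicking.
    """
    for position, prob in enumerate(attraction_probs):
        if rng.random() < prob:
            return position  # user clicks and stops examining
    return None


def observed_feedback(num_items, click_position):
    """Per-position feedback visible to the learner.

    1    -> examined and clicked
    0    -> examined and not clicked
    None -> not examined, so nothing is observed
    """
    if click_position is None:
        return [0] * num_items
    return [0] * click_position + [1] + [None] * (num_items - click_position - 1)


if __name__ == "__main__":
    rng = random.Random(0)
    probs = [0.2, 0.5, 0.1, 0.4]  # illustrative values only
    click = cascade_click(probs, rng)
    print(click, observed_feedback(len(probs), click))
```

Items ranked below the click yield no observation at all, which is why the feedback is partial: the learner cannot tell whether those items would have attracted the user.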


research
12/01/2019

A Contextual-Bandit Approach to Online Learning to Rank for Relevance and Diversity

Online learning to rank (LTR) focuses on learning a policy from user int...
research
03/17/2016

Cascading Bandits for Large-Scale Recommendation Problems

Most recommender systems recommend a list of items. The user examines th...
research
12/12/2018

Online Learning to Rank with List-level Feedback for Image Filtering

Online learning to rank (OLTR) via implicit feedback has been extensivel...
research
06/15/2018

BubbleRank: Safe Online Learning to Rerank

We study the problem of online learning to re-rank, where users provide ...
research
06/06/2018

TopRank: A practical algorithm for online stochastic ranking

Online learning to rank is a sequential decision-making problem where in...
research
09/13/2021

Online Learning of Optimally Diverse Rankings

Search engines answer users' queries by listing relevant items (e.g. doc...
research
02/09/2016

DCM Bandits: Learning to Rank with Multiple Clicks

A search engine recommends to the user a list of web pages. The user exa...
