Cascading Bandits for Large-Scale Recommendation Problems

03/17/2016
by   Shi Zong, et al.
0

Most recommender systems recommend a list of items. The user examines the list, from the first item to the last, and often chooses the first attractive item and does not examine the rest. This type of user behavior can be modeled by the cascade model. In this work, we study cascading bandits, an online learning variant of the cascade model where the goal is to recommend K most attractive items from a large set of L candidate items. We propose two algorithms for solving this problem, which are based on the idea of linear generalization. The key idea in our solutions is that we learn a predictor of the attraction probabilities of items from their features, as opposing to learning the attraction probability of each item independently as in the existing work. This results in practical learning algorithms whose regret does not depend on the number of items L. We bound the regret of one algorithm and comprehensively evaluate the other on a range of recommendation problems. The algorithm performs well and outperforms all baselines.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/01/2018

Online Diverse Learning to Rank from Partial-Click Feedback

Learning to rank is an important problem in machine learning and recomme...
research
02/09/2016

DCM Bandits: Learning to Rank with Multiple Clicks

A search engine recommends to the user a list of web pages. The user exa...
research
10/02/2018

Thompson Sampling for Cascading Bandits

We design and analyze TS-Cascade, a Thompson sampling algorithm for the ...
research
12/21/2020

New Recommendation Algorithm for Implicit Data Motivated by the Multivariate Normal Distribution

The goal of recommender systems is to help users find useful items from ...
research
02/22/2022

No-Regret Learning in Partially-Informed Auctions

Auctions with partially-revealed information about items are broadly emp...
research
09/01/2020

Exploration in two-stage recommender systems

Two-stage recommender systems are widely adopted in industry due to thei...
research
05/21/2023

Multi-channel Integrated Recommendation with Exposure Constraints

Integrated recommendation, which aims at jointly recommending heterogene...

Please sign up or login with your details

Forgot password? Click here to reset