Cascading Non-Stationary Bandits: Online Learning to Rank in the Non-Stationary Cascade Model

05/29/2019
by   Chang Li, et al.
0

Non-stationarity appears in many online applications such as web search and advertising. In this paper, we study the online learning to rank problem in a non-stationary environment where user preferences change abruptly at an unknown moment in time. We consider the problem of identifying the K most attractive items and propose cascading non-stationary bandits, an online learning variant of the cascading model, where a user browses a ranked list from top to bottom and clicks on the first attractive item. We propose two algorithms for solving this non-stationary problem: CascadeDUCB and CascadeSWUCB. We analyze their performance and derive gap-dependent upper bounds on the n-step regret of these algorithms. We also establish a lower bound on the regret for cascading non-stationary bandits and show that both algorithms match the lower bound up to a logarithmic factor. Finally, we evaluate their performance on a real-world web search click dataset.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/10/2015

Cascading Bandits: Learning to Rank in the Cascade Model

A search engine usually outputs a list of K web pages. The user examines...
research
12/01/2020

Non-Stationary Latent Bandits

Users of recommender systems often behave in a non-stationary fashion, d...
research
03/07/2017

Online Learning to Rank in Stochastic Click Models

Online learning to rank is a core problem in information retrieval and m...
research
02/08/2023

Non-Stationary Bandits with Knapsack Problems with Advice

We consider a non-stationary Bandits with Knapsack problem. The outcome ...
research
10/12/2021

Optimizing Ranking Systems Online as Bandits

Ranking system is the core part of modern retrieval and recommender syst...
research
03/01/2013

Second-Order Non-Stationary Online Learning for Regression

The goal of a learner, in standard online learning, is to have the cumul...
research
02/14/2018

Online Learning for Non-Stationary A/B Tests

The rollout of new versions of a feature in modern applications is a man...

Please sign up or login with your details

Forgot password? Click here to reset