Overcoming Prior Misspecification in Online Learning to Rank

01/25/2023
by   Javad Azizi, et al.
0

The recent literature on online learning to rank (LTR) has established the utility of prior knowledge to Bayesian ranking bandit algorithms. However, a major limitation of existing work is the requirement for the prior used by the algorithm to match the true prior. In this paper, we propose and analyze adaptive algorithms that address this issue and additionally extend these results to the linear and generalized linear models. We also consider scalar relevance feedback on top of click feedback. Moreover, we demonstrate the efficacy of our algorithms using both synthetic and real-world experiments.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/28/2020

Unbiased Learning to Rank: Online or Offline?

How to obtain an unbiased ranking model by learning to rank with biased ...
research
05/30/2023

Adversarial Attacks on Online Learning to Rank with Stochastic Click Models

We propose the first study of adversarial attacks on online learning to ...
research
02/20/2019

Adaptive scale-invariant online algorithms for learning linear models

We consider online learning with linear models, where the algorithm pred...
research
12/12/2018

Online Learning to Rank with List-level Feedback for Image Filtering

Online learning to rank (OLTR) via implicit feedback has been extensivel...
research
08/16/2022

Online Learning for Mixture of Multivariate Hawkes Processes

Online learning of Hawkes processes has received increasing attention in...
research
06/15/2019

Practical User Feedback-driven Internal Search Using Online Learning to Rank

We present a system, Spoke, for creating and searching internal knowledg...
research
05/18/2012

Online Structured Prediction via Coactive Learning

We propose Coactive Learning as a model of interaction between a learnin...

Please sign up or login with your details

Forgot password? Click here to reset