Online Learning to Rank with List-level Feedback for Image Filtering

12/12/2018
by   Chang Li, et al.
0

Online learning to rank (OLTR) via implicit feedback has been extensively studied for document retrieval in cases where the feedback is available at the level of individual items. To learn from item-level feedback, the current algorithms require certain assumptions about user behavior. In this paper, we study a more general setup: OLTR with list-level feedback, where the feedback is provided only at the level of an entire ranked list. We propose two methods that allow online learning to rank in this setup. The first method, PGLearn, uses a ranking model to generate policies and optimizes it online using policy gradients. The second method, RegLearn, learns to combine individual document relevance scores by directly predicting the observed list-level feedback through regression. We evaluate the proposed methods on the image filtering task, in which deep neural networks (DNNs) are used to rank images in response to a set of standing queries. We show that PGLearn does not perform well in OLTR with list-level feedback. RegLearn, instead, shows good performance in both online and offline metrics.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/01/2018

Online Diverse Learning to Rank from Partial-Click Feedback

Learning to rank is an important problem in machine learning and recomme...
research
12/01/2019

A Contextual-Bandit Approach to Online Learning to Rank for Relevance and Diversity

Online learning to rank (LTR) focuses on learning a policy from user int...
research
06/15/2018

BubbleRank: Safe Online Learning to Rerank

We study the problem of online learning to re-rank, where users provide ...
research
06/15/2019

Practical User Feedback-driven Internal Search Using Online Learning to Rank

We present a system, Spoke, for creating and searching internal knowledg...
research
02/24/2017

Rank-to-engage: New Listwise Approaches to Maximize Engagement

For many internet businesses, presenting a given list of items in an ord...
research
06/09/2023

RankFormer: Listwise Learning-to-Rank Using Listwide Labels

Web applications where users are presented with a limited selection of i...
research
01/25/2023

Overcoming Prior Misspecification in Online Learning to Rank

The recent literature on online learning to rank (LTR) has established t...

Please sign up or login with your details

Forgot password? Click here to reset