Offline Evaluation of Ranked Lists using Parametric Estimation of Propensities

06/06/2022
by   Vishwa Vinay, et al.
0

Search engines and recommendation systems attempt to continually improve the quality of the experience they afford to their users. Refining the ranker that produces the lists displayed in response to user requests is an important component of this process. A common practice is for the service providers to make changes (e.g. new ranking features, different ranking models) and A/B test them on a fraction of their users to establish the value of the change. An alternative approach estimates the effectiveness of the proposed changes offline, utilising previously collected clickthrough data on the old ranker to posit what the user behaviour on ranked lists produced by the new ranker would have been. A majority of offline evaluation approaches invoke the well studied inverse propensity weighting to adjust for biases inherent in logged data. In this paper, we propose the use of parametric estimates for these propensities. Specifically, by leveraging well known learning-to-rank methods as subroutines, we show how accurate offline evaluation can be achieved when the new rankings to be evaluated differ from the logged ones.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/11/2018

Offline Comparison of Ranking Functions using Randomized Data

Ranking functions return ranked lists of items, and users often interact...
research
04/27/2018

Offline Evaluation of Ranking Policies with Click Models

Many web systems rank and present a list of items to users, from recomme...
research
05/13/2021

A Methodology for the Offline Evaluation of Recommender Systems in a User Interface with Multiple Carousels

Many video-on-demand and music streaming services provide the user with ...
research
07/03/2014

Reducing Offline Evaluation Bias in Recommendation Systems

Recommendation systems have been integrated into the majority of large o...
research
06/15/2018

BubbleRank: Safe Online Learning to Rerank

We study the problem of online learning to re-rank, where users provide ...
research
07/31/2019

Sudden Death: A New Way to Compare Recommendation Diversification

This paper describes problems with the current way we compare the divers...
research
08/31/2022

Inverse Propensity Score based offline estimator for deterministic ranking lists using position bias

In this work, we present a novel way of computing IPS using a position-b...

Please sign up or login with your details

Forgot password? Click here to reset