Off-policy evaluation for learning-to-rank via interpolating the item-position model and the position-based model

10/15/2022
by   Alexander Buchholz, et al.
0

A critical need for industrial recommender systems is the ability to evaluate recommendation policies offline, before deploying them to production. Unfortunately, widely used off-policy evaluation methods either make strong assumptions about how users behave that can lead to excessive bias, or they make fewer assumptions and suffer from large variance. We tackle this problem by developing a new estimator that mitigates the problems of the two most popular off-policy estimators for rankings, namely the position-based model and the item-position model. In particular, the new estimator, called INTERPOL, addresses the bias of a potentially misspecified position-based model, while providing an adaptable bias-variance trade-off compared to the item-position model. We provide theoretical arguments as well as empirical results that highlight the performance of our novel estimation approach.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/03/2022

Doubly Robust Off-Policy Evaluation for Ranking Policies under the Cascade Behavior Model

In real-world recommender systems and search engines, optimizing ranking...
research
08/07/2023

Doubly Robust Estimator for Off-Policy Evaluation with Large Action Spaces

We study Off-Policy Evaluation (OPE) in contextual bandit settings with ...
research
08/31/2022

Inverse Propensity Score based offline estimator for deterministic ranking lists using position bias

In this work, we present a novel way of computing IPS using a position-b...
research
11/05/2018

Intervention Harvesting for Context-Dependent Examination-Bias Estimation

Accurate estimates of examination bias are crucial for unbiased learning...
research
05/10/2023

Improving position bias estimation against sparse and skewed dataset with item embedding

Estimating position bias is a well-known challenge in Learning to rank (...
research
09/17/2019

Ranking metrics on non-shuffled traffic

Ranking metrics are a family of metrics largely used to evaluate recomme...
research
07/28/2021

Ranker-agnostic Contextual Position Bias Estimation

Learning-to-rank (LTR) algorithms are ubiquitous and necessary to explor...

Please sign up or login with your details

Forgot password? Click here to reset