Model-based Unbiased Learning to Rank

07/24/2022
by   Dan Luo, et al.
0

Unbiased Learning to Rank (ULTR) that learns to rank documents with biased user feedback data is a well-known challenge in information retrieval. Existing methods in unbiased learning to rank typically rely on click modeling or inverse propensity weighting (IPW). Unfortunately, the search engines are faced with severe long-tail query distribution, where neither click modeling nor IPW can handle well. Click modeling suffers from data sparsity problem since the same query-document pair appears limited times on tail queries; IPW suffers from high variance problem since it is highly sensitive to small propensity score values. Therefore, a general debiasing framework that works well under tail queries is in desperate need. To address this problem, we propose a model-based unbiased learning-to-rank framework. Specifically, we develop a general context-aware user simulator to generate pseudo clicks for unobserved ranked lists to train rankers, which addresses the data sparsity problem. In addition, considering the discrepancy between pseudo clicks and actual clicks, we take the observation of a ranked list as the treatment variable and further incorporate inverse propensity weighting with pseudo labels in a doubly robust way. The derived bias and variance indicate that the proposed model-based method is more robust than existing methods. Finally, extensive experiments on benchmark datasets, including simulated datasets and real click logs, demonstrate that the proposed model-based method consistently performs outperforms state-of-the-art methods in various scenarios.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/26/2022

Bilateral Self-unbiased Learning from Biased Implicit Feedback

Implicit feedback has been widely used to build commercial recommender s...
research
04/16/2018

Unbiased Learning to Rank with Unbiased Propensity Estimation

Learning to rank with biased click data is a well-known challenge. A var...
research
07/19/2020

Counterfactual Learning to Rank using Heterogeneous Treatment Effect Estimation

Learning-to-Rank (LTR) models trained from implicit feedback (e.g. click...
research
06/03/2022

Scalar is Not Enough: Vectorization-based Unbiased Learning to Rank

Unbiased learning to rank (ULTR) aims to train an unbiased ranking model...
research
08/16/2022

Approximated Doubly Robust Search Relevance Estimation

Extracting query-document relevance from the sparse, biased clickthrough...
research
05/11/2021

Federated Unbiased Learning to Rank

Unbiased Learning to Rank (ULTR) studies the problem of learning a ranki...
research
08/24/2021

Contrastive Learning of User Behavior Sequence for Context-Aware Document Ranking

Context information in search sessions has proven to be useful for captu...

Please sign up or login with your details

Forgot password? Click here to reset