Counterfactual Learning-to-Rank for Additive Metrics and Deep Models

04/30/2018
by   Aman Agarwal, et al.
0

Implicit feedback (e.g., clicks, dwell times) is an attractive source of training data for Learning-to-Rank, but it inevitably suffers from biases such as position bias. It was recently shown how counterfactual inference techniques can provide a rigorous approach for handling these biases, but existing methods are restricted to the special case of optimizing average rank for linear ranking functions. In this work, we generalize the counterfactual learning-to-rank approach to a broad class of additive rank metrics -- like Discounted Cumulative Gain (DCG) and Precision@k -- as well as non-linear deep network models. Focusing on DCG, this conceptual generalization gives rise to two new learning methods that both directly optimize an unbiased estimate of DCG despite the bias in the implicit feedback data. The first, SVM PropDCG, generalizes the Propensity Ranking SVM (SVM PropRank), and we show how the resulting optimization problem can be addressed via the Convex Concave Procedure (CCP). The second, Deep PropDCG, further generalizes the counterfactual learning-to-rank approach to deep networks as non-linear ranking functions. In addition to the theoretical support, we empirically find that SVM PropDCG significantly outperforms SVM PropRank in terms of DCG, and that it is robust to varying severity of presentation bias, noise, and propensity-model misspecification. Moreover, the ability to train non-linear ranking functions via Deep PropDCG further improves DCG.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/18/2020

Policy-Aware Unbiased Learning to Rank for Top-k Rankings

Counterfactual Learning to Rank (LTR) methods optimize ranking systems u...
research
06/09/2018

Consistent Position Bias Estimation without Online Interventions for Learning-to-Rank

Presentation bias is one of the key challenges when learning from implic...
research
04/01/2022

Implicit Feedback for Dense Passage Retrieval: A Counterfactual Approach

In this paper we study how to effectively exploit implicit feedback in D...
research
07/15/2019

To Model or to Intervene: A Comparison of Counterfactual and Online Learning to Rank from User Interactions

Learning to Rank (LTR) from user interactions is challenging as user fee...
research
10/19/2022

Whole Page Unbiased Learning to Rank

The page presentation biases in the information retrieval system, especi...
research
12/12/2018

Estimating Position Bias without Intrusive Interventions

Presentation bias is one of the key challenges when learning from implic...
research
02/14/2020

Learning to rank for uplift modeling

Uplift modeling has effectively been used in fields such as marketing an...

Please sign up or login with your details

Forgot password? Click here to reset