Computationally Efficient Optimization of Plackett-Luce Ranking Models for Relevance and Fairness

05/03/2021
by   Harrie Oosterhuis, et al.
0

Recent work has proposed stochastic Plackett-Luce (PL) ranking models as a robust choice for optimizing relevance and fairness metrics. Unlike their deterministic counterparts that require heuristic optimization algorithms, PL models are fully differentiable. Theoretically, they can be used to optimize ranking metrics via stochastic gradient descent. However, in practice, the computation of the gradient is infeasible because it requires one to iterate over all possible permutations of items. Consequently, actual applications rely on approximating the gradient via sampling techniques. In this paper, we introduce a novel algorithm: PL-Rank, that estimates the gradient of a PL ranking model w.r.t. both relevance and fairness metrics. Unlike existing approaches that are based on policy gradients, PL-Rank makes use of the specific structure of PL models and ranking metrics. Our experimental analysis shows that PL-Rank has a greater sample-efficiency and is computationally less costly than existing policy gradients, resulting in faster convergence at higher performance. PL-Rank further enables the industry to apply PL models for more relevant and fairer real-world ranking systems.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/25/2023

Optimizing Group-Fair Plackett-Luce Ranking Models for Relevance and Ex-Post Fairness

In learning-to-rank (LTR), optimizing only the relevance (or the expecte...
research
03/04/2020

StochasticRank: Global Optimization of Scale-Free Discrete Functions

In this paper, we introduce a powerful and efficient framework for the d...
research
02/11/2019

Policy Learning for Fairness in Ranking

Conventional Learning-to-Rank (LTR) methods optimize the utility of the ...
research
05/17/2023

Unconfounded Propensity Estimation for Unbiased Ranking

The goal of unbiased learning to rank (ULTR) is to leverage implicit use...
research
04/22/2022

Learning-to-Rank at the Speed of Sampling: Plackett-Luce Gradient Estimation With Minimal Computational Complexity

Plackett-Luce gradient estimation enables the optimization of stochastic...
research
11/01/2021

Calibrating Explore-Exploit Trade-off for Fair Online Learning to Rank

Online learning to rank (OL2R) has attracted great research interests in...
research
04/28/2022

Probabilistic Permutation Graph Search: Black-Box Optimization for Fairness in Ranking

There are several measures for fairness in ranking, based on different u...

Please sign up or login with your details

Forgot password? Click here to reset