Fair Learning-to-Rank from Implicit Feedback

11/19/2019
by Himank Yadav, et al.

Addressing unfairness in rankings has become an increasingly important problem due to the growing influence of rankings in critical decision making, yet existing learning-to-rank algorithms suffer from multiple drawbacks when learning fair ranking policies from implicit feedback. Some algorithms suffer from extrinsic reasons for unfairness: inherent selection biases in implicit feedback lead to rich-get-richer dynamics. Those that do address the biased nature of implicit feedback suffer from intrinsic reasons for unfairness, because they lack explicit control over the allocation of exposure based on merit (i.e., relevance). In both cases, the learned ranking policy can be unfair and lead to suboptimal results. To this end, we propose a novel learning-to-rank framework, FULTR, that is the first to address both intrinsic and extrinsic reasons for unfairness when learning ranking policies from logged implicit feedback. Considering the needs of various applications, we define a class of amortized fairness-of-exposure constraints with respect to items based on their merit, and propose corresponding counterfactual estimators of disparity (i.e., unfairness) and utility that are also robust to click noise. Furthermore, we provide an efficient algorithm that optimizes both utility and fairness via a policy-gradient approach. In addition to the theoretical justification of the framework, we provide empirical results showing that the proposed algorithm learns accurate and fair ranking policies from biased and noisy feedback.
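To make the notion of merit-based exposure concrete, here is a minimal Python sketch of how a disparity between two item groups might be measured for a stochastic ranking policy. It is an illustration under stated assumptions, not the FULTR estimators themselves: the abstract does not give their formulas. The Plackett-Luce sampling, the logarithmic position-bias model, the difference-of-ratios disparity, and all function names are assumptions introduced here for illustration.

```python
import math
import random


def sample_ranking(scores, rng):
    """Sample one ranking from a Plackett-Luce policy (assumed policy class).

    Items are drawn one at a time without replacement, each with
    probability proportional to exp(score).
    """
    remaining = list(range(len(scores)))
    ranking = []
    while remaining:
        weights = [math.exp(scores[i]) for i in remaining]
        pick = rng.choices(remaining, weights=weights, k=1)[0]
        ranking.append(pick)
        remaining.remove(pick)
    return ranking


def position_exposure(rank):
    """Assumed position-bias model: exposure decays as 1 / log2(1 + rank).

    `rank` is 1-based. This is a common modelling choice, not taken
    from the source.
    """
    return 1.0 / math.log2(1 + rank)


def expected_exposure(scores, n_samples=2000, seed=0):
    """Monte Carlo estimate of each item's expected exposure under the policy."""
    rng = random.Random(seed)
    totals = [0.0] * len(scores)
    for _ in range(n_samples):
        for pos, item in enumerate(sample_ranking(scores, rng), start=1):
            totals[item] += position_exposure(pos)
    return [t / n_samples for t in totals]


def group_disparity(exposure, merit, groups):
    """Exposure per unit of merit for group 0 minus that for group 1.

    A value near zero means exposure is allocated roughly in
    proportion to merit (hypothetical disparity measure).
    """
    def ratio(g):
        idx = [i for i, gi in enumerate(groups) if gi == g]
        return sum(exposure[i] for i in idx) / sum(merit[i] for i in idx)

    return ratio(0) - ratio(1)


# Four items, two groups, equal merit.
groups = [0, 0, 1, 1]
merit = [1.0, 1.0, 1.0, 1.0]

fair = group_disparity(expected_exposure([0.0, 0.0, 0.0, 0.0]), merit, groups)
skewed = group_disparity(expected_exposure([3.0, 3.0, 0.0, 0.0]), merit, groups)
print(f"uniform policy disparity: {fair:.3f}")
print(f"skewed policy disparity:  {skewed:.3f}")
```

With equal merit, a policy that scores all items identically should give a disparity close to zero, while a policy that strongly favors group 0 gives a clearly positive one. A fairness-aware learner would penalize such a disparity alongside a utility term; in the logged-feedback setting the abstract describes, both quantities would have to be estimated counterfactually from clicks rather than computed from known merit as done here.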

research
02/11/2019

Policy Learning for Fairness in Ranking

Conventional Learning-to-Rank (LTR) methods optimize the utility of the ...
research
08/25/2023

Optimizing Group-Fair Plackett-Luce Ranking Models for Relevance and Ex-Post Fairness

In learning-to-rank (LTR), optimizing only the relevance (or the expecte...
research
10/28/2021

Sayer: Using Implicit Feedback to Optimize System Policies

We observe that many system policies that make threshold decisions invol...
research
05/29/2020

Controlling Fairness and Bias in Dynamic Learning-to-Rank

Rankings are the primary interface through which many online platforms m...
research
04/29/2023

Learning to Re-rank with Constrained Meta-Optimal Transport

Many re-ranking strategies in search systems rely on stochastic ranking ...
research
11/01/2020

U-rank: Utility-oriented Learning to Rank with Implicit Feedback

Learning to rank with implicit feedback is one of the most important tas...
research
05/16/2022

Pareto-Optimal Fairness-Utility Amortizations in Rankings with a DBN Exposure Model

In recent years, it has become clear that rankings delivered in many are...
