Achieving User-Side Fairness in Contextual Bandits

10/22/2020
by   Wen Huang, et al.
Personalized recommendation based on multi-armed bandit (MAB) algorithms has been shown to achieve high utility and efficiency because it can dynamically adapt the recommendation strategy based on feedback. However, unfairness can arise in personalized recommendation. In this paper, we study how to achieve user-side fairness in personalized recommendation. We formulate fair personalized recommendation as a modified contextual bandit and focus on achieving fairness for the individual who is being recommended an item, as opposed to fairness for the items that are being recommended. We introduce and define a metric that captures fairness in terms of the rewards received by the privileged and protected groups. We develop a fair contextual bandit algorithm, Fair-LinUCB, that improves upon the traditional LinUCB algorithm to achieve group-level fairness of users. Our algorithm detects and monitors unfairness while it learns to recommend personalized videos to students with high efficiency. We provide a theoretical regret analysis and show that our algorithm has a slightly higher regret bound than LinUCB. We conduct numerous experimental evaluations comparing the performance of our fair contextual bandit with that of LinUCB and show that our approach achieves group-level fairness while maintaining high utility.
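
For readers unfamiliar with the baseline, the following is a minimal sketch of standard disjoint LinUCB together with one simple way to monitor a reward gap between two user groups. It is not the authors' Fair-LinUCB: the abstract does not specify how the algorithm or its fairness metric is defined, so the class name `LinUCB`, the helper `group_reward_gap`, the parameter `alpha`, and the choice of "absolute difference in mean reward" as the gap are all assumptions made for illustration.

```python
import numpy as np


class LinUCB:
    """Standard disjoint LinUCB: one ridge-regression model per arm."""

    def __init__(self, n_arms, dim, alpha=1.0):
        self.alpha = alpha  # exploration weight (assumed value, tune per task)
        self.A = [np.eye(dim) for _ in range(n_arms)]    # per-arm design matrix
        self.b = [np.zeros(dim) for _ in range(n_arms)]  # per-arm reward vector

    def select(self, x):
        """Return the arm with the highest upper confidence bound for context x."""
        scores = []
        for A, b in zip(self.A, self.b):
            A_inv = np.linalg.inv(A)
            theta = A_inv @ b                             # ridge estimate
            bonus = self.alpha * np.sqrt(x @ A_inv @ x)   # confidence width
            scores.append(theta @ x + bonus)
        return int(np.argmax(scores))

    def update(self, arm, x, reward):
        """Fold the observed reward for the chosen arm into its model."""
        self.A[arm] += np.outer(x, x)
        self.b[arm] += reward * x


def group_reward_gap(rewards, groups):
    """Hypothetical monitor: |mean reward of group 1 - mean reward of group 0|.

    This is an illustrative stand-in, not the metric defined in the paper.
    """
    rewards = np.asarray(rewards, dtype=float)
    groups = np.asarray(groups)
    return abs(rewards[groups == 1].mean() - rewards[groups == 0].mean())
```

In use, one would call `select` with each arriving user's context vector, observe the reward, call `update`, and periodically evaluate `group_reward_gap` over the logged rewards and group labels to see whether one group is systematically receiving lower rewards.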

research
03/03/2021

Fairness of Exposure in Stochastic Bandits

Contextual bandit algorithms have become widely used for recommendation ...
research
09/21/2021

Achieving Counterfactual Fairness for Causal Bandit

In online recommendation, customers arrive in a sequential and stochasti...
research
05/23/2016

Fairness in Learning: Classic and Contextual Bandits

We introduce the study of fairness in multi-armed bandit problems. Our f...
research
09/09/2018

Fairness-Aware Recommendation of Information Curators

This paper highlights our ongoing efforts to create effective informatio...
research
12/09/2019

Group Fairness in Bandit Arm Selection

We consider group fairness in the contextual bandit setting. Here, a seq...
research
02/28/2010

A Contextual-Bandit Approach to Personalized News Article Recommendation

Personalized web services strive to adapt their services (advertisements...
research
06/23/2023

Trading-off price for data quality to achieve fair online allocation

We consider the problem of online allocation subject to a long-term fair...
