Fairness via Adversarial Attribute Neighbourhood Robust Learning

10/12/2022
by Qi Qi, et al.

Improving fairness between privileged and less-privileged sensitive attribute groups (e.g., race, gender) has attracted considerable attention. To make the model perform uniformly well across different sensitive attributes, we propose a principled Robust Adversarial Attribute Neighbourhood (RAAN) loss to debias the classification head and promote a fairer representation distribution across different sensitive attribute groups. The key idea of RAAN is to mitigate the differences in biased representations between sensitive attribute groups by assigning each sample an adversarial robust weight, which is defined on the representations of its adversarial attribute neighbors, i.e., the samples from different protected groups. To provide efficient optimization algorithms, we cast RAAN as a sum of coupled compositional functions and propose SCRAAN, a stochastic algorithm framework with adaptive (Adam-style) and non-adaptive (SGD-style) variants and a provable theoretical guarantee. Extensive empirical studies on fairness-related benchmark datasets verify the effectiveness of the proposed method.
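To make the weighting idea concrete, here is a minimal NumPy sketch of one plausible reading of "a per-sample weight defined on the representations of samples from different protected groups". It is not the RAAN loss from the paper, whose exact form the abstract does not give. The function name `neighbourhood_weights`, the scoring rule (neighbour loss minus squared representation distance) and the `temperature` parameter are all assumptions made purely for illustration.

```python
import numpy as np


def neighbourhood_weights(reps, losses, groups, temperature=1.0):
    """Toy per-sample weights built from neighbours in other protected groups.

    Hypothetical illustration only; the scoring rule is an assumption,
    not the definition used in the paper.
    """
    reps = np.asarray(reps, dtype=float)
    losses = np.asarray(losses, dtype=float)
    groups = np.asarray(groups)
    if np.unique(groups).size < 2:
        raise ValueError("need at least two protected groups")

    # Pairwise squared distances between sample representations.
    diff = reps[:, None, :] - reps[None, :, :]
    sq_dist = (diff ** 2).sum(axis=-1)

    # mask[i, j] is True when sample j is in a different group than sample i.
    mask = groups[:, None] != groups[None, :]

    # Assumed score: favour neighbours that have high loss and lie close
    # to sample i in representation space.
    scores = (losses[None, :] - sq_dist) / temperature
    scores = np.where(mask, scores, -np.inf)

    # Row-wise softmax restricted to the other-group neighbours.
    scores = scores - scores.max(axis=1, keepdims=True)
    probs = np.exp(scores)
    probs /= probs.sum(axis=1, keepdims=True)

    # Weight of sample j: average mass its cross-group neighbours put on it.
    return probs.mean(axis=0)


reps = [[0.0, 0.0], [0.0, 1.0], [1.0, 0.0], [1.0, 1.0]]
losses = np.array([0.1, 2.0, 0.3, 0.4])
groups = [0, 0, 1, 1]

weights = neighbourhood_weights(reps, losses, groups)
weighted_loss = float((weights * losses).sum())
```

In this toy setup the weights sum to one, so `weighted_loss` is a reweighted average of the per-sample losses that leans toward samples which are hard and sit near the other group in representation space. A training loop would recompute the weights as the representations change, which is the part the paper's SCRAAN framework addresses with stochastic updates.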


research
07/28/2022

Multiple Attribute Fairness: Application to Fraud Detection

We propose a fairness measure relaxing the equality conditions in the po...
research
06/11/2023

Toward Fair Facial Expression Recognition with Improved Distribution Alignment

We present a novel approach to mitigate bias in facial expression recogn...
research
06/23/2021

Fairness via Representation Neutralization

Existing bias mitigation methods for DNN models primarily work on learni...
research
09/17/2019

AdaFair: Cumulative Fairness Adaptive Boosting

The widespread use of ML-based decision making in domains with high soci...
research
05/10/2022

Selective Fairness in Recommendation via Prompts

Recommendation fairness has attracted great attention recently. In real-...
research
07/08/2022

Probing Classifiers are Unreliable for Concept Removal and Detection

Neural network models trained on text data have been found to encode und...
research
06/19/2019

Agnostic data debiasing through a local sanitizer learnt from an adversarial network approach

The widespread use of automated decision processes in many areas of our ...
