Augmented Fairness: An Interpretable Model Augmenting Decision-Makers' Fairness

11/17/2020
by   Tong Wang, et al.
8

We propose a model-agnostic approach for mitigating the prediction bias of a black-box decision-maker, and in particular, a human decision-maker. Our method detects in the feature space where the black-box decision-maker is biased and replaces it with a few short decision rules, acting as a "fair surrogate". The rule-based surrogate model is trained under two objectives, predictive performance and fairness. Our model focuses on a setting that is common in practice but distinct from other literature on fairness. We only have black-box access to the model, and only a limited set of true labels can be queried under a budget constraint. We formulate a multi-objective optimization for building a surrogate model, where we simultaneously optimize for both predictive performance and bias. To train the model, we propose a novel training algorithm that combines a nondominated sorting genetic algorithm with active learning. We test our model on public datasets where we simulate various biased "black-box" classifiers (decision-makers) and apply our approach for interpretable augmented fairness.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/10/2020

Interpretable Companions for Black-Box Models

We present an interpretable companion model for any pre-trained black-bo...
research
04/22/2022

Balancing Fairness and Accuracy in Sentiment Detection using Multiple Black Box Models

Sentiment detection is an important building block for multiple informat...
research
07/14/2020

Model-Agnostic Interpretable and Data-driven suRRogates suited for highly regulated industries

Highly regulated industries, like banking and insurance, ask for transpa...
research
06/04/2019

Concept Tree: High-Level Representation of Variables for More Interpretable Surrogate Decision Trees

Interpretable surrogates of black-box predictors trained on high-dimensi...
research
11/15/2016

Iterative Orthogonal Feature Projection for Diagnosing Bias in Black-Box Models

Predictive models are increasingly deployed for the purpose of determini...
research
02/16/2022

On Learning and Enforcing Latent Assessment Models using Binary Feedback from Human Auditors Regarding Black-Box Classifiers

Algorithmic fairness literature presents numerous mathematical notions a...
research
05/19/2023

Latent Imitator: Generating Natural Individual Discriminatory Instances for Black-Box Fairness Testing

Machine learning (ML) systems have achieved remarkable performance acros...

Please sign up or login with your details

Forgot password? Click here to reset