Multiaccuracy: Black-Box Post-Processing for Fairness in Classification

05/31/2018
by   Michael P. Kim, et al.
0

Machine learning predictors are successfully deployed in applications ranging from disease diagnosis, to predicting credit scores, to image recognition. Even when the overall accuracy is high, the predictions often have systematic biases that harm specific subgroups, especially for subgroups that are minorities in the training data. We develop a rigorous framework of multiaccuracy auditing and post-processing to improve predictor accuracies across identifiable subgroups. Our algorithm, MultiaccuracyBoost, works in any setting where we have black-box access to a predictor and a relatively small set of labeled data for auditing. We prove guarantees on the convergence rate of the algorithm and show that it improves overall accuracy at each step. Importantly, if the initial model is accurate on an identifiable subgroup, then the post-processed model will be also. We demonstrate the effectiveness of this approach on diverse applications in image classification, finance, and population health. MultiaccuracyBoost can improve subpopulation accuracy (e.g. for `black women') even when the sensitive features (e.g. `race', `gender') are not known to the algorithm.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/15/2019

MimickNet, Matching Clinical Post-Processing Under Realistic Black-Box Constraints

Image post-processing is used in clinical-grade ultrasound scanners to i...
research
10/17/2021

Developing a novel fair-loan-predictor through a multi-sensitive debiasing pipeline: DualFair

Machine learning (ML) models are increasingly used for high-stake applic...
research
07/09/2020

GAMA: a General Automated Machine learning Assistant

The General Automated Machine learning Assistant (GAMA) is a modular Aut...
research
01/31/2022

Fair Wrapping for Black-box Predictions

We introduce a new family of techniques to post-process ("wrap") a black...
research
01/26/2022

Competition over data: how does data purchase affect users?

As machine learning (ML) is deployed by many competing service providers...
research
09/15/2022

Multicalibrated Regression for Downstream Fairness

We show how to take a regression function f̂ that is appropriately “mult...

Please sign up or login with your details

Forgot password? Click here to reset