Can I Trust My Fairness Metric? Assessing Fairness with Unlabeled Data and Bayesian Inference

10/19/2020
by   Disi Ji, et al.
5

We investigate the problem of reliably assessing group fairness when labeled examples are few but unlabeled examples are plentiful. We propose a general Bayesian framework that can augment labeled data with unlabeled data to produce more accurate and lower-variance estimates compared to methods based on labeled data alone. Our approach estimates calibrated scores for unlabeled examples in each group using a hierarchical latent variable model conditioned on labeled examples. This in turn allows for inference of posterior distributions with associated notions of uncertainty for a variety of group fairness metrics. We demonstrate that our approach leads to significant and consistent reductions in estimation error across multiple well-known fairness datasets, sensitive attributes, and predictive models. The results show the benefits of using both unlabeled data and Bayesian inference in terms of assessing whether a prediction model is fair or not.

READ FULL TEXT
research
05/28/2019

When can unlabeled data improve the learning rate?

In semi-supervised classification, one is given access both to labeled a...
research
04/26/2015

Assessing binary classifiers using only positive and unlabeled data

Assessing the performance of a learned model is a crucial part of machin...
research
04/07/2012

Density-sensitive semisupervised inference

Semisupervised methods are techniques for using labeled data (X_1,Y_1),....
research
03/03/2021

Comparing the Value of Labeled and Unlabeled Data in Method-of-Moments Latent Variable Estimation

Labeling data for modern machine learning is expensive and time-consumin...
research
02/18/2020

Hierarchical Classification of Enzyme Promiscuity Using Positive, Unlabeled, and Hard Negative Examples

Despite significant progress in sequencing technology, there are many ce...
research
02/16/2023

Group Fairness with Uncertainty in Sensitive Attributes

We consider learning a fair predictive model when sensitive attributes a...
research
06/15/2020

Automatic Validation of Textual Attribute Values in E-commerce Catalog by Learning with Limited Labeled Data

Product catalogs are valuable resources for eCommerce website. In the ca...

Please sign up or login with your details

Forgot password? Click here to reset