Ex-Ante Assessment of Discrimination in Dataset

08/16/2022
by   Jonathan Vasquez, et al.
0

Data owners face increasing liability for how the use of their data could harm under-priviliged communities. Stakeholders would like to identify the characteristics of data that lead to algorithms being biased against any particular demographic groups, for example, defined by their race, gender, age, and/or religion. Specifically, we are interested in identifying subsets of the feature space where the ground truth response function from features to observed outcomes differs across demographic groups. To this end, we propose FORESEE, a FORESt of decision trEEs algorithm, which generates a score that captures how likely an individual's response varies with sensitive attributes. Empirically, we find that our approach allows us to identify the individuals who are most likely to be misclassified by several classifiers, including Random Forest, Logistic Regression, Support Vector Machine, and k-Nearest Neighbors. The advantage of our approach is that it allows stakeholders to characterize risky samples that may contribute to discrimination, as well as, use the FORESEE to estimate the risk of upcoming samples.

READ FULL TEXT
research
01/14/2020

Perfecting the Crime Machine

This study explores using different machine learning techniques and work...
research
02/02/2022

Fairness of Machine Learning Algorithms in Demography

The paper is devoted to the study of the model fairness and process fair...
research
12/04/2019

Algorithmic Discrimination: Formulation and Exploration in Deep Learning-based Face Biometrics

The most popular face recognition benchmarks assume a distribution of su...
research
10/13/2020

Similarity Based Stratified Splitting: an approach to train better classifiers

We propose a Similarity-Based Stratified Splitting (SBSS) technique, whi...
research
04/18/2022

Demographic-Reliant Algorithmic Fairness: Characterizing the Risks of Demographic Data Collection in the Pursuit of Fairness

Most proposed algorithmic fairness techniques require access to data on ...
research
03/10/2018

Influence of the Event Rate on Discrimination Abilities of Bankruptcy Prediction Models

In bankruptcy prediction, the proportion of events is very low, which is...
research
12/04/2020

Biased Programmers? Or Biased Data? A Field Experiment in Operationalizing AI Ethics

Why do biased predictions arise? What interventions can prevent them? We...

Please sign up or login with your details

Forgot password? Click here to reset