Fair Feature Subset Selection using Multiobjective Genetic Algorithm

04/30/2022
by Ayaz Ur Rehman, et al.

The feature subset selection problem aims to select a relevant subset of features that improves the performance of a Machine Learning (ML) algorithm on training data. Some features can be inherently noisy, costly to compute, improperly scaled, or correlated with other features, and they can adversely affect the accuracy, cost, and complexity of the induced algorithm. The goal of traditional feature selection approaches has been to remove such irrelevant features. In recent years, ML has had a noticeable impact on the decision-making processes of everyday life. We want to ensure that these decisions do not reflect biased behavior towards certain groups or individuals based on protected attributes such as age, sex, or race. In this paper, we present a feature subset selection approach that improves both fairness and accuracy objectives and computes Pareto-optimal solutions using the NSGA-II algorithm. We use statistical disparity as the fairness metric and F1-score as the metric for model performance. Our experiments on the most commonly used fairness benchmark datasets with three different machine learning algorithms show that the evolutionary algorithm can effectively explore the trade-off between fairness and accuracy.
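
To make the two objectives concrete, the following is a minimal sketch, not the authors' implementation. It assumes binary labels, a binary protected attribute, and that "statistical disparity" means the absolute difference in positive-prediction rates between the two groups; the abstract does not give the exact definition. The function names are hypothetical. The `pareto_front` helper shows only the non-dominated filtering step on which NSGA-II ranking is built, not the full genetic algorithm.

```python
from typing import List, Sequence, Tuple


def f1_score(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """F1-score for binary labels, with 1 as the positive class."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def statistical_disparity(y_pred: Sequence[int], protected: Sequence[int]) -> float:
    """Assumed definition: |P(pred=1 | group 0) - P(pred=1 | group 1)|."""
    g0 = [p for p, a in zip(y_pred, protected) if a == 0]
    g1 = [p for p, a in zip(y_pred, protected) if a == 1]
    if not g0 or not g1:
        return 0.0  # disparity is undefined with an empty group; treat as 0
    return abs(sum(g0) / len(g0) - sum(g1) / len(g1))


def dominates(a: Tuple[float, float], b: Tuple[float, float]) -> bool:
    """a and b are (f1, disparity) pairs: maximize F1, minimize disparity."""
    no_worse = a[0] >= b[0] and a[1] <= b[1]
    strictly_better = a[0] > b[0] or a[1] < b[1]
    return no_worse and strictly_better


def pareto_front(scores: List[Tuple[float, float]]) -> List[int]:
    """Indices of candidates that no other candidate dominates."""
    return [
        i
        for i, s in enumerate(scores)
        if not any(dominates(o, s) for j, o in enumerate(scores) if j != i)
    ]


# Hypothetical (f1, disparity) scores for four candidate feature subsets.
candidates = [(0.80, 0.20), (0.75, 0.05), (0.70, 0.10), (0.80, 0.25)]
front = pareto_front(candidates)
```

In this toy example the first two candidates form the front: one is more accurate, the other is fairer, and neither is better on both objectives. In a full pipeline each candidate would be a feature mask, scored by training a classifier on the selected columns.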

research
06/01/2021

Information Theoretic Measures for Fairness-aware Feature Selection

Machine learning algorithms are increasingly used for consequential deci...
research
07/13/2022

Towards A Holistic View of Bias in Machine Learning: Bridging Algorithmic Fairness and Imbalanced Learning

Machine learning (ML) is playing an increasingly important role in rende...
research
02/05/2023

Fair Spatial Indexing: A paradigm for Group Spatial Fairness

Machine learning (ML) is playing an increasing role in decision-making t...
research
06/28/2023

Systematic analysis of the impact of label noise correction on ML Fairness

Arbitrary, inconsistent, or faulty decision-making raises serious concer...
research
04/28/2020

Genetic programming approaches to learning fair classifiers

Society has come to rely on algorithms like classifiers for important de...
research
04/06/2023

The Eyes Have It!: Using Human-Selected Features for Predicting Athletes' Performance

Predicting athletes' performance has relied mostly on statistical data. ...
research
07/20/2020

Crowd, Lending, Machine, and Bias

Big data and machine learning (ML) algorithms are key drivers of many fi...
