Adaptive Sampling for Minimax Fair Classification

03/01/2021
by   Shubhanshu Shekhar, et al.
0

Machine learning models trained on imbalanced datasets can often end up adversely affecting inputs belonging to the underrepresented groups. To address this issue, we consider the problem of adaptively constructing training sets which allow us to learn classifiers that are fair in a minimax sense. We first propose an adaptive sampling algorithm based on the principle of optimism, and derive theoretical bounds on its performance. We then suitably adapt the techniques developed for the analysis of our proposed algorithm to derive bounds on the performance of a related ϵ-greedy strategy recently proposed in the literature. Next, by deriving algorithm independent lower-bounds for a specific class of problems, we show that the performance achieved by our adaptive scheme cannot be improved in general. We then validate the benefits of adaptively constructing training sets via experiments on synthetic tasks with logistic regression classifiers, as well as on several real-world tasks using convolutional neural networks.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/04/2020

Multiclass classification by sparse multinomial logistic regression

In this paper we consider high-dimensional multiclass classification by ...
research
07/06/2022

Careful seeding for the k-medoids algorithm with incremental k++ cluster construction

The k-medoids algorithm is a popular variant of the k-means algorithm an...
research
10/03/2019

Minimax Bounds for Distributed Logistic Regression

We consider a distributed logistic regression problem where labeled data...
research
01/15/2022

Training Fair Deep Neural Networks by Balancing Influence

Most fair machine learning methods either highly rely on the sensitive i...
research
02/16/2021

Lexicographically Fair Learning: Algorithms and Generalization

We extend the notion of minimax fairness in supervised learning problems...
research
11/13/2021

On the Statistical Benefits of Curriculum Learning

Curriculum learning (CL) is a commonly used machine learning training st...

Please sign up or login with your details

Forgot password? Click here to reset