Learning to Generate Fair Clusters from Demonstrations

02/08/2021
by   Sainyam Galhotra, et al.
3

Fair clustering is the process of grouping similar entities together, while satisfying a mathematically well-defined fairness metric as a constraint. Due to the practical challenges in precise model specification, the prescribed fairness constraints are often incomplete and act as proxies to the intended fairness requirement, leading to biased outcomes when the system is deployed. We examine how to identify the intended fairness constraint for a problem based on limited demonstrations from an expert. Each demonstration is a clustering over a subset of the data. We present an algorithm to identify the fairness metric from demonstrations and generate clusters using existing off-the-shelf clustering techniques, and analyze its theoretical properties. To extend our approach to novel fairness metrics for which clustering algorithms do not currently exist, we present a greedy method for clustering. Additionally, we investigate how to generate interpretable solutions using our approach. Empirical evaluation on three real-world datasets demonstrates the effectiveness of our approach in quickly identifying the underlying fairness and interpretability constraints, which are then used to generate fair and interpretable clusters.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/28/2022

Fair Labeled Clustering

Numerous algorithms have been produced for the fundamental problem of cl...
research
12/29/2022

Cluster-level Group Representativity Fairness in k-means Clustering

There has been much interest recently in developing fair clustering algo...
research
06/01/2021

Fair Clustering Using Antidote Data

Clustering algorithms are widely utilized for many modern data science a...
research
09/02/2021

Efficient Algorithms For Fair Clustering with a New Fairness Notion

We revisit the problem of fair clustering, first introduced by Chieriche...
research
04/04/2020

Abstracting Fairness: Oracles, Metrics, and Interpretability

It is well understood that classification algorithms, for example, for d...
research
03/03/2023

FairShap: A Data Re-weighting Approach for Algorithmic Fairness based on Shapley Values

In this paper, we propose FairShap, a novel and interpretable pre-proces...
research
09/26/2022

Deep Fair Clustering via Maximizing and Minimizing Mutual Information

Fair clustering aims to divide data into distinct clusters, while preven...

Please sign up or login with your details

Forgot password? Click here to reset