Representativity Fairness in Clustering

10/11/2020
by   Deepak P, et al.
0

Incorporating fairness constructs into machine learning algorithms is a topic of much societal importance and recent interest. Clustering, a fundamental task in unsupervised learning that manifests across a number of web data scenarios, has also been subject of attention within fair ML research. In this paper, we develop a novel notion of fairness in clustering, called representativity fairness. Representativity fairness is motivated by the need to alleviate disparity across objects' proximity to their assigned cluster representatives, to aid fairer decision making. We illustrate the importance of representativity fairness in real-world decision making scenarios involving clustering and provide ways of quantifying objects' representativity and fairness over it. We develop a new clustering formulation, RFKM, that targets to optimize for representativity fairness along with clustering quality. Inspired by the K-Means framework, RFKM incorporates novel loss terms to formulate an objective function. The RFKM objective and optimization approach guides it towards clustering configurations that yield higher representativity fairness. Through an empirical evaluation over a variety of public datasets, we establish the effectiveness of our method. We illustrate that we are able to significantly improve representativity fairness at only marginal impact to clustering quality.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/11/2019

Fairness in Clustering with Multiple Sensitive Attributes

A clustering may be considered as fair on pre-specified sensitive attrib...
research
10/11/2020

Local Connectivity in Centroid Clustering

Clustering is a fundamental task in unsupervised learning, one that targ...
research
05/27/2023

Fair Clustering via Hierarchical Fair-Dirichlet Process

The advent of ML-driven decision-making and policy formation has led to ...
research
07/08/2020

Whither Fair Clustering?

Within the relatively busy area of fair machine learning that has been d...
research
04/27/2023

Proportionally Representative Clustering

In recent years, there has been a surge in effort to formalize notions o...
research
06/14/2021

Fair Clustering Under a Bounded Cost

Clustering is a fundamental unsupervised learning problem where a datase...
research
02/06/2023

Fair Minimum Representation Clustering

Clustering is an unsupervised learning task that aims to partition data ...

Please sign up or login with your details

Forgot password? Click here to reset