Deep Fair Clustering via Maximizing and Minimizing Mutual Information

09/26/2022
by   Pengxin Zeng, et al.
0

Fair clustering aims to divide data into distinct clusters, while preventing sensitive attributes (e.g., gender, race, RNA sequencing technique) from dominating the clustering. Although a number of works have been conducted and achieved huge success in recent, most of them are heuristical, and there lacks a unified theory for algorithm design. In this work, we fill this blank by developing a mutual information theory for deep fair clustering and accordingly designing a novel algorithm, dubbed FCMI. In brief, through maximizing and minimizing mutual information, FCMI is designed to achieve four characteristics highly expected by deep fair clustering, i.e., compact, balanced, and fair clusters, as well as informative features. Besides the contributions to theory and algorithm, another contribution of this work is proposing a novel fair clustering metric built upon information theory as well. Unlike existing evaluation metrics, our metric measures the clustering quality and fairness in a whole instead of separate manner. To verify the effectiveness of the proposed FCMI, we carry out experiments on six benchmarks including a single-cell RNA-seq atlas compared with 11 state-of-the-art methods in terms of five metrics. Code will be released after the acceptance.

READ FULL TEXT
research
10/12/2022

Generalised Mutual Information for Discriminative Clustering

In the last decade, recent successes in deep clustering majorly involved...
research
09/06/2023

Generalised Mutual Information: a Framework for Discriminative Clustering

In the last decade, recent successes in deep clustering majorly involved...
research
04/10/2019

Attraction-Repulsion clustering with applications to fairness

In the framework of fair learning, we consider clustering methods that a...
research
10/03/2019

Information based Deep Clustering: An experimental study

Recently, two methods have shown outstanding performance for clustering ...
research
03/23/2021

Pairwise Adjusted Mutual Information

A well-known metric for quantifying the similarity between two clusterin...
research
02/21/2023

Scalable Infomin Learning

The task of infomin learning aims to learn a representation with high ut...
research
02/08/2021

Learning to Generate Fair Clusters from Demonstrations

Fair clustering is the process of grouping similar entities together, wh...

Please sign up or login with your details

Forgot password? Click here to reset