Fair Clustering for Diverse and Experienced Groups

06/10/2020
by   Ilya Amburg, et al.
0

The ability for machine learning to exacerbate bias has led to many algorithms centered on fairness. For example, fair clustering algorithms typically focus on balanced representation of protected attributes within clusters. Here, we develop a fair clustering variant where the input data is a hypergraph with multiple edge types, representing information about past experiences of groups of individuals. Our method is based on diversity of experience, instead of protected attributes, with a goal of forming groups that have both experience and diversity with respect to participation in edge types. We model this goal with a regularized edge-based clustering objective, design an efficient 2-approximation algorithm for optimizing the NP-hard objective, and provide bounds on hyperparameters to avoid trivial solutions. We demonstrate a potential application of this framework in online review platforms, where the goal is to curate sets of user reviews for a product type. In this context, "experience" corresponds to users familiar with the type of product, and "diversity" to users that have reviewed related products.

READ FULL TEXT
research
04/10/2019

Attraction-Repulsion clustering with applications to fairness

In the framework of fair learning, we consider clustering methods that a...
research
05/27/2023

Fair Clustering via Hierarchical Fair-Dirichlet Process

The advent of ML-driven decision-making and policy formation has led to ...
research
02/03/2022

Fair Representation Clustering with Several Protected Classes

We study the problem of fair k-median where each cluster is required to ...
research
05/08/2021

Protecting Individual Interests across Clusters: Spectral Clustering with Guarantees

Studies related to fairness in machine learning have recently gained tra...
research
02/15/2018

Fair Clustering Through Fairlets

We study the question of fair clustering under the disparate impact doc...
research
04/25/2021

Fair-Capacitated Clustering

Traditionally, clustering algorithms focus on partitioning the data into...
research
10/22/2019

Hypergraph clustering with categorical edge labels

Graphs and networks are a standard model for describing data or systems ...

Please sign up or login with your details

Forgot password? Click here to reset