Fairness-aware Multi-view Clustering

02/11/2023
by   Lecheng Zheng, et al.

In the era of big data, we often face the challenges of data heterogeneity and missing label information at the same time. In the financial domain (e.g., fraud detection), heterogeneous data may include not only numerical data (e.g., total debt and yearly income) but also text and images (e.g., financial statements and invoice images). Meanwhile, the label information (e.g., which transactions are fraudulent) needed to build predictive models may be missing. To address these challenges, many state-of-the-art multi-view clustering methods have been proposed and have achieved outstanding performance. However, these methods typically do not account for fairness and are likely to produce results that are biased with respect to sensitive attributes such as race and gender. Therefore, in this paper, we propose a fairness-aware multi-view clustering method named FairMVC. It incorporates a group fairness constraint into the soft membership assignment for each cluster, so that the fraction of each group in every cluster approximately matches its fraction in the entire data set. In addition, we draw on ideas from both contrastive and non-contrastive learning and propose novel regularizers to handle heterogeneous data in complex scenarios with missing data or noisy features. Experimental results on real-world data sets demonstrate the effectiveness and efficiency of the proposed framework. We also derive insights regarding the relative performance of the proposed regularizers in various scenarios.
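
The group fairness property described in the abstract (each group's share of a cluster should roughly match its share of the whole data set) can be checked numerically. The following is a minimal sketch of such a check, not the authors' implementation: the function names `group_fractions_per_cluster` and `fairness_gap`, the toy data, and the choice of the maximum absolute deviation as the gap measure are all assumptions made for illustration.

```python
import numpy as np


def group_fractions_per_cluster(soft_assign, groups, n_groups):
    """Return a (k, g) matrix whose row c holds the group fractions in cluster c.

    soft_assign: (n, k) array of soft memberships, each row summing to 1.
    groups:      (n,) integer array of sensitive-group ids in [0, n_groups).
    Hypothetical helper for illustration only.
    """
    onehot = np.eye(n_groups)[groups]          # (n, g) group indicator matrix
    mass = soft_assign.T @ onehot              # (k, g) soft count of each group per cluster
    totals = mass.sum(axis=1, keepdims=True)   # (k, 1) total soft mass per cluster
    # Guard against empty clusters to avoid division by zero.
    return mass / np.maximum(totals, 1e-12)


def fairness_gap(soft_assign, groups, n_groups):
    """Largest absolute deviation between per-cluster and overall group fractions."""
    per_cluster = group_fractions_per_cluster(soft_assign, groups, n_groups)
    overall = np.bincount(groups, minlength=n_groups) / len(groups)
    return float(np.abs(per_cluster - overall).max())


# Toy data: four points, two groups, two clusters (all values are made up).
groups = np.array([0, 1, 0, 1])

# Each cluster contains one point from each group.
balanced = np.array([[1.0, 0.0], [1.0, 0.0], [0.0, 1.0], [0.0, 1.0]])

# Each cluster contains points from a single group only.
biased = np.array([[1.0, 0.0], [0.0, 1.0], [1.0, 0.0], [0.0, 1.0]])

print(fairness_gap(balanced, groups, 2))
print(fairness_gap(biased, groups, 2))
```

A gap near zero means every cluster mirrors the overall group proportions; a method in the spirit of the abstract would keep this quantity small while still optimizing clustering quality. How the paper itself formulates and enforces the constraint is not specified in this abstract.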

research
05/19/2021

Heterogeneous Contrastive Learning

With the advent of big data across multiple high-impact applications, we...
research
10/22/2021

Multi-view Contrastive Graph Clustering

With the explosive growth of information technology, multi-view graph da...
research
04/15/2020

A Feature-Reduction Multi-View k-Means Clustering Algorithm

The k-means clustering algorithm is the oldest and most known method in ...
research
09/01/2023

Asymmetric double-winged multi-view clustering network for exploring Diverse and Consistent Information

In unsupervised scenarios, deep contrastive multi-view clustering (DCMVC...
research
01/01/2018

Error-Robust Multi-View Clustering

In the era of big data, data may come from multiple sources, known as mu...
research
11/21/2019

Large-scale Multi-view Subspace Clustering in Linear Time

A plethora of multi-view subspace clustering (MVSC) methods have been pr...
research
05/30/2023

Adapting Fairness Interventions to Missing Values

Missing values in real-world data pose a significant and unique challeng...
