Combining Multiple Clusterings via Crowd Agreement Estimation and Multi-Granularity Link Analysis

05/06/2014
by Dong Huang, et al.

The clustering ensemble technique aims to combine multiple clusterings into a probably better and more robust clustering, and it has received increasing attention in recent years. Existing clustering ensemble approaches have two main limitations. First, many approaches cannot weight the base clusterings without access to the original data, so they can be significantly affected by low-quality or even ill clusterings. Second, they generally focus on either the instance level or the cluster level of the ensemble system and fail to integrate multi-granularity cues into a unified model. To address these two limitations, this paper proposes to solve the clustering ensemble problem via crowd agreement estimation and multi-granularity link analysis. We present the normalized crowd agreement index (NCAI) to evaluate the quality of base clusterings in an unsupervised manner and thus weight the base clusterings according to their clustering validity. To explore the relationship between clusters, we introduce the source aware connected triple (SACT) similarity, which takes into account the clusters' common neighbors and the source reliability. Based on NCAI and the multi-granularity information collected among base clusterings, clusters, and data instances, we further propose two novel consensus functions, termed weighted evidence accumulation clustering (WEAC) and graph partitioning with multi-granularity link analysis (GP-MGLA), respectively. Experiments on eight real-world datasets demonstrate the effectiveness and robustness of the proposed methods.
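As a rough illustration of the weighting idea behind WEAC, the sketch below builds a weighted co-association matrix from a small ensemble of label vectors. The abstract does not define NCAI, so `agreement_weights` is a hypothetical stand-in that scores each base clustering by its mean Rand index against the other ensemble members; it is not the paper's index. All function names and the toy ensemble are assumptions made for this example.

```python
import numpy as np

def coassociation(labels):
    """Binary matrix: entry (i, j) is 1 when instances i and j share a cluster."""
    labels = np.asarray(labels)
    return (labels[:, None] == labels[None, :]).astype(float)

def rand_index(a, b):
    """Fraction of instance pairs on which two clusterings agree."""
    upper = np.triu_indices(len(a), k=1)
    return float(np.mean(coassociation(a)[upper] == coassociation(b)[upper]))

def agreement_weights(ensemble):
    """Hypothetical stand-in for NCAI (not the paper's definition):
    mean Rand index against the other members, normalized to sum to 1."""
    m = len(ensemble)
    scores = np.array([
        np.mean([rand_index(ensemble[i], ensemble[j]) for j in range(m) if j != i])
        for i in range(m)
    ])
    return scores / scores.sum()

def weighted_coassociation(ensemble, weights):
    """Weighted sum of per-clustering co-association matrices."""
    return sum(w * coassociation(lab) for w, lab in zip(weights, ensemble))

# Toy ensemble over six instances; the last clustering is deliberately poor.
ensemble = [
    [0, 0, 1, 1, 2, 2],
    [0, 0, 1, 1, 1, 1],
    [1, 1, 0, 0, 2, 2],
    [0, 1, 0, 1, 0, 1],
]
weights = agreement_weights(ensemble)
W = weighted_coassociation(ensemble, weights)
```

Here the outlying fourth clustering receives the smallest weight, so the pairs it alone groups together contribute little to `W`. A consensus clustering could then be obtained by running, for example, agglomerative clustering on `1 - W` as a distance matrix; that final step is left out to keep the sketch short.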

research
10/30/2018

Enhanced Ensemble Clustering via Fast Propagation of Cluster-wise Similarities

Ensemble clustering has been a popular research topic in data mining and...
research
06/03/2016

Robust Ensemble Clustering Using Probability Trajectories

Although many successful ensemble clustering approaches have been develo...
research
06/01/2022

DeepCluE: Enhanced Image Clustering via Multi-layer Ensembles in Deep Neural Networks

Deep clustering has recently emerged as a promising technique for comple...
research
04/23/2022

Selective clustering ensemble based on kappa and F-score

Clustering ensemble has an impressive performance in improving the accur...
research
12/16/2020

Clustering Ensemble Meets Low-rank Tensor Approximation

This paper explores the problem of clustering ensemble, which aims to co...
research
10/25/2016

Image Clustering without Ground Truth

Cluster analysis has become one of the most exercised research areas ove...
research
03/15/2017

Aggregation of Classifiers: A Justifiable Information Granularity Approach

In this study, we introduce a new approach to combine multi-classifiers ...
