Clustering Uncertain Data via Representative Possible Worlds with Consistency Learning

09/27/2019
by   Han Liu, et al.
3

Clustering uncertain data is an essential task in data mining for the internet of things. Possible world based algorithms seem promising for clustering uncertain data. However, there are two issues in existing possible world based algorithms: (1) They rely on all the possible worlds and treat them equally, but some marginal possible worlds may cause negative effects. (2) They do not well utilize the consistency among possible worlds, since they conduct clustering or construct the affinity matrix on each possible world independently. In this paper, we propose a representative possible world based consistent clustering (RPC) algorithm for uncertain data. First, by introducing representative loss and using Jensen-Shannon divergence as the distribution measure, we design a heuristic strategy for the selection of representative possible worlds, thus avoiding the negative effects caused by marginal possible worlds. Second, we integrate a consistency learning procedure into spectral clustering to deal with the representative possible worlds synergistically, thus utilizing the consistency to achieve better performance. Experimental results show that our proposed algorithm performs better than the state-of-the-art algorithms.

READ FULL TEXT

page 1

page 2

page 3

page 4

page 5

page 6

page 7

research
12/21/2019

Research on Clustering Performance of Sparse Subspace Clustering

Recently, sparse subspace clustering has been a valid tool to deal with ...
research
12/19/2021

TECM: Transfer Evidential C-means Clustering

Clustering is widely used in text analysis, natural language processing,...
research
10/24/2021

Improving Spectral Clustering Using Spectrum-Preserving Node Reduction

Spectral clustering is one of the most popular clustering methods. Howev...
research
12/06/2019

Improved Analysis of Spectral Algorithm for Clustering

Spectral algorithms are graph partitioning algorithms that partition a n...
research
11/01/2019

Regularized Non-negative Spectral Embedding for Clustering

Spectral Clustering is a popular technique to split data points into gro...
research
05/11/2023

Deep Multi-View Subspace Clustering with Anchor Graph

Deep multi-view subspace clustering (DMVSC) has recently attracted incre...
research
11/16/2010

A Probabilistic Approach for Learning Folksonomies from Structured Data

Learning structured representations has emerged as an important problem ...

Please sign up or login with your details

Forgot password? Click here to reset