Spectral Clustering with Smooth Tiny Clusters

09/10/2020
by   Hengrui Wang, et al.
9

Spectral clustering is one of the most prominent clustering approaches. The distance-based similarity is the most widely used method for spectral clustering. However, people have already noticed that this is not suitable for multi-scale data, as the distance varies a lot for clusters with different densities. State of the art(ROSC and CAST ) addresses this limitation by taking the reachability similarity of objects into account. However, we observe that in real-world scenarios, data in the same cluster tend to present in a smooth manner, and previous algorithms never take this into account. Based on this observation, we propose a novel clustering algorithm, which con-siders the smoothness of data for the first time. We first divide objects into a great many tiny clusters. Our key idea is to cluster tiny clusters, whose centers constitute smooth graphs. Theoretical analysis and experimental results show that our clustering algorithm significantly outperforms state of the art. Although in this paper, we singly focus on multi-scale situations, the idea of data smoothness can certainly be extended to any clustering algorithms

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/21/2020

KCoreMotif: An Efficient Graph Clustering Algorithm for Large Networks by Exploiting k-core Decomposition and Motifs

Clustering analysis has been widely used in trust evaluation on various ...
research
01/08/2010

Spectral clustering based on local linear approximations

In the context of clustering, we assume a generative model where each cl...
research
02/06/2019

An Automated Spectral Clustering for Multi-scale Data

Spectral clustering algorithms typically require a priori selection of i...
research
11/11/2017

Differential Performance Debugging with Discriminant Regression Trees

Differential performance debugging is a technique to find performance pr...
research
05/15/2019

EasiCS: the objective and fine-grained classification method of cervical spondylosis dysfunction

The precise diagnosis is of great significance in developing precise tre...
research
11/20/2014

Clustering evolving data using kernel-based methods

In this thesis, we propose several modelling strategies to tackle evolvi...
research
11/10/2020

Higher-Order Spectral Clustering of Directed Graphs

Clustering is an important topic in algorithms, and has a number of appl...

Please sign up or login with your details

Forgot password? Click here to reset