A general framework for the IT-based clustering methods

06/19/2015
by Teng Qiu, et al.

Previously, we proposed a physically inspired rule that organizes data points into a sparse yet effective structure, called the in-tree (IT) graph, which can capture a wide class of underlying cluster structures, especially in density-based datasets. Although the IT graph contains some redundant edges between clusters that must be removed, it has a major advantage over the k-nearest-neighbor (k-NN) graph and the minimum spanning tree (MST): its redundant edges are much more distinguishable and can therefore be identified easily by several methods we proposed previously. In this paper, we propose a general framework for reconstructing the IT graph from an initial neighborhood graph, such as the k-NN graph or the MST, and the corresponding graph distances. Our previous way of constructing the IT graph turns out to be a special case of this framework. The framework 1) lets the IT graph capture a wider class of underlying cluster structures, especially manifolds, and 2) should be more effective for clustering sparse or graph-based datasets.
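The abstract does not spell out the construction rule, so the following is only a minimal sketch of the general idea under stated assumptions: build a symmetric k-NN graph, measure distances along that graph (shortest paths), assign each point a potential, and let each point link to its nearest point, in graph distance, among those with lower potential. The exponential potential, the `sigma` parameter, the tie-breaking by index, and every function name here are illustrative assumptions, not the authors' definitions.

```python
import heapq
import math


def knn_graph(points, k):
    """Symmetric k-NN graph: node -> {neighbor: Euclidean edge length}."""
    n = len(points)
    adj = {i: {} for i in range(n)}
    for i in range(n):
        dists = sorted(
            (math.dist(points[i], points[j]), j) for j in range(n) if j != i
        )
        for d, j in dists[:k]:
            adj[i][j] = d
            adj[j][i] = d  # symmetrize so graph distances are symmetric
    return adj


def graph_distances(adj, source):
    """Dijkstra shortest-path lengths from source; unreachable nodes stay inf."""
    dist = {v: math.inf for v in adj}
    dist[source] = 0.0
    heap = [(0.0, source)]
    while heap:
        d, u = heapq.heappop(heap)
        if d > dist[u]:
            continue  # stale heap entry
        for v, w in adj[u].items():
            nd = d + w
            if nd < dist[v]:
                dist[v] = nd
                heapq.heappush(heap, (nd, v))
    return dist


def in_tree(points, k=3, sigma=1.0):
    """Return (parent, potential); parent[i] is None for a root node."""
    n = len(points)
    adj = knn_graph(points, k)
    D = [graph_distances(adj, i) for i in range(n)]
    # Assumed potential: lower (more negative) where graph neighbors are dense.
    potential = [
        -sum(math.exp(-D[i][j] / sigma) for j in range(n) if j != i)
        for i in range(n)
    ]
    parent = [None] * n
    for i in range(n):
        # Candidates: reachable nodes with strictly lower (potential, index).
        candidates = [
            (D[i][j], j)
            for j in range(n)
            if (potential[j], j) < (potential[i], i) and D[i][j] < math.inf
        ]
        if candidates:
            parent[i] = min(candidates)[1]  # nearest candidate in graph distance
    return parent, potential
```

Because every edge points to a node that is strictly lower in the (potential, index) ordering, the result contains no cycles. Replacing `knn_graph` with another neighborhood graph changes only the graph distances that the descent step uses; in this sketch, a disconnected initial graph yields one root per connected component.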


research
07/29/2015

IT-Dendrogram: A New Member of the In-Tree (IT) Clustering Family

Previously, we proposed a physically-inspired method to construct data p...
research
02/16/2015

Clustering by Descending to the Nearest Neighbor in the Delaunay Graph Space

In our previous works, we proposed a physically-inspired rule to organiz...
research
07/30/2013

Scalable k-NN graph construction

The k-NN graph has played a central role in increasingly popular data-dr...
research
05/03/2011

Pruning nearest neighbor cluster trees

Nearest neighbor (k-NN) graphs are widely used in machine learning and d...
research
06/11/2020

Faster DBSCAN via subsampled similarity queries

DBSCAN is a popular density-based clustering algorithm. It computes the ...
research
09/11/2017

On the definition of Shape Parts: a Dominant Sets Approach

In the present paper a novel graph-based approach to the shape decomposi...
research
09/09/2015

Clustering by Hierarchical Nearest Neighbor Descent (H-NND)

Previously in 2014, we proposed the Nearest Descent (ND) method, capable...
