Discovering the Graph Structure in the Clustering Results

05/18/2017
by   Evgeny Bauman, et al.
0

In a standard cluster analysis, such as k-means, in addition to clusters locations and distances between them, it's important to know if they are connected or well separated from each other. The main focus of this paper is discovering the relations between the resulting clusters. We propose a new method which is based on pairwise overlapping k-means clustering, that in addition to means of clusters provides the graph structure of their relations. The proposed method has a set of parameters that can be tuned in order to control the sensitivity of the model and the desired relative size of the pairwise overlapping interval between means of two adjacent clusters, i.e., level of overlapping. We present the exact formula for calculating that parameter. The empirical study presented in the paper demonstrates that our approach works well not only on toy data but also compliments standard clustering results with a reasonable graph structure on real datasets, such as financial indices and restaurants.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/29/2012

Classification Recouvrante Basée sur les Méthodes à Noyau

Overlapping clustering problem is an important learning issue in which c...
research
05/27/2023

Overlapping Indices for Dynamic Information Borrowing in Bayesian Hierarchical Modeling

Bayesian hierarchical model (BHM) has been widely used in synthesizing i...
research
04/13/2020

Exact recovery and sharp thresholds of Stochastic Ising Block Model

The stochastic block model (SBM) is a random graph model in which the ed...
research
03/19/2019

Predictive Clustering

We show how to convert any clustering into a prediction set. This has th...
research
09/08/2016

Functorial Hierarchical Clustering with Overlaps

This work draws its inspiration from three important sources of research...
research
08/02/2023

Are Easy Data Easy (for K-Means)

This paper investigates the capability of correctly recovering well-sepa...
research
04/28/2021

SMLSOM: The shrinking maximum likelihood self-organizing map

Determining the number of clusters in a dataset is a fundamental issue i...

Please sign up or login with your details

Forgot password? Click here to reset