Distributional Clustering: A distribution-preserving clustering method

11/14/2019
by   Arvind Krishna, et al.
0

One key use of k-means clustering is to identify cluster prototypes which can serve as representative points for a dataset. However, a drawback of using k-means cluster centers as representative points is that such points distort the distribution of the underlying data. This can be highly disadvantageous in problems where the representative points are subsequently used to gain insights on the data distribution, as these points do not mimic the distribution of the data. To this end, we propose a new clustering method called "distributional clustering", which ensures cluster centers capture the distribution of the underlying data. We first prove the asymptotic convergence of the proposed cluster centers to the data generating distribution, then present an efficient algorithm for computing these cluster centers in practice. Finally, we demonstrate the effectiveness of distributional clustering on synthetic and real datasets.

READ FULL TEXT
research
06/19/2020

Fair clustering via equitable group representations

What does it mean for a clustering to be fair? One popular approach seek...
research
12/11/2013

Fast Approximate K-Means via Cluster Closures

K-means, a simple and effective clustering algorithm, is one of the most...
research
06/19/2019

Robust Clustering Using Tau-Scales

K means is a popular non-parametric clustering procedure introduced by S...
research
04/17/2023

K-means Clustering Based Feature Consistency Alignment for Label-free Model Evaluation

The label-free model evaluation aims to predict the model performance on...
research
02/10/2023

Neural Capacitated Clustering

Recent work on deep clustering has found new promising methods also for ...
research
06/23/2014

Further heuristics for k-means: The merge-and-split heuristic and the (k,l)-means

Finding the optimal k-means clustering is NP-hard in general and many he...
research
02/03/2021

Validating Optimal COVID-19 Vaccine Distribution Models

With the approval of vaccines for the coronavirus disease by many countr...

Please sign up or login with your details

Forgot password? Click here to reset