Local Connectivity in Centroid Clustering

10/11/2020
by   Deepak P, et al.
0

Clustering is a fundamental task in unsupervised learning, one that targets to group a dataset into clusters of similar objects. There has been recent interest in embedding normative considerations around fairness within clustering formulations. In this paper, we propose 'local connectivity' as a crucial factor in assessing membership desert in centroid clustering. We use local connectivity to refer to the support offered by the local neighborhood of an object towards supporting its membership to the cluster in question. We motivate the need to consider local connectivity of objects in cluster assignment, and provide ways to quantify local connectivity in a given clustering. We then exploit concepts from density-based clustering and devise LOFKM, a clustering method that seeks to deepen local connectivity in clustering outputs, while staying within the framework of centroid clustering. Through an empirical evaluation over real-world datasets, we illustrate that LOFKM achieves notable improvements in local connectivity at reasonable costs to clustering quality, illustrating the effectiveness of the method.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/11/2020

Representativity Fairness in Clustering

Incorporating fairness constructs into machine learning algorithms is a ...
research
12/01/2022

Clustering – Basic concepts and methods

We review clustering as an analysis tool and the underlying concepts fro...
research
08/10/2018

Connectivity-Driven Brain Parcellation via Consensus Clustering

We present two related methods for deriving connectivity-based brain atl...
research
10/13/2017

Edge sampling using network local information

Edge sampling is an important topic in network analysis. It provides a n...
research
02/14/2019

A Probabilistic framework for Quantum Clustering

Quantum Clustering is a powerful method to detect clusters in data with ...
research
08/08/2022

Clustering Optimisation Method for Highly Connected Biological Data

Currently, data-driven discovery in biological sciences resides in findi...
research
08/09/2018

α-Approximation Density-based Clustering of Multi-valued Objects

Multi-valued data are commonly found in many real applications. During t...

Please sign up or login with your details

Forgot password? Click here to reset