ck-means, a novel unsupervised learning method that combines fuzzy and crispy clustering methods to extract intersecting data

Clustering is a common task in unsupervised machine learning. Most algorithms aim to extract consistent clusters of data, but very few are designed to cluster data that share the same intersection between two or more features. This paper proposes a method to do so. The method first generates fuzzy clusters of data using a Fuzzy C-Means (FCM) algorithm. It then applies a filter that selects a range of minimum and maximum membership values, emphasizing the border data; a μ parameter defines the amplitude of this range. Finally, it applies a k-means algorithm to the membership values generated by the FCM, so that data with similar membership values regroup into a new crisp cluster. The algorithm also finds the optimal number of clusters for both the FCM and the k-means steps, according to the consistency of the clusters as measured by the Silhouette Index (SI). The result is a list of data and clusters in which each cluster regroups data sharing the same intersection between two or more features. ck-means thus extracts very similar data that do not naturally fall into the same cluster but lie at the intersection of two or more clusters.
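The three stages described in the abstract (FCM, membership filter, k-means on the membership values) can be sketched in plain Python as follows. This is an illustrative sketch under stated assumptions, not the authors' implementation: the abstract does not give the exact filter rule, so the window `0.5 - mu <= max membership <= 0.5 + mu` is a hypothetical stand-in, and the function names, the deterministic farthest-first seeding, and the toy data are all invented for the example. The numbers of clusters `c` and `k` are fixed by hand here; the Silhouette Index selection is left out.

```python
import math

def dist(p, q):
    """Euclidean distance between two equal-length coordinate sequences."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))

def farthest_first(vectors, k):
    """Deterministic seeding (an assumption of this sketch): take the first
    vector, then repeatedly add the vector farthest from the chosen seeds."""
    seeds = [vectors[0]]
    while len(seeds) < k:
        seeds.append(max(vectors, key=lambda v: min(dist(v, s) for s in seeds)))
    return [list(s) for s in seeds]

def memberships(points, centers, m):
    """Standard FCM membership update for a fuzzifier m > 1."""
    rows = []
    for p in points:
        inv = [max(dist(p, c), 1e-12) ** (-2.0 / (m - 1.0)) for c in centers]
        total = sum(inv)
        rows.append([v / total for v in inv])
    return rows

def fcm(points, c, m=2.0, iters=100):
    """Minimal Fuzzy C-Means; returns one membership row per point."""
    centers = farthest_first(points, c)
    dim = len(points[0])
    for _ in range(iters):
        u = memberships(points, centers, m)
        for j in range(c):
            w = [row[j] ** m for row in u]
            centers[j] = [sum(wi * p[d] for wi, p in zip(w, points)) / sum(w)
                          for d in range(dim)]
    return memberships(points, centers, m)

def kmeans(vectors, k, iters=50):
    """Minimal Lloyd's k-means; returns one crisp label per vector."""
    centers = farthest_first(vectors, k)
    labels = [0] * len(vectors)
    for _ in range(iters):
        labels = [min(range(k), key=lambda j: dist(v, centers[j]))
                  for v in vectors]
        for j in range(k):
            members = [v for v, lab in zip(vectors, labels) if lab == j]
            if members:  # an empty cluster keeps its previous center
                centers[j] = [sum(col) / len(members) for col in zip(*members)]
    return labels

def ck_means_sketch(points, c, k, mu):
    """FCM, then a border filter, then k-means on the membership rows."""
    u = fcm(points, c)
    # ASSUMPTION: hypothetical filter rule, not taken from the paper.
    # A point counts as "border" if its largest membership is near 0.5.
    kept = [i for i, row in enumerate(u) if 0.5 - mu <= max(row) <= 0.5 + mu]
    labels = kmeans([u[i] for i in kept], k) if len(kept) >= k else []
    return kept, labels

# Toy data: three compact groups on a line, plus two points between the
# first and second group and two points between the second and third.
points = [(0, 0), (1, 0), (0, 1), (1, 1),
          (10, 0), (11, 0), (10, 1), (11, 1),
          (20, 0), (21, 0), (20, 1), (21, 1),
          (5.4, 0.5), (5.6, 0.5), (15.4, 0.5), (15.6, 0.5)]

kept, labels = ck_means_sketch(points, c=3, k=2, mu=0.2)
print("border point indices:", kept)
print("crisp labels of border points:", labels)
```

On this toy data the filter retains only the four in-between points, and k-means on their membership rows puts the two points between the first and second group in one crisp cluster and the two points between the second and third group in the other. That is the behaviour the abstract describes: data with similar membership values regroup together.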


