An Entropy-based Variable Feature Weighted Fuzzy k-Means Algorithm for High Dimensional Data

12/24/2019
by Vikas Singh, et al.

This paper presents a new fuzzy k-means algorithm for clustering high-dimensional data in various subspaces. In high-dimensional data, some features may be irrelevant, and the relevant ones may differ in their significance for the clustering. For better clustering, it is crucial to account for the contribution of each feature in the clustering process. To this end, we propose a new fuzzy k-means clustering algorithm in which the objective function of fuzzy k-means is modified using two different entropy terms. The first entropy term helps minimize the within-cluster dispersion while maximizing the negative entropy, which determines how the clusters contribute to the association of data points. The second entropy term helps control the feature weights, because different features contribute with different weights to obtaining a better partition of the data. The efficacy of the proposed method is evaluated in terms of various clustering measures on multiple datasets and compared with various state-of-the-art methods.
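
The abstract does not state the objective function, so the following is only a minimal sketch of how a fuzzy k-means with two entropy terms might be iterated, not the authors' formulation. It assumes a generic entropy-regularized objective: weighted within-cluster dispersion, plus `gamma` times the membership entropy term, plus `lam` times the feature-weight entropy term. The function name, the parameters `gamma` and `lam`, and the caller-supplied `init_centers` are all hypothetical choices made for illustration.

```python
import math


def softmin(values, temperature):
    """Return weights proportional to exp(-v / temperature) that sum to 1."""
    lowest = min(values)  # shift by the minimum for numerical stability
    exps = [math.exp(-(v - lowest) / temperature) for v in values]
    total = sum(exps)
    return [e / total for e in exps]


def entropy_weighted_fuzzy_kmeans(data, init_centers, gamma=1.0, lam=1.0, iters=50):
    """Alternate membership, center, and per-cluster feature-weight updates.

    Assumed objective (an illustration, NOT taken from the paper):
        sum_ij u_ij * sum_m w_jm * (x_im - c_jm)^2
        + gamma * sum_ij u_ij * log(u_ij)
        + lam   * sum_jm w_jm * log(w_jm)
    with each row of u and each row of w constrained to sum to 1.
    """
    n, d, k = len(data), len(data[0]), len(init_centers)
    centers = [list(c) for c in init_centers]
    weights = [[1.0 / d] * d for _ in range(k)]  # start with equal feature weights
    u = []
    for _ in range(iters):
        # Memberships: softmin over feature-weighted squared distances.
        u = []
        for x in data:
            dist = [
                sum(weights[j][m] * (x[m] - centers[j][m]) ** 2 for m in range(d))
                for j in range(k)
            ]
            u.append(softmin(dist, gamma))
        for j in range(k):
            # Centers: membership-weighted mean of the data.
            mass = sum(u[i][j] for i in range(n))
            if mass == 0.0:
                continue  # leave an empty cluster where it is
            centers[j] = [
                sum(u[i][j] * data[i][m] for i in range(n)) / mass for m in range(d)
            ]
            # Feature weights: softmin over per-feature within-cluster dispersion,
            # so features with low dispersion in this cluster get more weight.
            disp = [
                sum(u[i][j] * (data[i][m] - centers[j][m]) ** 2 for i in range(n))
                for m in range(d)
            ]
            weights[j] = softmin(disp, lam)
    return u, centers, weights
```

Under these assumptions, a larger `gamma` makes the memberships fuzzier and a larger `lam` pushes the feature weights toward uniform, while small values approach hard assignments and single-feature subspaces. Like other k-means variants, the result depends on the initial centers.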


