Clustering for Different Scales of Measurement - the Gap-Ratio Weighted K-means Algorithm

03/22/2017
by   Joris Guérin, et al.

This paper describes a method for clustering data that are spread over large regions and whose dimensions are on different scales of measurement. The algorithm was developed for a robotics application that sorts and stores objects in an unsupervised way. The toy dataset used to validate this application consists of Lego bricks of different shapes and colors. Uncontrolled lighting conditions and the use of RGB color features produce, respectively, data with a large spread and different levels of measurement across data dimensions. To handle the combination of these two characteristics, we have developed a new weighted K-means algorithm, called gap-ratio K-means, which weights each dimension of the feature space before running the K-means algorithm. The weight associated with a feature is proportional to the ratio of the largest gap between two consecutive data points to the average of all the other gaps. This method is compared with two other variants of K-means on the Lego bricks clustering problem as well as on two other common classification datasets.
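
As a rough illustration of the weighting rule stated in the abstract, the following sketch computes one weight per feature as the largest gap between consecutive sorted values divided by the mean of the remaining gaps. It is only a reading of the abstract, not the authors' implementation: the function names `gap_ratio_weights` and `scale_features`, the proportionality constant of 1, the fallback weight of 1.0 for degenerate cases, and the choice to multiply the features directly by the weights (rather than weighting the squared distances) are all assumptions made here. Any pre-normalization of the features is likewise left out.

```python
def gap_ratio_weights(data):
    """Return one weight per dimension for a list of equal-length rows.

    Weight = largest gap between consecutive sorted values
             / mean of all the other gaps (assumed constant of 1).
    """
    n_dims = len(data[0])
    weights = []
    for d in range(n_dims):
        values = sorted(row[d] for row in data)
        gaps = [b - a for a, b in zip(values, values[1:])]
        if len(gaps) < 2:
            # Assumption: with fewer than 3 points the ratio is undefined.
            weights.append(1.0)
            continue
        largest = max(gaps)
        others = list(gaps)
        others.remove(largest)  # drop a single occurrence of the largest gap
        mean_others = sum(others) / len(others)
        # Assumption: fall back to 1.0 when all other gaps are zero.
        weights.append(largest / mean_others if mean_others > 0 else 1.0)
    return weights


def scale_features(data, weights):
    """Multiply each feature by its weight before running K-means."""
    return [[x * w for x, w in zip(row, weights)] for row in data]


# Hypothetical toy data: dimension 0 has one clear gap, dimension 1 is evenly spread.
points = [[0.0, 0.0], [1.0, 1.0], [2.0, 2.0], [10.0, 3.0]]
weights = gap_ratio_weights(points)
scaled = scale_features(points, weights)
```

In this toy example the first dimension, which contains a single large gap, receives a higher weight than the evenly spread second dimension, so it would dominate the distances used by a subsequent K-means run on `scaled`.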


research
03/11/2013

Improved Performance of Unsupervised Method by Renovated K-Means

Clustering is a separation of data into groups of similar objects. Every...
research
06/04/2021

Entropy K-Means Clustering With Feature Reduction Under Unknown Number of Clusters

The k-means algorithm with its extensions is the most used clustering me...
research
05/10/2020

Improving The Performance Of The K-means Algorithm

The Incremental K-means (IKM), an improved version of K-means (KM), was ...
research
07/06/2017

CNN features are also great at unsupervised classification

This paper aims at providing insight on the transferability of deep CNN ...
research
03/24/2013

Generalizing k-means for an arbitrary distance matrix

The original k-means clustering method works only if the exact vectors r...
research
04/12/2018

Unsupervised robotic sorting: Towards autonomous decision making robots

Autonomous sorting is a crucial task in industrial robotics which can be...
research
04/23/2019

Heterofusion: Fusing genomics data of different measurement scales

In systems biology, it is becoming increasingly common to measure bioche...
