Novel Feature-Based Clustering of Micro-Panel Data (CluMP)

07/16/2018
by Lukas Sobisek, et al.

Micro-panel data are collected and analysed in many research and industry areas. Cluster analysis of micro-panel data is an exploratory, unsupervised learning method that identifies subgroups (clusters) of objects in a data set that are homogeneous in terms of the development dynamics of the monitored variables. Few clustering methods are tailored to micro-panel data. This paper focuses on feature-based clustering and introduces a novel two-step characteristic-based approach designed for this type of data. The proposed CluMP method aims to identify clusters that are at least as internally homogeneous and externally heterogeneous as those obtained by alternative methods already implemented in the statistical system R. We compare the clustering performance of the devised algorithm with two existing methods on simulated micro-panel data sets. Our approach yielded similar or better outcomes than the other methods; its main advantage is time efficiency, which makes it applicable to large data sets.

research
11/18/2022

Asymptotics for The k-means

The k-means is one of the most important unsupervised learning technique...
research
04/04/2022

Multivariate Microaggregation of Set-Valued Data

Data controllers manage immense data, and occasionally, it is released p...
research
12/22/2022

Co-clustering based exploratory analysis of mixed-type data tables

Co-clustering is a class of unsupervised data analysis techniques that e...
research
01/22/2016

When is Clustering Perturbation Robust?

Clustering is a fundamental data mining tool that aims to divide data in...
research
07/24/2020

New clustering approach for symbolic polygonal data: application to the clustering of entrepreneurial regimes

Entrepreneurial regimes are topic, receiving ever more research attentio...
research
07/17/2020

Functional clustering methods for resistance spot welding process data in the automotive industry

Quality assessment of resistance spot welding (RSW) joints of metal shee...
research
04/13/2013

Identification of relevant subtypes via preweighted sparse clustering

Cluster analysis methods are used to identify homogeneous subgroups in a...
