Robust Two-Layer Partition Clustering of Sparse Multivariate Functional Data

04/26/2022
by   Zhuo Qu, et al.
0

In this work, a novel elastic time distance for sparse multivariate functional data is proposed. This concept serves as the foundation for clustering functional data with various time measurements per subject. Subsequently, a robust two-layer partition clustering is introduced. With the proposed distance, our approach not only is applicable to both complete and imbalanced multivariate functional data but also is resistant to outliers and capable of detecting outliers that do not belong to any clusters. The classical distance-based clustering methods such as K-medoids and agglomerative hierarchical clustering are extended to the sparse multivariate functional case based on our proposed distance. Numerical experiments on the simulated data highlight that the performance of the proposed algorithm is superior to the performances of the existing model-based and extended distance-based methods. Using Northwest Pacific cyclone track data as an example, we demonstrate the effectiveness of the proposed approach. The code is available online for readers to apply our clustering method and replicate our analyses.

READ FULL TEXT

page 13

page 16

page 25

research
03/14/2021

Sparse Functional Boxplots for Multivariate Curves

This paper introduces the sparse functional boxplot and the intensity sp...
research
07/31/2021

Functional clustering via multivariate clustering

Clustering techniques applied to multivariate data are a very useful too...
research
03/28/2023

Investigating swimming technical skills by a double partition clustering of multivariate functional data allowing for dimension selection

Investigating technical skills of swimmers is a challenge for performanc...
research
12/02/2019

A novel framework for joint sparse clustering and alignment of functional data

We propose a novel framework for sparse functional clustering that also ...
research
06/14/2021

Outlier detection in multivariate functional data through a contaminated mixture model

This work is motivated by an application in an industrial context, where...
research
03/18/2022

Statistical analysis of a hierarchical clustering algorithm with outliers

It is well known that the classical single linkage algorithm usually fai...
research
04/17/2013

The Mahalanobis distance for functional data with applications to classification

This paper presents a general notion of Mahalanobis distance for functio...

Please sign up or login with your details

Forgot password? Click here to reset