Clustering Multivariate Time Series using Energy Distance

03/24/2023
by   Richard A. Davis, et al.
0

A novel methodology is proposed for clustering multivariate time series data using energy distance defined in Székely and Rizzo (2013). Specifically, a dissimilarity matrix is formed using the energy distance statistic to measure separation between the finite dimensional distributions for the component time series. Once the pairwise dissimilarity matrix is calculated, a hierarchical clustering method is then applied to obtain the dendrogram. This procedure is completely nonparametric as the dissimilarities between stationary distributions are directly calculated without making any model assumptions. In order to justify this procedure, asymptotic properties of the energy distance estimates are derived for general stationary and ergodic time series. The method is illustrated in a simulation study for various component time series that are either linear or nonlinear. Finally the methodology is applied to two examples; one involves GDP of selected countries and the other is population size of various states in the U.S.A. in the years 1900 -1999.

READ FULL TEXT
research
03/10/2021

Stationary subspace analysis based on second-order statistics

In stationary subspace analysis (SSA) one assumes that the observable p-...
research
09/05/2021

Nonparametric Extrema Analysis in Time Series for Envelope Extraction, Peak Detection and Clustering

In this paper, we propose a nonparametric approach that can be used in e...
research
06/07/2020

Information Mandala: Statistical Distance Matrix with Its Clustering

In machine learning, observation features are measured in a metric space...
research
09/29/2021

Kernel distance measures for time series, random fields and other structured data

This paper introduces kdiff, a novel kernel-based measure for estimating...
research
12/14/2020

Clustering high dimensional meteorological scenarios: results and performance index

The Reseau de Transport d'Electricité (RTE) is the French main electrici...
research
11/27/2018

Extracting conditionally heteroscedastic components using ICA

In the independent component model, the multivariate data is assumed to ...
research
05/26/2023

Clustering Method for Time-Series Images Using Quantum-Inspired Computing Technology

Time-series clustering serves as a powerful data mining technique for ti...

Please sign up or login with your details

Forgot password? Click here to reset