K-ARMA Models for Clustering Time Series Data

06/30/2022
by   Derek O. Hoare, et al.

We present an approach to clustering time series data using a model-based generalization of the K-Means algorithm, which we call K-Models. We prove the convergence of this general algorithm and relate it to the hard-EM algorithm for mixture modeling. We first apply our method to an AR(p) clustering example and show how the clustering algorithm can be made robust to outliers using a least-absolute-deviations criterion. We then build our clustering algorithm up for ARMA(p,q) models and extend it to ARIMA(p,d,q) models. We develop a goodness-of-fit statistic, based on the Ljung-Box statistic, for the models fitted to clusters. We perform experiments with simulated data to show how the algorithm can be used for outlier detection and for detecting distributional drift, and we discuss the impact of the initialization method on empty clusters. We also perform experiments on real data which show that our method is competitive with existing methods for similar time series clustering tasks.
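
To make the K-Models idea concrete, the following is a minimal Python sketch of the AR(p) case, written from the description in the abstract only and not taken from the paper. It assumes pooled ordinary least squares for the model-fitting step and the sum of squared one-step-ahead residuals for the assignment step; the function names (`lag_matrix`, `fit_ar`, `sse`, `k_models_ar`, `simulate_ar1`), the random initialization, and the handling of empty clusters are all illustrative assumptions.

```python
import numpy as np

def lag_matrix(x, p):
    """Build the regression x_t ~ x_{t-1}, ..., x_{t-p} for one series."""
    T = len(x)
    X = np.column_stack([x[p - j - 1:T - j - 1] for j in range(p)])
    return X, x[p:]

def fit_ar(series_list, p):
    """Fit one AR(p) model to all series in a cluster by pooled least squares."""
    Xs, ys = zip(*(lag_matrix(x, p) for x in series_list))
    coef, *_ = np.linalg.lstsq(np.vstack(Xs), np.concatenate(ys), rcond=None)
    return coef

def sse(x, coef):
    """Sum of squared one-step-ahead residuals of series x under coef."""
    X, y = lag_matrix(x, len(coef))
    return float(np.sum((y - X @ coef) ** 2))

def k_models_ar(data, k, p, n_iter=20, seed=0):
    """Alternate between fitting one AR(p) model per cluster and reassigning
    each series to the model that fits it best (a K-Means-style loop)."""
    rng = np.random.default_rng(seed)
    labels = rng.integers(k, size=len(data))  # assumed: random initial labels
    coefs = np.zeros((k, p))
    for _ in range(n_iter):
        # Model step: refit each non-empty cluster's AR(p) coefficients.
        for c in range(k):
            members = [x for x, lab in zip(data, labels) if lab == c]
            if members:  # assumed: an empty cluster keeps its old model
                coefs[c] = fit_ar(members, p)
        # Assignment step: pick the lowest-loss model for every series.
        losses = np.array([[sse(x, coefs[c]) for c in range(k)] for x in data])
        new_labels = losses.argmin(axis=1)
        if np.array_equal(new_labels, labels):
            break  # assignments are stable, so the loop has converged
        labels = new_labels
    return labels, coefs

def simulate_ar1(phi, T, rng):
    """Simulate an AR(1) series with standard normal innovations."""
    x = np.zeros(T)
    for t in range(1, T):
        x[t] = phi * x[t - 1] + rng.standard_normal()
    return x

# Usage: two groups of AR(1) series with opposite-signed coefficients.
rng = np.random.default_rng(1)
data = [simulate_ar1(0.8, 200, rng) for _ in range(10)]
data += [simulate_ar1(-0.8, 200, rng) for _ in range(10)]
labels, coefs = k_models_ar(data, k=2, p=1)
```

Replacing the squared residuals in `sse` and the least-squares fit in `fit_ar` with absolute-deviation counterparts would give a variant in the spirit of the robust criterion mentioned above; that substitution is not shown here.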


