AUTOSHAPE: An Autoencoder-Shapelet Approach for Time Series Clustering

08/06/2022
by Guozhong Li, et al.

Time series shapelets are discriminative subsequences that have recently been found effective for time series clustering (TSC). Shapelets are also convenient for interpreting the clusters. Thus, the main challenge for TSC is to discover high-quality variable-length shapelets that discriminate between clusters. In this paper, we propose a novel autoencoder-shapelet approach (AUTOSHAPE), which is the first study to take advantage of both autoencoders and shapelets for determining shapelets in an unsupervised manner. An autoencoder is specially designed to learn high-quality shapelets. More specifically, to guide the latent representation learning, we employ the latest self-supervised loss to learn unified embeddings for variable-length shapelet candidates (time series subsequences) of different variables, and propose a diversity loss to select discriminative embeddings in the unified space. We introduce a reconstruction loss to recover shapelets in the original time series space for clustering. Finally, we adopt the Davies-Bouldin index (DBI) to inform AUTOSHAPE of the clustering performance during learning. We present extensive experiments on AUTOSHAPE. To evaluate the clustering performance on univariate time series (UTS), we compare AUTOSHAPE with 15 representative methods using UCR archive datasets. To study the performance on multivariate time series (MTS), we evaluate AUTOSHAPE on 30 UEA archive datasets against 5 competitive methods. The results validate that AUTOSHAPE is the best among all the methods compared. We interpret clusters with shapelets and obtain interesting intuitions about the clusters in three UTS case studies and one MTS case study.
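
Two ingredients named in the abstract, the distance between a shapelet and a time series and the Davies-Bouldin index, can be illustrated with a minimal sketch. This is not the AUTOSHAPE implementation: the abstract does not specify the distance, so the length-normalised sliding-window Euclidean distance used here is an assumption borrowed from common shapelet practice, and the function names `shapelet_distance`, `shapelet_transform`, and `davies_bouldin_index` are hypothetical. The DBI follows its standard definition, where lower values indicate more compact, better separated clusters.

```python
import math


def shapelet_distance(series, shapelet):
    """Smallest length-normalised Euclidean distance between the shapelet
    and any window of the same length in the series (assumed convention)."""
    m = len(shapelet)
    if m == 0 or m > len(series):
        raise ValueError("shapelet must be non-empty and no longer than the series")
    best = math.inf
    for start in range(len(series) - m + 1):
        sq = sum((series[start + i] - shapelet[i]) ** 2 for i in range(m))
        best = min(best, math.sqrt(sq / m))
    return best


def shapelet_transform(dataset, shapelets):
    """Represent each series by its distances to every shapelet."""
    return [[shapelet_distance(s, sh) for sh in shapelets] for s in dataset]


def davies_bouldin_index(points, labels):
    """Standard Davies-Bouldin index; assumes distinct cluster centroids."""
    clusters = {}
    for point, label in zip(points, labels):
        clusters.setdefault(label, []).append(point)
    if len(clusters) < 2:
        raise ValueError("at least two clusters are required")
    # Centroid and mean distance-to-centroid (scatter) for each cluster.
    centroids = {
        label: [sum(col) / len(members) for col in zip(*members)]
        for label, members in clusters.items()
    }
    scatter = {
        label: sum(math.dist(p, centroids[label]) for p in members) / len(members)
        for label, members in clusters.items()
    }
    # For each cluster, take its worst similarity ratio to any other cluster.
    total = 0.0
    for i in clusters:
        total += max(
            (scatter[i] + scatter[j]) / math.dist(centroids[i], centroids[j])
            for j in clusters
            if j != i
        )
    return total / len(clusters)


# Toy data: two series contain a bump, two are flat.
bump = [1.0, 2.0, 1.0]
dataset = [
    [0.0, 0.0, 1.0, 2.0, 1.0, 0.0, 0.0],
    [0.0, 1.0, 2.0, 1.0, 0.0, 0.0, 0.0],
    [0.0] * 7,
    [0.0] * 7,
]
features = shapelet_transform(dataset, [bump])
score = davies_bouldin_index(features, [0, 0, 1, 1])
```

In this toy setting the single shapelet separates the two groups perfectly, so each cluster has zero scatter and the index reaches its minimum. On real data, a score like this could serve as a label-free signal for comparing candidate shapelet sets, which is the role the abstract assigns to the DBI during learning.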


research
02/10/2020

Autoencoder-based time series clustering with energy applications

Time series clustering is a challenging task due to the specific nature ...
research
01/11/2021

Hierarchical Clustering using Auto-encoded Compact Representation for Time-series Analysis

Getting a robust time-series clustering with best choice of distance mea...
research
04/06/2023

SS-shapelets: Semi-supervised Clustering of Time Series Using Representative Shapelets

Shapelets that discriminate time series using local features (subsequenc...
research
11/29/2018

Recurrent Deep Divergence-based Clustering for simultaneous feature learning and clustering of variable length time series

The task of clustering unlabeled time series and sequences entails a par...
research
11/11/2021

ORION-AE: Multisensor acoustic emission datasets reflecting supervised untightening of bolts in a jointed vibrating structure

Monitoring loosening in jointed structures during operation is challengi...
research
08/16/2019

N2D:(Not Too) Deep clustering via clustering the local manifold of an autoencoded embedding

Deep clustering has increasingly been demonstrating superiority over con...
research
01/30/2019

Unsupervised Scalable Representation Learning for Multivariate Time Series

Time series constitute a challenging data type for machine learning algo...
