SS-shapelets: Semi-supervised Clustering of Time Series Using Representative Shapelets

04/06/2023
by   Borui Cai, et al.
0

Shapelets that discriminate time series using local features (subsequences) are promising for time series clustering. Existing time series clustering methods may fail to capture representative shapelets because they discover shapelets from a large pool of uninformative subsequences, and thus result in low clustering accuracy. This paper proposes a Semi-supervised Clustering of Time Series Using Representative Shapelets (SS-Shapelets) method, which utilizes a small number of labeled and propagated pseudo-labeled time series to help discover representative shapelets, thereby improving the clustering accuracy. In SS-Shapelets, we propose two techniques to discover representative shapelets for the effective clustering of time series. 1) A salient subsequence chain (SSC) that can extract salient subsequences (as candidate shapelets) of a labeled/pseudo-labeled time series, which helps remove massive uninformative subsequences from the pool. 2) A linear discriminant selection (LDS) algorithm to identify shapelets that can capture representative local features of time series in different classes, for convenient clustering. Experiments on UCR time series datasets demonstrate that SS-shapelets discovers representative shapelets and achieves higher clustering accuracy than counterpart semi-supervised time series clustering methods.

READ FULL TEXT

page 1

page 10

research
05/02/2018

COBRAS-TS: A new approach to Semi-Supervised Clustering of Time Series

Clustering is ubiquitous in data analysis, including analysis of time se...
research
08/06/2022

AUTOSHAPE: An Autoencoder-Shapelet Approach for Time Series Clustering

Time series shapelets are discriminative subsequences that have been rec...
research
05/14/2019

A self-organising eigenspace map for time series clustering

This paper presents a novel time series clustering method, the self-orga...
research
04/09/2023

Embarrassingly Simple MixUp for Time-series

Labeling time series data is an expensive task because of domain experti...
research
11/08/2019

Hierarchical Clustering for Smart Meter Electricity Loads based on Quantile Autocovariances

In order to improve the efficiency and sustainability of electricity sys...
research
07/07/2022

Semi-unsupervised Learning for Time Series Classification

Time series are ubiquitous and therefore inherently hard to analyze and ...
research
06/19/2020

Semi-supervised time series classification method for quantum computing

In this paper we develop methods to solve two problems related to time s...

Please sign up or login with your details

Forgot password? Click here to reset