COBRAS-TS: A new approach to Semi-Supervised Clustering of Time Series

05/02/2018
by   Toon Van Craenendonck, et al.
0

Clustering is ubiquitous in data analysis, including analysis of time series. It is inherently subjective: different users may prefer different clusterings for a particular dataset. Semi-supervised clustering addresses this by allowing the user to provide examples of instances that should (not) be in the same cluster. This paper studies semi-supervised clustering in the context of time series. We show that COBRAS, a state-of-the-art semi-supervised clustering method, can be adapted to this setting. We refer to this approach as COBRAS-TS. An extensive experimental evaluation supports the following claims: (1) COBRAS-TS far outperforms the current state of the art in semi-supervised clustering for time series, and thus presents a new baseline for the field; (2) COBRAS-TS can identify clusters with separated components; (3) COBRAS-TS can identify clusters that are characterized by small local patterns; (4) a small amount of semi-supervision can greatly improve clustering quality for time series; (5) the choice of the clustering algorithm matters (contrary to earlier claims in the literature).

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/06/2023

SS-shapelets: Semi-supervised Clustering of Time Series Using Representative Shapelets

Shapelets that discriminate time series using local features (subsequenc...
research
04/04/2021

Program Behavior Analysis and Clustering using Performance Counters

Understanding the dynamic behavior of computer programs during normal wo...
research
02/16/2017

Semi-supervised Learning for Discrete Choice Models

We introduce a semi-supervised discrete choice model to calibrate discre...
research
07/17/2019

Clustering Activity-Travel Behavior Time Series using Topological Data Analysis

Over the last few years, traffic data has been exploding and the transpo...
research
07/07/2022

Semi-unsupervised Learning for Time Series Classification

Time series are ubiquitous and therefore inherently hard to analyze and ...
research
12/24/2014

An Effective Semi-supervised Divisive Clustering Algorithm

Nowadays, data are generated massively and rapidly from scientific field...

Please sign up or login with your details

Forgot password? Click here to reset