An Empirical Evaluation of Similarity Measures for Time Series Classification

01/16/2014
by   Joan Serrà, et al.
0

Time series are ubiquitous, and a measure to assess their similarity is a core part of many computational systems. In particular, the similarity measure is the most essential ingredient of time series clustering and classification systems. Because of this importance, countless approaches to estimate time series similarity have been proposed. However, there is a lack of comparative studies using empirical, rigorous, quantitative, and large-scale assessment strategies. In this article, we provide an extensive evaluation of similarity measures for time series classification following the aforementioned principles. We consider 7 different measures coming from alternative measure `families', and 45 publicly-available time series data sets coming from a wide variety of scientific domains. We focus on out-of-sample classification accuracy, but in-sample accuracies and parameter choices are also discussed. Our work is based on rigorous evaluation methodologies and includes the use of powerful statistical significance tests to derive meaningful conclusions. The obtained results show the equivalence, in terms of accuracy, of a number of measures, but with one single candidate outperforming the rest. Such findings, together with the followed methodology, invite researchers on the field to adopt a more consistent evaluation criteria and a more informed decision regarding the baseline measures to which new developments should be compared.

READ FULL TEXT
research
12/05/2019

Clustering Time-Series by a Novel Slope-Based Similarity Measure Considering Particle Swarm Optimization

Recently there has been an increase in the studies on time-series data m...
research
12/09/2010

Experimental Comparison of Representation Methods and Distance Measures for Time Series Data

The previous decade has brought a remarkable increase of the interest in...
research
08/04/2017

Research Activity Classification based on Time Series Bibliometrics

Bibliometrics such as the number of papers and times cited are often use...
research
01/17/2021

Free congruence: an exploration of expanded similarity measures for time series data

Time series similarity measures are highly relevant in a wide range of e...
research
02/25/2015

Exploiting a comparability mapping to improve bi-lingual data categorization: a three-mode data analysis perspective

We address in this paper the co-clustering and co-classification of bili...
research
07/01/2011

The Influence of Global Constraints on Similarity Measures for Time-Series Databases

A time series consists of a series of values or events obtained over rep...

Please sign up or login with your details

Forgot password? Click here to reset