Simulated Data Experiments for Time Series Classification Part 1: Accuracy Comparison with Default Settings

03/28/2017
by   Anthony Bagnall, et al.
0

There are now a broad range of time series classification (TSC) algorithms designed to exploit different representations of the data. These have been evaluated on a range of problems hosted at the UCR-UEA TSC Archive (www.timeseriesclassification.com), and there have been extensive comparative studies. However, our understanding of why one algorithm outperforms another is still anecdotal at best. This series of experiments is meant to help provide insights into what sort of discriminatory features in the data lead one set of algorithms that exploit a particular representation to be better than other algorithms. We categorise five different feature spaces exploited by TSC algorithms then design data simulators to generate randomised data from each representation. We describe what results we expected from each class of algorithm and data representation, then observe whether these prior beliefs are supported by the experimental evidence. We provide an open source implementation of all the simulators to allow for the controlled testing of hypotheses relating to classifier performance on different data representations. We identify many surprising results that confounded our expectations, and use these results to highlight how an over simplified view of classifier structure can often lead to erroneous prior beliefs. We believe ensembling can often overcome prior bias, and our results support the belief by showing that the ensemble approach adopted by the Hierarchical Collective of Transform based Ensembles (HIVE-COTE) is significantly better than the alternatives when the data representation is unknown, and is significantly better than, or not significantly significantly better than, or not significantly worse than, the best other approach on three out of five of the individual simulators.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/02/2023

Unsupervised Feature Based Algorithms for Time Series Extrinsic Regression

Time Series Extrinsic Regression (TSER) involves using a set of training...
research
10/25/2017

The Heterogeneous Ensembles of Standard Classification Algorithms (HESCA): the Whole is Greater than the Sum of its Parts

Building classification models is an intrinsically practical exercise th...
research
04/13/2020

On the Usage and Performance of The Hierarchical Vote Collective of Transformation-based Ensembles version 1.0 (HIVE-COTE 1.0)

The Hierarchical Vote Collective of Transformation-based Ensembles (HIVE...
research
03/30/2020

Hyperplane arrangements in polymake

Hyperplane arrangements form the latest addition to the zoo of combinato...
research
04/25/2023

Bake off redux: a review and experimental evaluation of recent time series classification algorithms

In 2017, a research paper compared 18 Time Series Classification (TSC) a...
research
09/12/2019

A tale of two toolkits, report the first: benchmarking time series classification algorithms for correctness and efficiency

sktime is an open source, Python based, sklearn compatible toolkit for t...
research
03/04/2021

A Comparative Evaluation of Quantification Methods

Quantification represents the problem of predicting class distributions ...

Please sign up or login with your details

Forgot password? Click here to reset