Highly comparative feature-based time-series classification

by   Ben D. Fulcher, et al.

A highly comparative, feature-based approach to time series classification is introduced that uses an extensive database of algorithms to extract thousands of interpretable features from time series. These features are derived from across the scientific time-series analysis literature, and include summaries of time series in terms of their correlation structure, distribution, entropy, stationarity, scaling properties, and fits to a range of time-series models. After computing thousands of features for each time series in a training set, those that are most informative of the class structure are selected using greedy forward feature selection with a linear classifier. The resulting feature-based classifiers automatically learn the differences between classes using a reduced number of time-series properties, and circumvent the need to calculate distances between time series. Representing time series in this way results in orders of magnitude of dimensionality reduction, allowing the method to perform well on very large datasets containing long time series or time series of different lengths. For many of the datasets studied, classification performance exceeded that of conventional instance-based classifiers, including one nearest neighbor classifiers using Euclidean distances and dynamic time warping and, most importantly, the features selected provide an understanding of the properties of the dataset, insight that can guide further scientific investigation.


page 4

page 9


catch22: CAnonical Time-series CHaracteristics

Capturing the dynamical properties of time series concisely as interpret...

Highly comparative fetal heart rate analysis

A database of fetal heart rate (FHR) time series measured from 7221 pati...

A review on distance based time series classification

Time series classification is an increasing research topic due to the va...

Explainable time series tweaking via irreversible and reversible temporal transformations

Time series classification has received great attention over the past de...

An Empirical Evaluation of Time-Series Feature Sets

Solving time-series problems with features has been rising in popularity...

GENDIS: GENetic DIscovery of Shapelets

In the time series classification domain, shapelets are small time serie...

Explorative Data Analysis of Time Series based AlgorithmFeatures of CMA-ES Variants

In this study, we analyze behaviours of the well-known CMA-ES by extract...

Please sign up or login with your details

Forgot password? Click here to reset