Feature-Based Time-Series Analysis in R using the theft Package

08/12/2022
by   Trent Henderson, et al.
27

Time series are measured and analyzed across the sciences. One method of quantifying the structure of time series is by calculating a set of summary statistics or `features', and then representing a time series in terms of its properties as a feature vector. The resulting feature space is interpretable and informative, and enables conventional statistical learning approaches, including clustering, regression, and classification, to be applied to time-series datasets. Many open-source software packages for computing sets of time-series features exist across multiple programming languages, including catch22 (22 features: Matlab, R, Python, Julia), feasts (42 features: R), tsfeatures (63 features: R), Kats (40 features: Python), tsfresh (779 features: Python), and TSFEL (390 features: Python). However, there are several issues: (i) a singular access point to these packages is not currently available; (ii) to access all feature sets, users must be fluent in multiple languages; and (iii) these feature-extraction packages lack extensive accompanying methodological pipelines for performing feature-based time-series analysis, such as applications to time-series classification. Here we introduce a solution to these issues in an R software package called theft: Tools for Handling Extraction of Features from Time series. theft is a unified and extendable framework for computing features from the six open-source time-series feature sets listed above. It also includes a suite of functions for processing and interpreting the performance of extracted features, including extensive data-visualization templates, low-dimensional projections, and time-series classification operations. With an increasing volume and complexity of time-series datasets in the sciences and industry, theft provides a standardized framework for comprehensively quantifying and interpreting informative structure in time series.

READ FULL TEXT

page 8

page 16

page 18

research
08/20/2022

From Time Series to Networks in R with the ts2net Package

Network science established itself as a prominent tool for modeling time...
research
11/24/2021

tsflex: flexible time series processing feature extraction

Time series processing and feature extraction are crucial and time-inten...
research
10/21/2021

An Empirical Evaluation of Time-Series Feature Sets

Solving time-series problems with features has been rising in popularity...
research
09/15/2016

cesium: Open-Source Platform for Time-Series Inference

Inference on time series data is a common requirement in many scientific...
research
06/02/2021

semopy 2: A Structural Equation Modeling Package with Random Effects in Python

Structural Equation Modeling (SEM) is an umbrella term that includes num...
research
04/16/2021

Explorative Data Analysis of Time Series based AlgorithmFeatures of CMA-ES Variants

In this study, we analyze behaviours of the well-known CMA-ES by extract...
research
08/10/2022

TSInterpret: A unified framework for time series interpretability

With the increasing application of deep learning algorithms to time seri...

Please sign up or login with your details

Forgot password? Click here to reset