Benchmarking optimality of time series classification methods in distinguishing diffusions

01/30/2023
by   Zehong Zhang, et al.
0

Performance benchmarking is a crucial component of time series classification (TSC) algorithm design, and a fast-growing number of datasets have been established for empirical benchmarking. However, the empirical benchmarks are costly and do not guarantee statistical optimality. This study proposes to benchmark the optimality of TSC algorithms in distinguishing diffusion processes by the likelihood ratio test (LRT). The LRT is optimal in the sense of the Neyman-Pearson lemma: it has the smallest false positive rate among classifiers with a controlled level of false negative rate. The LRT requires the likelihood ratio of the time series to be computable. The diffusion processes from stochastic differential equations provide such time series and are flexible in design for generating linear or nonlinear time series. We demonstrate the benchmarking with three scalable state-of-the-art TSC algorithms: random forest, ResNet, and ROCKET. Test results show that they can achieve LRT optimality for univariate time series and multivariate Gaussian processes. However, these model-agnostic algorithms are suboptimal in classifying nonlinear multivariate time series from high-dimensional stochastic interacting particle systems. Additionally, the LRT benchmark provides tools to analyze the dependence of classification accuracy on the time length, dimension, temporal sampling frequency, and randomness of the time series. Thus, the LRT with diffusion processes can systematically and efficiently benchmark the optimality of TSC algorithms and may guide their future improvements.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/19/2020

Monash University, UEA, UCR Time Series Regression Archive

Time series research has gathered lots of interests in the last decade, ...
research
07/26/2020

Benchmarking Multivariate Time Series Classification Algorithms

Time Series Classification (TSC) involved building predictive models for...
research
09/19/2019

Timage -- A Robust Time Series Classification Pipeline

Time series are series of values ordered by time. This kind of data can ...
research
02/18/2019

Large-scale directed network inference with multivariate transfer entropy and hierarchical statistical testing

Network inference algorithms are valuable tools for the study of large-s...
research
03/09/2020

Exact Inference of Linear Dependence Between Multiple Autocorrelated Time Series

The ability to quantify complex relationships within multivariate time s...
research
02/16/2021

Classification of multivariate weakly-labelled time-series with attention

This research identifies a gap in weakly-labelled multivariate time-seri...
research
10/11/2021

Chaos as an interpretable benchmark for forecasting and data-driven modelling

The striking fractal geometry of strange attractors underscores the gene...

Please sign up or login with your details

Forgot password? Click here to reset