Current Time Series Anomaly Detection Benchmarks are Flawed and are Creating the Illusion of Progress

09/29/2020
by   Renjie Wu, et al.

Time series anomaly detection has been a perennially important topic in data science, with papers dating back to the 1950s. In recent years, however, there has been an explosion of interest in this topic, much of it driven by the success of deep learning in other domains and for other time series tasks. Most of these papers test on one or more of a handful of popular benchmark datasets, created by Yahoo, Numenta, NASA, etc. In this work we make a surprising claim: the majority of the individual exemplars in these datasets suffer from one or more of four flaws. Because of these four flaws, we believe that many published comparisons of anomaly detection algorithms may be unreliable and, more importantly, that much of the apparent progress in recent years may be illusory. In addition to demonstrating these claims, with this paper we introduce the UCR Time Series Anomaly Datasets. We believe that this resource will play a role similar to that of the UCR Time Series Classification Archive, by providing the community with a benchmark that allows meaningful comparisons between approaches and a meaningful gauge of overall progress.
