Anomaly detection on streamed data

06/05/2020
by   Thomas Cochrane, et al.
0

We introduce powerful but simple methodology for identifying anomalous observations against a corpus of `normal' observations. All data are observed through a vector-valued feature map. Our approach depends on the choice of corpus and that feature map but is invariant to affine transformations of the map and has no other external dependencies, such as choices of metric; we call it conformance. Applying this method to (signatures) of time series and other types of streamed data we provide an effective methodology of broad applicability for identifying anomalous complex multimodal sequential data. We demonstrate the applicability and effectiveness of our method by evaluating it against multiple data sets. Based on quantifying performance using the receiver operating characteristic (ROC) area under the curve (AUC), our method yields an AUC score of 98.9% for the PenDigits data set; in a subsequent experiment involving marine vessel traffic data our approach yields an AUC score of 89.1%. Based on comparison involving univariate time series from the UEA & UCR time series repository with performance quantified using balanced accuracy and assuming an optimal operating point, our approach outperforms a state-of-the-art shapelet method for 19 out of 28 data sets.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/03/2019

Local Trend Inconsistency: A Prediction-driven Approach to Unsupervised Anomaly Detection in Multi-seasonal Time Series

On-line detection of anomalies in time series is a key technique in vari...
research
08/23/2020

Multiple Network Embedding for Anomaly Detection in Time Series of Graphs

This paper considers the graph signal processing problem of anomaly dete...
research
11/16/2022

Are we certain it's anomalous?

The progress in modelling time series and, more generally, sequences of ...
research
05/08/2023

Is AUC the best measure for practical comparison of anomaly detectors?

The area under receiver operating characteristics (AUC) is the standard ...
research
08/24/2023

Low-count Time Series Anomaly Detection

Low-count time series describe sparse or intermittent events, which are ...
research
09/25/2018

Dynamic detection of anomalous regions within distributed acoustic sensing data streams using locally stationary wavelet time series

Distributed acoustic sensing technology is increasingly being used to su...
research
02/08/2022

The Lifecycle of a Statistical Model: Model Failure Detection, Identification, and Refitting

The statistical machine learning community has demonstrated considerable...

Please sign up or login with your details

Forgot password? Click here to reset