Anomaly detection using data depth: multivariate case

10/06/2022
by Pavlo Mozharovskyi, et al.

Anomaly detection is a branch of machine learning and data analysis that aims to identify observations exhibiting abnormal behaviour. Be it measurement errors, disease development, severe weather, production quality defects (defective items) or failed equipment, financial fraud or crisis events, their timely identification, isolation and explanation constitute an important task in almost any branch of industry and science. By providing a robust ordering, data depth (a statistical function that measures how well any point of the space belongs to a data set) becomes a particularly useful tool for detecting anomalies. Already known for its theoretical properties, data depth has undergone substantial computational developments in the last decade, and particularly in recent years, which have made it applicable to contemporary-sized problems of data analysis and machine learning. In this article, data depth is studied as an efficient anomaly detection tool in a multivariate setting, assigning abnormality labels to observations with lower depth values. The practical questions discussed are the necessity and reasonableness of invariances, the shape of the depth function, its robustness and computational complexity, and the choice of the threshold. Illustrations include use cases that underline the advantageous behaviour of data depth in various settings.
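
The rule of labelling low-depth observations as anomalies can be illustrated with a minimal sketch. It is not taken from the article: it assumes the Mahalanobis depth, D(x) = 1 / (1 + (x - mu)^T Sigma^{-1} (x - mu)), with the plain sample mean and covariance (which are not robust estimates), and it sets the threshold at an empirical quantile of the training depths. The function names and the `contamination` parameter are hypothetical, chosen for illustration only.

```python
import numpy as np


def mahalanobis_depth(points, data):
    """Mahalanobis depth of each row of `points` with respect to `data`.

    Assumption: location and scatter are the sample mean and sample
    covariance of `data`; a robust estimator could be swapped in.
    """
    data = np.asarray(data, dtype=float)
    points = np.atleast_2d(np.asarray(points, dtype=float))
    mean = data.mean(axis=0)
    precision = np.linalg.inv(np.cov(data, rowvar=False))
    centered = points - mean
    # Squared Mahalanobis distance of every point to the sample mean.
    d2 = np.einsum("ij,jk,ik->i", centered, precision, centered)
    return 1.0 / (1.0 + d2)


def flag_anomalies(points, data, contamination=0.05):
    """Flag points whose depth falls below a quantile of the training depths.

    `contamination` is a hypothetical tuning parameter: the fraction of
    training observations allowed to lie below the threshold.
    """
    threshold = np.quantile(mahalanobis_depth(data, data), contamination)
    return mahalanobis_depth(points, data) < threshold


# Synthetic illustration: a standard normal cloud and two query points.
rng = np.random.default_rng(0)
data = rng.normal(size=(500, 2))
queries = np.array([[0.0, 0.0], [6.0, 6.0]])
flags = flag_anomalies(queries, data)
print(flags)
```

In this synthetic example the query at the centre of the cloud has depth close to 1 and is kept, while the distant query has very low depth and is flagged. Other depth notions would change only `mahalanobis_depth`; the thresholding step stays the same.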


research
12/13/2021

Why Are You Weird? Infusing Interpretability in Isolation Forest for Anomaly Detection

Anomaly detection is concerned with identifying examples in a dataset th...
research
01/13/2022

Functional Anomaly Detection: a Benchmark Study

The increasing automation in many areas of the Industry expressly demand...
research
08/20/2015

Multi-criteria Similarity-based Anomaly Detection using Pareto Depth Analysis

We consider the problem of identifying patterns in a data set that exhib...
research
10/09/2019

The Area of the Convex Hull of Sampled Curves: a Robust Functional Statistical Depth Measure

With the ubiquity of sensors in the IoT era, statistical observations ar...
research
05/13/2022

A Vision Inspired Neural Network for Unsupervised Anomaly Detection in Unordered Data

A fundamental problem in the field of unsupervised machine learning is t...
research
04/09/2019

Functional Isolation Forest

For the purpose of monitoring the behavior of complex infrastructures (e...
research
01/18/2022

Antimodes and Graphical Anomaly Exploration via Depth Quantile Functions

Depth quantile functions (DQF) encode geometric information about a poin...
