Copula Quadrant Similarity for Anomaly Scores

01/07/2021
by   Matthew Davidow, et al.
0

Practical anomaly detection requires applying numerous approaches due to the inherent difficulty of unsupervised learning. Direct comparison between complex or opaque anomaly detection algorithms is intractable; we instead propose a framework for associating the scores of multiple methods. Our aim is to answer the question: how should one measure the similarity between anomaly scores generated by different methods? The scoring crux is the extremes, which identify the most anomalous observations. A pair of algorithms are defined here to be similar if they assign their highest scores to roughly the same small fraction of observations. To formalize this, we propose a measure based on extremal similarity in scoring distributions through a novel upper quadrant modeling approach, and contrast it with tail and other dependence measures. We illustrate our method with simulated and real experiments, applying spectral methods to cluster multiple anomaly detection methods and to contrast our similarity measure with others. We demonstrate that our method is able to detect the clusters of anomaly detection algorithms to achieve an accurate and robust ensemble algorithm.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/19/2019

Deep Anomaly Detection with Deviation Networks

Although deep learning has been applied to successfully address many dat...
research
12/15/2020

Anomaly Detection and Localization based on Double Kernelized Scoring and Matrix Kernels

Anomaly detection is necessary for proper and safe operation of large-sc...
research
05/25/2020

Factor Analysis of Mixed Data for Anomaly Detection

Anomaly detection aims to identify observations that deviate from the ty...
research
11/19/2021

UN-AVOIDS: Unsupervised and Nonparametric Approach for Visualizing Outliers and Invariant Detection Scoring

The visualization and detection of anomalies (outliers) are of crucial i...
research
09/22/2022

Oracle Analysis of Representations for Deep Open Set Detection

The problem of detecting a novel class at run time is known as Open Set ...
research
03/11/2020

Building and Interpreting Deep Similarity Models

Many learning algorithms such as kernel machines, nearest neighbors, clu...
research
04/06/2023

Anomaly Detection via Gumbel Noise Score Matching

We propose Gumbel Noise Score Matching (GNSM), a novel unsupervised meth...

Please sign up or login with your details

Forgot password? Click here to reset