How to Evaluate the Quality of Unsupervised Anomaly Detection Algorithms?

07/05/2016
by Nicolas Goix, et al.

When sufficient labeled data are available, classical criteria based on Receiver Operating Characteristic (ROC) or Precision-Recall (PR) curves can be used to compare the performance of unsupervised anomaly detection algorithms. However, in many situations, few or no data are labeled. This calls for alternative criteria that can be computed on unlabeled data. In this paper, two criteria that do not require labels are empirically shown to discriminate accurately between algorithms, as measured against ROC- or PR-based criteria. These criteria are based on the existing Excess-Mass (EM) and Mass-Volume (MV) curves, which generally cannot be estimated well in high dimensions. A methodology based on feature sub-sampling and aggregating is also described and tested; it extends the use of these criteria to high-dimensional datasets and addresses major drawbacks inherent to standard EM and MV curves.
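
To make the label-free idea concrete, here is a minimal Python sketch of an empirical Excess-Mass curve. It is an illustration under stated assumptions, not the authors' implementation: it uses the usual definition EM(t) = max over thresholds u of [P(s(X) >= u) - t * Vol({s >= u})], estimates the volume by Monte Carlo sampling in the bounding box of the data, and omits the feature sub-sampling step. The names `em_curve` and `centered_score`, the sample sizes, and the range of `t_values` are all hypothetical choices made for the example.

```python
import numpy as np

def em_curve(scores_data, scores_uniform, volume, t_values):
    """Empirical EM curve for a scoring function (higher score = more normal).

    scores_data:    scores of the unlabeled observations
    scores_uniform: scores of points drawn uniformly in a box of size `volume`
    """
    thresholds = np.unique(scores_data)
    # Fraction of observations whose score is at least each threshold.
    mass = (scores_data[None, :] >= thresholds[:, None]).mean(axis=1)
    # Monte Carlo estimate of the volume of each level set {s >= threshold}.
    vol = volume * (scores_uniform[None, :] >= thresholds[:, None]).mean(axis=1)
    # For each t, keep the best trade-off between mass and t * volume.
    return np.array([np.max(mass - t * vol) for t in t_values])

rng = np.random.default_rng(0)
X = rng.normal(size=(1000, 2))                # toy unlabeled data (assumption)
lo, hi = X.min(axis=0), X.max(axis=0)
volume = float(np.prod(hi - lo))              # volume of the bounding box
U = rng.uniform(lo, hi, size=(5000, 2))       # uniform points for the volume estimate

def centered_score(Z):
    # Hypothetical scoring function: points near the origin count as more normal.
    return -np.sum(Z ** 2, axis=1)

t_values = np.linspace(0.0, 5.0 / volume, 50)
em_good = em_curve(centered_score(X), centered_score(U), volume, t_values)
# Baseline: scores that ignore the data entirely.
em_rand = em_curve(rng.random(len(X)), rng.random(len(U)), volume, t_values)
print(em_good.mean(), em_rand.mean())
```

In this toy setting the scoring function whose level sets follow the data density yields a higher EM curve than the uninformative one, and no labels are used at any point. The Monte Carlo volume estimate degrades as the dimension grows, which is the drawback that the feature sub-sampling and aggregating methodology is meant to address.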

research
08/31/2023

Deep Semi-Supervised Anomaly Detection for Finding Fraud in the Futures Market

Modern financial electronic exchanges are an exciting and fast-paced mar...
research
11/24/2019

Latent space conditioning for improved classification and anomaly detection

We propose a variational autoencoder to perform improved pre-processing ...
research
01/09/2018

An overview of deep learning based methods for unsupervised and semi-supervised anomaly detection in videos

Videos represent the primary source of information for surveillance appl...
research
02/05/2015

Performance Analysis of Cone Detection Algorithms

Many algorithms have been proposed to help clinicians evaluate cone dens...
research
11/07/2016

One Class Splitting Criteria for Random Forests

Random Forests (RFs) are strong machine learning tools for classificatio...
research
03/27/2023

Disruption Precursor Onset Time Study Based on Semi-supervised Anomaly Detection

The full understanding of plasma disruption in tokamaks is currently lac...
research
02/05/2015

On Anomaly Ranking and Excess-Mass Curves

Learning how to rank multivariate unlabeled observations depending on th...
