Anomaly Detection: How to Artificially Increase your F1-Score with a Biased Evaluation Protocol

06/30/2021
by Damien Fourure, et al.

Anomaly detection is a widely explored domain in machine learning. Many models have been proposed in the literature and compared using different metrics measured on various datasets. The most popular metrics for comparing performance are the F1-score, the AUC, and the AVPR. In this paper, we show that the F1-score and the AVPR are highly sensitive to the contamination rate, i.e. the proportion of anomalies in the evaluated data. One consequence is that their values can be artificially increased by modifying the train-test split procedure. This leads to misleading comparisons between algorithms in the literature, especially when the evaluation protocol is not described in detail. Moreover, we show that the F1-score and the AVPR cannot be used to compare performance across datasets, as they do not reflect the intrinsic difficulty of modeling such data. Based on these observations, we claim that the F1-score and the AVPR should not be used as metrics for anomaly detection. We recommend a generic evaluation procedure for unsupervised anomaly detection, including the use of other metrics such as the AUC, which are more robust to arbitrary choices in the evaluation protocol.
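
The sensitivity to the contamination rate described above can be illustrated with a small simulation. The sketch below is not the paper's experiment: it assumes synthetic Gaussian anomaly scores, a fixed decision threshold of 1.0, and hypothetical helper names (`f1_score`, `auc`, `evaluate`). The same detector and the same anomalies are evaluated twice, and only the number of normal samples placed in the test set changes.

```python
import random


def f1_score(scores, labels, threshold):
    """F1 for the anomaly class when every score >= threshold is flagged."""
    tp = fp = fn = 0
    for s, y in zip(scores, labels):
        if s >= threshold:
            if y == 1:
                tp += 1
            else:
                fp += 1
        elif y == 1:
            fn += 1
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def auc(scores, labels):
    """ROC AUC via the rank-sum (Mann-Whitney) formula; assumes no tied scores."""
    order = sorted(range(len(scores)), key=lambda i: scores[i])
    rank_sum = sum(rank for rank, i in enumerate(order, start=1) if labels[i] == 1)
    n_pos = sum(labels)
    n_neg = len(labels) - n_pos
    return (rank_sum - n_pos * (n_pos + 1) / 2) / (n_pos * n_neg)


# Assumption: synthetic scores, anomalies ~ N(2, 1) and normal samples ~ N(0, 1).
rng = random.Random(0)
anomaly_scores = [rng.gauss(2.0, 1.0) for _ in range(1000)]
normal_scores = [rng.gauss(0.0, 1.0) for _ in range(19000)]
THRESHOLD = 1.0  # assumed fixed decision threshold, identical in both evaluations


def evaluate(n_normal):
    """Evaluate on all anomalies plus the first n_normal normal samples."""
    scores = anomaly_scores + normal_scores[:n_normal]
    labels = [1] * len(anomaly_scores) + [0] * n_normal
    contamination = len(anomaly_scores) / len(scores)
    return contamination, f1_score(scores, labels, THRESHOLD), auc(scores, labels)


# Same detector, same anomalies; only the share of normal test samples differs.
results = {n: evaluate(n) for n in (1000, 19000)}
for n, (contamination, f1, roc_auc) in results.items():
    print(f"normals={n:6d} contamination={contamination:.2f} "
          f"F1={f1:.3f} AUC={roc_auc:.3f}")
```

Because recall is identical in both runs and the number of false positives grows with the number of normal test samples, the F1-score is markedly higher on the smaller, more contaminated test set, while the AUC stays nearly unchanged. This is the kind of protocol choice that makes F1 values hard to compare across papers.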

