SPADE: Semi-supervised Anomaly Detection under Distribution Mismatch

11/30/2022
by   Jinsung Yoon, et al.
0

Semi-supervised anomaly detection is a common problem, as often the datasets containing anomalies are partially labeled. We propose a canonical framework: Semi-supervised Pseudo-labeler Anomaly Detection with Ensembling (SPADE) that isn't limited by the assumption that labeled and unlabeled data come from the same distribution. Indeed, the assumption is often violated in many applications - for example, the labeled data may contain only anomalies unlike unlabeled data, or unlabeled data may contain different types of anomalies, or labeled data may contain only 'easy-to-label' samples. SPADE utilizes an ensemble of one class classifiers as the pseudo-labeler to improve the robustness of pseudo-labeling with distribution mismatch. Partial matching is proposed to automatically select the critical hyper-parameters for pseudo-labeling without validation data, which is crucial with limited labeled data. SPADE shows state-of-the-art semi-supervised anomaly detection performance across a wide range of scenarios with distribution mismatch in both tabular and image domains. In some common real-world settings such as model facing new types of unlabeled anomalies, SPADE outperforms the state-of-the-art alternatives by 5

READ FULL TEXT
research
08/31/2022

Deep Anomaly Detection and Search via Reinforcement Learning

Semi-supervised Anomaly Detection (AD) is a kind of data mining task whi...
research
02/15/2023

Deep Anomaly Detection under Labeling Budget Constraints

Selecting informative data points for expert feedback can significantly ...
research
03/28/2022

Semi-supervised anomaly detection algorithm based on KL divergence (SAD-KL)

The unlabeled data are generally assumed to be normal data in detecting ...
research
11/22/2021

A Semi-Supervised Adaptive Discriminative Discretization Method Improving Discrimination Power of Regularized Naive Bayes

Recently, many improved naive Bayes methods have been developed with enh...
research
03/03/2022

Data-Efficient and Interpretable Tabular Anomaly Detection

Anomaly detection (AD) plays an important role in numerous applications....
research
10/30/2019

Weakly-supervised Deep Anomaly Detection with Pairwise Relation Learning

This paper studies a rarely explored but critical anomaly detection prob...
research
02/19/2021

Self-Taught Semi-Supervised Anomaly Detection on Upper Limb X-rays

Detecting anomalies in musculoskeletal radiographs is of paramount impor...

Please sign up or login with your details

Forgot password? Click here to reset