Semi-supervised anomaly detection algorithm based on KL divergence (SAD-KL)

03/28/2022
by Chong Hyun Lee, et al.

In detecting abnormal data via semi-supervised learning, the unlabeled data are generally assumed to be normal. This assumption, however, causes inevitable detection errors when the distribution of the unlabeled data differs from that of the labeled normal dataset. To address the problem caused by this distribution gap between labeled and unlabeled data, we propose a semi-supervised anomaly detection algorithm using KL divergence (SAD-KL). The proposed SAD-KL is composed of two steps: (1) estimating the KL divergence between the probability density functions (PDFs) of the local outlier factors (LOFs) of the labeled normal data and of the unlabeled data; and (2) estimating the detection probability and the threshold for detecting normal data in the unlabeled data by using the KL divergence. We show that the PDFs of the LOFs follow a Burr distribution and use them for detection. Once the detection threshold is computed, SAD-KL runs iteratively until the labeling change rate falls below a predefined value. Experimental results show that SAD-KL achieves a higher detection probability than existing algorithms while requiring less learning time.
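Step (1) of the abstract can be illustrated with a small, self-contained sketch. It is not the authors' implementation: the abstract fits Burr distributions to the LOF values, whereas this sketch swaps in a smoothed histogram estimate of the two PDFs to stay dependency-free. The function names (`lof_scores`, `histogram_kl`), the neighbourhood size `k`, the bin count, the smoothing constant `eps`, the synthetic data, and the choice to compute each set's LOFs within that set are all assumptions made for illustration.

```python
import math
import random


def lof_scores(points, k=5):
    """Brute-force local outlier factor of every point within `points`.

    Simplification (assumption): exactly k neighbours are used, ignoring ties.
    """
    n = len(points)
    # k nearest neighbours of each point as (distance, index), excluding itself
    neigh = []
    for i in range(n):
        d = sorted((math.dist(points[i], points[j]), j) for j in range(n) if j != i)
        neigh.append(d[:k])
    k_dist = [nb[-1][0] for nb in neigh]  # distance to the k-th neighbour
    # local reachability density: inverse of the mean reachability distance
    lrd = []
    for i in range(n):
        reach = [max(k_dist[j], d) for d, j in neigh[i]]
        lrd.append(len(reach) / max(sum(reach), 1e-12))
    # LOF: mean ratio of the neighbours' density to the point's own density
    return [sum(lrd[j] for _, j in neigh[i]) / (len(neigh[i]) * lrd[i])
            for i in range(n)]


def histogram_kl(p_samples, q_samples, bins=20, eps=1e-6):
    """KL(P || Q) from smoothed histograms on a shared set of bins.

    Stand-in (assumption) for the Burr-distribution fit named in the abstract.
    """
    lo = min(min(p_samples), min(q_samples))
    hi = max(max(p_samples), max(q_samples))
    width = (hi - lo) / bins or 1.0

    def hist(samples):
        counts = [0] * bins
        for s in samples:
            counts[min(int((s - lo) / width), bins - 1)] += 1
        total = len(samples) + eps * bins
        return [(c + eps) / total for c in counts]  # sums to 1

    p, q = hist(p_samples), hist(q_samples)
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q))


# Synthetic data (assumption): the unlabeled set is contaminated with outliers.
random.seed(0)
labeled_normal = [(random.gauss(0, 1), random.gauss(0, 1)) for _ in range(200)]
unlabeled = ([(random.gauss(0, 1), random.gauss(0, 1)) for _ in range(180)]
             + [(random.uniform(-8, 8), random.uniform(-8, 8)) for _ in range(20)])

lof_normal = lof_scores(labeled_normal)
lof_unlabeled = lof_scores(unlabeled)
kl = histogram_kl(lof_normal, lof_unlabeled)
print(f"estimated KL divergence between LOF distributions: {kl:.4f}")
```

A larger estimated divergence indicates a wider gap between the LOF distributions of the two sets. How the divergence is then mapped to a detection probability and threshold in step (2) is not specified in the abstract, so the sketch stops here.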

research
11/30/2022

SPADE: Semi-supervised Anomaly Detection under Distribution Mismatch

Semi-supervised anomaly detection is a common problem, as often the data...
research
12/09/2020

ESAD: End-to-end Deep Semi-supervised Anomaly Detection

This paper explores semi-supervised anomaly detection, a more practical ...
research
02/15/2023

Deep Anomaly Detection under Labeling Budget Constraints

Selecting informative data points for expert feedback can significantly ...
research
07/24/2022

Semi-supervised Deep Multi-view Stereo

Significant progress has been witnessed in learning-based Multi-view Ste...
research
06/26/2023

Anomaly Detection with Score Distribution Discrimination

Recent studies give more attention to the anomaly detection (AD) methods...
research
05/02/2022

Open-Set Semi-Supervised Learning for 3D Point Cloud Understanding

Semantic understanding of 3D point cloud relies on learning models with ...
research
03/09/2017

Detecting Sockpuppets in Deceptive Opinion Spam

This paper explores the problem of sockpuppet detection in deceptive opi...
