RCC-Dual-GAN: An Efficient Approach for Outlier Detection with Few Identified Anomalies

03/07/2020
by Zhe Li, et al.

Outlier detection is an important task in data mining, and many techniques have been explored across various applications. However, because unsupervised outlier detection assumes by default that outliers are non-concentrated, it may fail to detect group anomalies with higher density levels. Supervised outlier detection can usually achieve high detection rates and optimal parameters, but obtaining sufficient and correct labels is time-consuming. To address these issues, we focus on semi-supervised outlier detection with few identified anomalies, aiming to achieve high detection accuracy from limited labels. First, we propose a novel detection model, Dual-GAN, which directly utilizes the potential information in identified anomalies to detect discrete outliers and partially identified group anomalies simultaneously. Then, considering that instances with similar output values may not all be similar in a complex data structure, we replace the two MO-GAN components in Dual-GAN with a combination of RCC and M-GAN (RCC-Dual-GAN). In addition, to evaluate the Nash equilibrium and select the optimal model, we create two evaluation indicators and introduce them into both models to make the detection process more intelligent. Extensive experiments on benchmark datasets and two practical tasks demonstrate that the proposed approaches (i.e., Dual-GAN and RCC-Dual-GAN) can significantly improve the accuracy of outlier detection even with only a few identified anomalies. Moreover, compared with the two MO-GAN components in Dual-GAN, the network structure combining RCC and M-GAN shows greater stability in various situations.


research
01/24/2020

Detection of Thin Boundaries between Different Types of Anomalies in Outlier Detection using Enhanced Neural Networks

Outlier detection has received special attention in various fields, main...
research
06/09/2023

WePaMaDM-Outlier Detection: Weighted Outlier Detection using Pattern Approaches for Mass Data Mining

Weighted Outlier Detection is a method for identifying unusual or anomal...
research
08/10/2022

SSDBCODI: Semi-Supervised Density-Based Clustering with Outliers Detection Integrated

Clustering analysis is one of the critical tasks in machine learning. Tr...
research
12/22/2021

Robust learning of data anomalies with analytically-solvable entropic outlier sparsification

Entropic Outlier Sparsification (EOS) is proposed as a robust computatio...
research
09/28/2018

Generative Adversarial Active Learning for Unsupervised Outlier Detection

Outlier detection is an important topic in machine learning and has been...
research
05/10/2018

A Proposal for Outlier and Noise Detection in Public Officials' Affidavits

Outlier and noise detection processes are highly useful in the quality a...
research
11/06/2022

The Importance of Suppressing Complete Reconstruction in Autoencoders for Unsupervised Outlier Detection

Autoencoders are widely used in outlier detection due to their superiori...
