Improving Autoencoder-based Outlier Detection with Adjustable Probabilistic Reconstruction Error and Mean-shift Outlier Scoring

04/03/2023
by Xu Tan, et al.

Autoencoders have been widely used in many machine learning tasks thanks to their strong learning ability, which has drawn great interest among researchers in the field of outlier detection. However, conventional autoencoder-based methods overlook two aspects, and this limits their performance in outlier detection. First, the mean squared error used in conventional autoencoders ignores the judgment uncertainty of the autoencoder, which limits its representation ability. Second, autoencoders suffer from the abnormal reconstruction problem: some outliers are unexpectedly reconstructed well, making them difficult to distinguish from inliers. To mitigate these issues, two novel methods were proposed in this paper. First, a novel loss function named Probabilistic Reconstruction Error (PRE) was constructed to factor in both reconstruction bias and judgment uncertainty. To further control the trade-off between these two factors, two weights were introduced into PRE, producing the Adjustable Probabilistic Reconstruction Error (APRE), which can be tuned to benefit outlier detection in different applications. Second, a conceptually new outlier scoring method based on mean-shift (MSS) was proposed to reduce the false inliers caused by the autoencoder. Experiments on 32 real-world outlier detection datasets demonstrated the effectiveness of the proposed methods. The combination of the proposed methods achieved a 41% performance improvement compared to the best baseline. The MSS improved the performance of multiple autoencoder-based outlier detectors by an average of 20%, which may advance the development of autoencoders in outlier detection. The code is available on www.OutlierNet.com for reproducibility.
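
The abstract does not give formulas for APRE or MSS, so the following is only a rough sketch of the two ideas under stated assumptions, not the authors' implementation. It assumes the autoencoder outputs a mean and a log-variance per feature, uses a weighted Gaussian negative log-likelihood as a stand-in for an adjustable probabilistic reconstruction error, and uses a k-nearest-neighbour mean-shift step with the shift distance as an illustrative score. The function names and the parameters `w_bias`, `w_unc` and `k` are hypothetical.

```python
import numpy as np


def weighted_gaussian_nll(x, mu, log_var, w_bias=1.0, w_unc=1.0):
    """Per-sample weighted Gaussian negative log-likelihood (constant dropped).

    Assumption: a stand-in for an adjustable probabilistic reconstruction
    error. w_bias scales the variance-normalised squared error
    (reconstruction bias); w_unc scales the log-variance penalty
    (judgment uncertainty).
    """
    x, mu, log_var = (np.asarray(a, dtype=float) for a in (x, mu, log_var))
    bias_term = (x - mu) ** 2 / np.exp(log_var)
    unc_term = log_var
    return 0.5 * np.sum(w_bias * bias_term + w_unc * unc_term, axis=1)


def mean_shift_step(X, k=5):
    """Replace each point with the mean of its k nearest neighbours.

    The point itself counts as one of the k neighbours.
    """
    X = np.asarray(X, dtype=float)
    # Pairwise squared Euclidean distances, shape (n, n).
    d2 = ((X[:, None, :] - X[None, :, :]) ** 2).sum(axis=2)
    idx = np.argsort(d2, axis=1)[:, :k]
    return X[idx].mean(axis=1)


def shift_distance_score(X, k=5):
    """Illustrative score: how far one mean-shift step moves each point."""
    X = np.asarray(X, dtype=float)
    return np.linalg.norm(mean_shift_step(X, k) - X, axis=1)


# Toy data: four clustered points and one isolated point.
X_toy = np.array([[0, 0], [0, 1], [1, 0], [1, 1], [10, 10]], dtype=float)
scores = shift_distance_score(X_toy, k=3)
```

In this sketch, setting `w_unc` to zero and holding the predicted variance fixed reduces the loss to a scaled squared error, which is one way to see how the two weights trade off bias against uncertainty. On the toy data, the isolated point is moved furthest by the mean-shift step and so receives the largest score.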

research
05/12/2021

Autoencoding Under Normalization Constraints

Likelihood is a standard estimate for outlier detection. The specific ro...
research
12/22/2020

Probabilistic Outlier Detection and Generation

A new method for outlier detection and generation is introduced by lifti...
research
11/06/2022

The Importance of Suppressing Complete Reconstruction in Autoencoders for Unsupervised Outlier Detection

Autoencoders are widely used in outlier detection due to their superiori...
research
09/06/2020

Anomaly Detection With Partitioning Overfitting Autoencoder Ensembles

In this paper, we propose POTATOES (Partitioning OverfiTing AuTOencoder ...
research
10/22/2019

Unsupervised Boosting-based Autoencoder Ensembles for Outlier Detection

Autoencoders, as a dimensionality reduction technique, have been recentl...
research
03/17/2021

Fairness-aware Outlier Ensemble

Outlier ensemble methods have shown outstanding performance on the disco...
research
02/04/2021

RECol: Reconstruction Error Columns for Outlier Detection

Detecting outliers or anomalies is a common data analysis task. As a sub...
