ODIM: an efficient method to detect outliers via inlier-memorization effect of deep generative models

01/11/2023
by   Dongha Kim, et al.

Deciding whether a given sample is an outlier is an important problem in many real-world domains. This study addresses unsupervised outlier detection, where the training data contain outliers but no labels distinguishing inliers from outliers are available. We propose a powerful and efficient learning framework that uses deep neural networks to identify outliers in a training data set. It builds on a new observation that we call the inlier-memorization (IM) effect: when a deep generative model is trained on data contaminated with outliers, the model memorizes the inliers before the outliers. Exploiting this finding, we develop a new method called outlier detection via the IM effect (ODIM). Because the ODIM requires only a few updates, it is computationally efficient, running tens of times faster than other deep-learning-based algorithms. The ODIM also filters out outliers successfully regardless of the data type, such as tabular, image, or sequential data. We empirically demonstrate the superiority and efficiency of the ODIM by analyzing 20 data sets.
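
The IM effect described above can be illustrated with a toy experiment. The sketch below is not the authors' ODIM implementation: it swaps the deep generative model for a tiny linear autoencoder written in NumPy, uses synthetic data, and takes the per-sample reconstruction error after a few gradient updates as the outlier score. The function names (`per_sample_loss`, `train_few_updates`), the data-generating setup, and all hyperparameters are assumptions chosen only for illustration.

```python
import numpy as np


def per_sample_loss(X, W_enc, W_dec):
    """Squared reconstruction error of a linear autoencoder, one value per row."""
    recon = X @ W_enc @ W_dec
    return np.sum((recon - X) ** 2, axis=1)


def train_few_updates(X, hidden=2, steps=100, lr=0.1, seed=0):
    """Run a small number of full-batch gradient steps on the mean squared error.

    Hypothetical helper: a stand-in for "a few updates" of a generative model.
    """
    rng = np.random.default_rng(seed)
    n, d = X.shape
    W_enc = 0.1 * rng.standard_normal((d, hidden))
    W_dec = 0.1 * rng.standard_normal((hidden, d))
    for _ in range(steps):
        H = X @ W_enc                       # latent codes
        err = H @ W_dec - X                 # reconstruction residual
        grad_dec = (2.0 / n) * (H.T @ err)
        grad_enc = (2.0 / n) * (X.T @ err @ W_dec.T)
        W_enc -= lr * grad_enc
        W_dec -= lr * grad_dec
    return W_enc, W_dec


# Synthetic contaminated training set (assumed setup, not from the paper):
# inliers lie near a 2-D subspace of a 10-D space, outliers are isotropic noise
# with roughly the same squared norm, so norm alone does not separate them.
rng = np.random.default_rng(42)
d, n_in, n_out = 10, 950, 50
Q, _ = np.linalg.qr(rng.standard_normal((d, d)))
basis = Q[:, :2].T                          # 2 x d, orthonormal rows
inliers = rng.standard_normal((n_in, 2)) @ basis
inliers += 0.05 * rng.standard_normal((n_in, d))
outliers = np.sqrt(0.2) * rng.standard_normal((n_out, d))
X = np.vstack([inliers, outliers])
is_outlier = np.r_[np.zeros(n_in, dtype=bool), np.ones(n_out, dtype=bool)]

# Losses before training (steps=0 returns the initial weights) and after a few updates.
init_scores = per_sample_loss(X, *train_few_updates(X, steps=0))
scores = per_sample_loss(X, *train_few_updates(X, steps=100))

print("mean loss before training (inliers, outliers):",
      init_scores[~is_outlier].mean(), init_scores[is_outlier].mean())
print("mean loss after few updates (inliers, outliers):",
      scores[~is_outlier].mean(), scores[is_outlier].mean())

# Flag the samples with the largest loss as suspected outliers.
flagged = np.argsort(scores)[-n_out:]
print("fraction of flagged samples that are true outliers:",
      is_outlier[flagged].mean())
```

In this toy setting the inlier losses fall much faster than the outlier losses, so ranking training samples by loss after a short training run separates the two groups. Whether the same behavior holds for a particular deep generative model and data set is an empirical question that the paper investigates.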

research
10/25/2020

Further Analysis of Outlier Detection with Deep Generative Models

The recent, counter-intuitive discovery that deep generative models (DGM...
research
10/19/2019

Efficient Discovery of Meaningful Outlier Relationships

We propose PODS (Predictable Outliers in Data-trendS), a method that, gi...
research
12/22/2020

Probabilistic Outlier Detection and Generation

A new method for outlier detection and generation is introduced by lifti...
research
07/02/2020

Outlier Detection through Null Space Analysis of Neural Networks

Many machine learning classification systems lack competency awareness. ...
research
12/21/2020

TVOR: Finding Discrete Total Variation Outliers among Histograms

Pearson's chi-squared test can detect outliers in the data distribution ...
research
06/05/2020

Generating Artificial Outliers in the Absence of Genuine Ones – a Survey

By definition, outliers are rarely observed in reality, making them diff...
research
10/02/2018

Analysis of Robust Functions for Registration Algorithms

Registration accuracy is influenced by the presence of outliers and nume...
