Detecting Point Outliers Using Prune-based Outlier Factor (PLOF)

11/05/2019
by   Kasra Babaei, et al.
0

Outlier detection (also known as anomaly detection or deviation detection) is a process of detecting data points in which their patterns deviate significantly from others. It is common to have outliers in industry applications, which could be generated by different causes such as human error, fraudulent activities, or system failure. Recently, density-based methods have shown promising results, particularly among which Local Outlier Factor (LOF) is arguably dominating. However, one of the major drawbacks of LOF is that it is computationally expensive. Motivated by the mentioned problem, this research presents a novel pruning-based procedure in which the execution time of LOF is reduced while the performance is maintained. A novel Prune-based Local Outlier Factor (PLOF) approach is proposed, in which prior to employing LOF, outlierness of each data instance is measured. Next, based on a threshold, data instances that require further investigation are separated and LOF score is only computed for these points. Extensive experiments have been conducted and results are promising. Comparison experiments with the original LOF and two state-of-the-art variants of LOF have shown that PLOF produces higher accuracy and precision while reducing execution time.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/08/2020

Outlier Detection Using a Novel method: Quantum Clustering

We propose a new assumption in outlier detection: Normal data instances ...
research
12/12/2017

Outlier Detection by Consistent Data Selection Method

Often the challenge associated with tasks like fraud and spam detection[...
research
06/14/2019

Detecting Network Soft-failures with the Network Link Outlier Factor (NLOF)

In this paper, we describe and experimentally evaluate the performance o...
research
09/15/2023

BANSAC: A dynamic BAyesian Network for adaptive SAmple Consensus

RANSAC-based algorithms are the standard techniques for robust estimatio...
research
06/09/2023

WePaMaDM-Outlier Detection: Weighted Outlier Detection using Pattern Approaches for Mass Data Mining

Weighted Outlier Detection is a method for identifying unusual or anomal...
research
10/26/2021

Revisiting randomized choices in isolation forests

Isolation forest or "iForest" is an intuitive and widely used algorithm ...

Please sign up or login with your details

Forgot password? Click here to reset