Robust Subspace Outlier Detection in High Dimensional Space

05/05/2014
by   Zhana Bao, et al.
0

Rare data in a large-scale database are called outliers that reveal significant information in the real world. The subspace-based outlier detection is regarded as a feasible approach in very high dimensional space. However, the outliers found in subspaces are only part of the true outliers in high dimensional space, indeed. The outliers hidden in normal-clustered points are sometimes neglected in the projected dimensional subspace. In this paper, we propose a robust subspace method for detecting such inner outliers in a given dataset, which uses two dimensional-projections: detecting outliers in subspaces with local density ratio in the first projected dimensions; finding outliers by comparing neighbor's positions in the second projected dimensions. Each point's weight is calculated by summing up all related values got in the two steps projected dimensions, and then the points scoring the largest weight values are taken as outliers. By taking a series of experiments with the number of dimensions from 10 to 10000, the results show that our proposed method achieves high precision in the case of extremely high dimensional space, and works well in low dimensional space.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/05/2014

Finding Inner Outliers in High Dimensional Space

Outlier detection in a large-scale database is a significant and complex...
research
05/05/2014

K-NS: Section-Based Outlier Detection in High Dimensional Space

Finding rare information hidden in a huge amount of data from the Intern...
research
09/04/2019

Theory of high-dimensional outliers

This study concerns the issue of high dimensional outliers which are cha...
research
11/01/2016

Local Subspace-Based Outlier Detection using Global Neighbourhoods

Outlier detection in high-dimensional data is a challenging yet importan...
research
02/16/2015

Random Subspace Learning Approach to High-Dimensional Outliers Detection

We introduce and develop a novel approach to outlier detection based on ...
research
10/29/2018

Feature Bagging for Steganographer Identification

Traditional steganalysis algorithms focus on detecting the existence of ...
research
11/01/2022

Typical Yet Unlikely: Using Information Theoretic Approaches to Identify Outliers which Lie Close to the Mean

Normality, in the colloquial sense, has historically been considered an ...

Please sign up or login with your details

Forgot password? Click here to reset