Local Subspace-Based Outlier Detection using Global Neighbourhoods

11/01/2016
by Bas van Stein, et al.

Outlier detection in high-dimensional data is a challenging yet important task, with applications in areas such as fraud detection and quality control. State-of-the-art density-based algorithms perform well because they 1) take the local neighbourhoods of data points into account and 2) consider feature subspaces. In highly complex, high-dimensional data, however, existing methods are likely to overlook important outliers because they do not explicitly account for the fact that the data often follow a mixture distribution with multiple components. We therefore introduce GLOSS, an algorithm that performs local subspace outlier detection using global neighbourhoods. Experiments on synthetic data demonstrate that GLOSS detects local outliers in mixed data more accurately than its competitors. Moreover, experiments on real-world data show that our approach identifies relevant outliers overlooked by existing methods, confirming that one should keep the global perspective in view even when doing local outlier detection.
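
To illustrate why density-based methods take local neighbourhoods into account when the data come from a mixture, the following is a minimal sketch in plain Python. It is not the GLOSS algorithm and it ignores feature subspaces entirely. It compares a global k-nearest-neighbour distance with a simplified, LOF-style local ratio on a toy two-component data set. The data, the value of `K`, and the helper names `knn`, `k_distance` and `local_score` are all assumptions made for this illustration.

```python
import math

def knn(points, i, k):
    """Return sorted (distance, index) pairs for the k nearest neighbours of points[i]."""
    dists = [(math.dist(points[i], q), j) for j, q in enumerate(points) if j != i]
    return sorted(dists)[:k]

def k_distance(points, i, k):
    """Distance from points[i] to its k-th nearest neighbour (a global score)."""
    return knn(points, i, k)[-1][0]

def local_score(points, i, k):
    """Simplified local score: own k-distance divided by the mean k-distance
    of the k nearest neighbours. Values well above 1 suggest a local outlier.
    This is a hypothetical stand-in for density-based scores, not LOF itself."""
    neighbours = knn(points, i, k)
    mean_nb = sum(k_distance(points, j, k) for _, j in neighbours) / k
    return k_distance(points, i, k) / mean_nb

# Toy mixture (assumed data): a dense component, a sparse component,
# and one point that lies just outside the dense component.
dense = [(0.1 * x, 0.1 * y) for x in range(5) for y in range(5)]
sparse = [(10 + 2.0 * x, 10 + 2.0 * y) for x in range(5) for y in range(5)]
outlier = (1.0, 1.0)
points = dense + sparse + [outlier]
K = 3

global_scores = [k_distance(points, i, K) for i in range(len(points))]
local_scores = [local_score(points, i, K) for i in range(len(points))]

out = len(points) - 1
top_local = max(range(len(points)), key=lambda i: local_scores[i])
print("index with highest local score:", top_local, "(outlier index:", out, ")")
print("outlier global k-distance:", round(global_scores[out], 3))
print("smallest global k-distance in sparse component:",
      round(min(global_scores[len(dense):len(dense) + len(sparse)]), 3))
```

In this toy setting, the global k-distance ranks every point of the sparse component above the injected point, whereas the local ratio singles the injected point out. The sketch only motivates the local view; how GLOSS combines it with global neighbourhoods and subspaces is described in the full text, not here.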


