Robust Contextual Outlier Detection: Where Context Meets Sparsity

07/28/2016
by   Jiongqian Liang, et al.
0

Outlier detection is a fundamental data science task with applications ranging from data cleaning to network security. Given the fundamental nature of the task, this has been the subject of much research. Recently, a new class of outlier detection algorithms has emerged, called contextual outlier detection, and has shown improved performance when studying anomalous behavior in a specific context. However, as we point out in this article, such approaches have limited applicability in situations where the context is sparse (i.e. lacking a suitable frame of reference). Moreover, approaches developed to date do not scale to large datasets. To address these problems, here we propose a novel and robust approach alternative to the state-of-the-art called RObust Contextual Outlier Detection (ROCOD). We utilize a local and global behavioral model based on the relevant contexts, which is then integrated in a natural and robust fashion. We also present several optimizations to improve the scalability of the approach. We run ROCOD on both synthetic and real-world datasets and demonstrate that it outperforms other competitive baselines on the axes of efficacy and efficiency (40X speedup compared to modern contextual outlier detection methods). We also drill down and perform a fine-grained analysis to shed light on the rationale for the performance gains of ROCOD and reveal its effectiveness when handling objects with sparse contexts.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/17/2016

Outlier Detection on Mixed-Type Data: An Energy-based Approach

Outlier detection amounts to finding data points that differ significant...
research
07/31/2019

Are Outlier Detection Methods Resilient to Sampling?

Outlier detection is a fundamental task in data mining and has many appl...
research
03/14/2023

RODD: Robust Outlier Detection in Data Cubes

Data cubes are multidimensional databases, often built from several sepa...
research
03/12/2020

PyODDS: An End-to-end Outlier Detection System with Automated Machine Learning

Outlier detection is an important task for various data mining applicati...
research
06/07/2020

AdaLAM: Revisiting Handcrafted Outlier Detection

Local feature matching is a critical component of many computer vision p...
research
01/27/2021

Wisdom of the Contexts: Active Ensemble Learning for Contextual Anomaly Detection

In contextual anomaly detection (CAD), an object is only considered anom...
research
11/13/2018

Nonparametric geometric outlier detection

Outlier detection is a major topic in robust statistics due to the high ...

Please sign up or login with your details

Forgot password? Click here to reset