MCODE: Multivariate Conditional Outlier Detection

05/15/2015
by   Charmgil Hong, et al.
0

Outlier detection aims to identify unusual data instances that deviate from expected patterns. The outlier detection is particularly challenging when outliers are context dependent and when they are defined by unusual combinations of multiple outcome variable values. In this paper, we develop and study a new conditional outlier detection approach for multivariate outcome spaces that works by (1) transforming the conditional detection to the outlier detection problem in a new (unconditional) space and (2) defining outlier scores by analyzing the data in the new space. Our approach relies on the classifier chain decomposition of the multi-dimensional classification problem that lets us transform the output space into a probability vector, one probability for each dimension of the output space. Outlier scores applied to these transformed vectors are then used to detect the outliers. Experiments on multiple multi-dimensional classification problems with the different outlier injection rates show that our methodology is robust and able to successfully identify outliers when outliers are either sparse (manifested in one or very few dimensions) or dense (affecting multiple dimensions).

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/21/2016

Detecting Unusual Input-Output Associations in Multivariate Conditional Data

Despite tremendous progress in outlier detection research in recent year...
research
11/01/2022

Typical Yet Unlikely: Using Information Theoretic Approaches to Identify Outliers which Lie Close to the Mean

Normality, in the colloquial sense, has historically been considered an ...
research
10/23/2019

Multiple outlier detection tests for parametric models

We propose a simple multiple outlier identification method for parametri...
research
08/03/2017

Detection of Abnormal Input-Output Associations

We study a novel outlier detection problem that aims to identify abnorma...
research
07/02/2020

Outlier Detection through Null Space Analysis of Neural Networks

Many machine learning classification systems lack competency awareness. ...
research
12/05/2019

Causal structure based root cause analysis of outliers

We describe a formal approach to identify 'root causes' of outliers obse...
research
02/07/2018

Outlier Detection for Robust Multi-dimensional Scaling

Multi-dimensional scaling (MDS) plays a central role in data-exploration...

Please sign up or login with your details

Forgot password? Click here to reset