A Non-Parametric Subspace Analysis Approach with Application to Anomaly Detection Ensembles

01/13/2021
by   Marcelo Bacher, et al.
0

Identifying anomalies in multi-dimensional datasets is an important task in many real-world applications. A special case arises when anomalies are occluded in a small set of attributes, typically referred to as a subspace, and not necessarily over the entire data space. In this paper, we propose a new subspace analysis approach named Agglomerative Attribute Grouping (AAG) that aims to address this challenge by searching for subspaces that are comprised of highly correlative attributes. Such correlations among attributes represent a systematic interaction among the attributes that can better reflect the behavior of normal observations and hence can be used to improve the identification of two particularly interesting types of abnormal data samples: anomalies that are occluded in relatively small subsets of the attributes and anomalies that represent a new data class. AAG relies on a novel multi-attribute measure, which is derived from information theory measures of partitions, for evaluating the "information distance" between groups of data attributes. To determine the set of subspaces to use, AAG applies a variation of the well-known agglomerative clustering algorithm with the proposed multi-attribute measure as the underlying distance function. Finally, the set of subspaces is used in an ensemble for anomaly detection. Extensive evaluation demonstrates that, in the vast majority of cases, the proposed AAG method (i) outperforms classical and state-of-the-art subspace analysis methods when used in anomaly detection ensembles, and (ii) generates fewer subspaces with a fewer number of attributes each (on average), thus resulting in a faster training time for the anomaly detection ensemble. Furthermore, in contrast to existing methods, the proposed AAG method does not require any tuning of parameters.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/07/2022

Red PANDA: Disambiguating Anomaly Detection by Removing Nuisance Factors

Anomaly detection methods strive to discover patterns that differ from t...
research
08/27/2020

The Impact of Discretization Method on the Detection of Six Types of Anomalies in Datasets

Anomaly detection is the process of identifying cases, or groups of case...
research
06/02/2023

GAD-NR: Graph Anomaly Detection via Neighborhood Reconstruction

Graph Anomaly Detection (GAD) is a technique used to identify abnormal n...
research
08/13/2021

Random Subspace Mixture Models for Interpretable Anomaly Detection

We present a new subspace-based method to construct probabilistic models...
research
08/16/2020

SECODA: Segmentation- and Combination-Based Detection of Anomalies

This study introduces SECODA, a novel general-purpose unsupervised non-p...
research
08/16/2019

GODS: Generalized One-class Discriminative Subspaces for Anomaly Detection

One-class learning is the classic problem of fitting a model to data for...
research
02/17/2016

Anomaly Detection in Clutter using Spectrally Enhanced Ladar

Discrete return (DR) Laser Detection and Ranging (Ladar) systems provide...

Please sign up or login with your details

Forgot password? Click here to reset