Random Subspace Mixture Models for Interpretable Anomaly Detection

08/13/2021
by   Cetin Savkli, et al.
0

We present a new subspace-based method to construct probabilistic models for high-dimensional data and highlight its use in anomaly detection. The approach is based on a statistical estimation of probability density using densities of random subspaces combined with geometric averaging. In selecting random subspaces, equal representation of each attribute is used to ensure correct statistical limits. Gaussian mixture models (GMMs) are used to create the probability densities for each subspace with techniques included to mitigate singularities allowing for the ability to handle both numerical and categorial attributes. The number of components for each GMM is determined automatically through Bayesian information criterion to prevent overfitting. The proposed algorithm attains competitive AUC scores compared with prominent algorithms against benchmark anomaly detection datasets with the added benefits of being simple, scalable, and interpretable.

READ FULL TEXT
research
09/12/2016

Online Data Thinning via Multi-Subspace Tracking

In an era of ubiquitous large-scale streaming data, the availability of ...
research
03/31/2023

Time-series Anomaly Detection based on Difference Subspace between Signal Subspaces

This paper proposes a new method for anomaly detection in time-series da...
research
01/13/2021

A Non-Parametric Subspace Analysis Approach with Application to Anomaly Detection Ensembles

Identifying anomalies in multi-dimensional datasets is an important task...
research
06/06/2023

High-dimensional and Permutation Invariant Anomaly Detection

Methods for anomaly detection of new physics processes are often limited...
research
12/01/2022

Attribute-based Representations for Accurate and Interpretable Video Anomaly Detection

Video anomaly detection (VAD) is a challenging computer vision task with...
research
03/28/2022

FlexFringe: Modeling Software Behavior by Learning Probabilistic Automata

We present the efficient implementations of probabilistic deterministic ...
research
04/18/2023

Estimating Joint Probability Distribution With Low-Rank Tensor Decomposition, Radon Transforms and Dictionaries

In this paper, we describe a method for estimating the joint probability...

Please sign up or login with your details

Forgot password? Click here to reset