Factor Analysis of Mixed Data for Anomaly Detection

05/25/2020
by   Matthew Davidow, et al.

Anomaly detection aims to identify observations that deviate from the typical pattern of data. In practice, anomalous observations may correspond to financial fraud, health risks, or incorrectly measured data. We show that anomaly detection in high-dimensional mixed data improves when the data are first embedded and an anomaly scoring scheme is then applied to the embedding. We focus on unsupervised detection for data with both continuous and categorical (mixed) variables. We propose a kurtosis-weighted Factor Analysis of Mixed Data for anomaly detection, FAMDAD, to obtain a continuous embedding for anomaly scoring. We illustrate that anomalies are highly separable in the first and last few ordered dimensions of this space, and we test various anomaly scoring schemes within this subspace. Results are illustrated for both simulated and real datasets, and the proposed approach (FAMDAD) is highly accurate for high-dimensional mixed data throughout these diverse scenarios.
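
The embed-then-score idea in the abstract can be illustrated with a minimal Python sketch. This is not the authors' FAMDAD implementation: the abstract does not specify how the kurtosis weights enter, so the weighting below (normalized kurtosis of each retained component, applied to squared standardized scores) is an assumption, as are the function names `famd_embed`, `kurtosis`, and `anomaly_scores`, the parameter `k`, and the toy data. The embedding follows the standard FAMD construction: z-scored continuous columns combined with indicator columns scaled by the inverse square root of each category's proportion, followed by PCA.

```python
import numpy as np

def famd_embed(X_num, X_cat):
    """FAMD-style embedding: PCA on z-scored numeric columns plus
    indicator columns scaled by 1/sqrt(category proportion)."""
    blocks = [(X_num - X_num.mean(axis=0)) / X_num.std(axis=0)]
    for j in range(X_cat.shape[1]):
        levels = np.unique(X_cat[:, j])
        onehot = (X_cat[:, [j]] == levels[None, :]).astype(float)
        p = onehot.mean(axis=0)                 # category proportions
        blocks.append((onehot - p) / np.sqrt(p))
    M = np.hstack(blocks)                       # every column has mean zero
    U, s, _ = np.linalg.svd(M, full_matrices=False)
    keep = s > 1e-8 * s[0]                      # drop numerically null directions
    return U[:, keep] * s[keep]                 # scores, ordered by variance

def kurtosis(F):
    """Non-excess sample kurtosis of each column."""
    Fc = F - F.mean(axis=0)
    return (Fc ** 4).mean(axis=0) / (Fc ** 2).mean(axis=0) ** 2

def anomaly_scores(F, k=2):
    """ASSUMED scoring rule: kurtosis-weighted sum of squared standardized
    scores over the first k and last k components."""
    d = F.shape[1]
    idx = np.unique(np.concatenate([np.arange(min(k, d)),
                                    np.arange(max(d - k, 0), d)]))
    S = F[:, idx]
    Z = (S - S.mean(axis=0)) / S.std(axis=0)
    w = kurtosis(S)
    return (Z ** 2 * (w / w.sum())).sum(axis=1)

# Hypothetical toy data: two strongly correlated numeric variables, one
# independent numeric variable, and two categorical variables.
rng = np.random.default_rng(0)
n = 300
x1 = rng.normal(size=n)
x2 = x1 + 0.1 * rng.normal(size=n)
x3 = rng.normal(size=n)
X_num = np.column_stack([x1, x2, x3])
X_cat = np.column_stack([rng.integers(0, 3, size=n),
                         rng.integers(0, 2, size=n)])
X_num[0, :2] = [3.0, -3.0]   # row 0 violates the x1/x2 correlation

F = famd_embed(X_num, X_cat)
scores = anomaly_scores(F, k=2)
```

In this toy setting the injected row breaks a correlation rather than taking an extreme marginal value, so it loads on a low-variance (late) component. That is one intuition for why the last few ordered dimensions, and not only the first few, can separate anomalies.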

