A Mathematical Assessment of the Isolation Tree Method for Data Anomaly Detection in Big Data

04/09/2020
by Fernando A. Morales, et al.

We present a mathematical analysis of the Isolation Random Forest Method (IRF Method) for anomaly detection, introduced in F. T. Liu, K. M. Ting, Z.-H. Zhou, Isolation-based anomaly detection, TKDD 6 (2012) 3:1–3:39. We prove that the IRF space can be endowed with a probability induced by the Isolation Tree algorithm (iTree). In this setting, we prove the convergence of the IRF method using the Law of Large Numbers. A couple of counterexamples show that the method is inconclusive and that no certificate of quality can be given when it is used to detect anomalies. We therefore propose a more robust version of the method, whose mathematical foundation is fully justified. Finally, numerical experiments compare the performance of the classic method with that of the proposed one.
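
To make the role of the Law of Large Numbers concrete, the following is a minimal sketch of the classic isolation idea on one-dimensional data, not the robust variant proposed in the paper. It assumes a simplified iTree that splits uniformly at random between the current minimum and maximum and follows only the branch containing the point of interest. The function names `isolation_depth` and `mean_isolation_depth`, the toy data set, and the number of trees are all illustrative assumptions.

```python
import random


def isolation_depth(data, target, rng, depth=0):
    """Count the random splits needed to isolate `target` in one random 1-D tree.

    Simplified sketch: split uniformly between min and max, then recurse
    only into the side that contains `target`.
    """
    if len(data) <= 1:
        return depth
    lo, hi = min(data), max(data)
    if lo == hi:
        # All remaining points coincide, so no further split is possible.
        return depth
    split = rng.uniform(lo, hi)
    if target < split:
        side = [x for x in data if x < split]
    else:
        side = [x for x in data if x >= split]
    return isolation_depth(side, target, rng, depth + 1)


def mean_isolation_depth(data, target, n_trees, seed=0):
    """Average the isolation depth over `n_trees` independent random trees.

    Each tree yields an i.i.d. sample of the depth, so this sample mean
    stabilizes as `n_trees` grows (Law of Large Numbers).
    """
    rng = random.Random(seed)
    total = sum(isolation_depth(data, target, rng) for _ in range(n_trees))
    return total / n_trees


# Toy data (assumed for illustration): a tight cluster plus one far-away point.
data = [0.0, 0.1, 0.2, 0.3, 0.4, 0.5, 10.0]
outlier_depth = mean_isolation_depth(data, 10.0, n_trees=2000)
inlier_depth = mean_isolation_depth(data, 0.2, n_trees=2000)
print(f"mean depth of 10.0: {outlier_depth:.3f}")
print(f"mean depth of 0.2:  {inlier_depth:.3f}")
```

In this toy setting the isolated point is separated after fewer splits on average than a point inside the cluster, which is the signal the classic method reads as anomalous. The sketch says nothing about the counterexamples in the paper, where this signal fails to be conclusive.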


