Weighted Random Cut Forest Algorithm for Anomaly Detections

02/01/2022
by   Sijin Yeom, et al.
0

Random cut forest (RCF) algorithms have been developed for anomaly detection, particularly for the anomaly detection in time-series data. The RCF algorithm is the improved version of the isolation forest algorithm. Unlike the isolation forest algorithm, the RCF algorithm has the power of determining whether the real-time input has anomaly by inserting the input in the constructed tree network. There have been developed various RCF algorithms including Robust RCF (RRCF) with which the cutting procedure is adaptively chosen probabilistically. RRCF shows better performance compared to the isolation forest as the cutting dimension is decided based on the geometric range of the data. The overall data structure is, however, not considered in the adaptive cutting algorithm with the RRCF. In this paper, we propose a new RCF, so-called the weighted RCF (WRCF). In order to introduce the WRCF, we first introduce a new geometric measure, i.e., a density measure which is crucial for the construction of the WRCF. We provide various mathematical properties of the density measure. The proposed WRCF also cuts the tree network adaptively, but with consideration of the denseness of the data. The proposed method is more efficient when the data is structured and achieves the desired anomaly score more rapidly than the RRCF. We provide theorems that prove our claims with numerical examples.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/08/2020

Isolation Mondrian Forest for Batch and Online Anomaly Detection

We propose a new method, named isolation Mondrian forest (iMondrian fore...
research
04/09/2020

A Mathematical Assessment of the Isolation Tree Method for Data Anomaly Detection in Big Data

We present the mathematical analysis of the Isolation Random Forest Meth...
research
04/09/2020

A Mathematical Assessment of the Isolation Tree Method for Data Anomaly Detection

We present the mathematical analysis of the Isolation Random Forest Meth...
research
11/06/2018

Extended Isolation Forest

We present an extension to the model-free anomaly detection algorithm, I...
research
06/22/2023

OptIForest: Optimal Isolation Forest for Anomaly Detection

Anomaly detection plays an increasingly important role in various fields...
research
06/18/2022

Reduced Robust Random Cut Forest for Out-Of-Distribution detection in machine learning models

Most machine learning-based regressors extract information from data col...
research
09/20/2023

Distribution and volume based scoring for Isolation Forests

We make two contributions to the Isolation Forest method for anomaly and...

Please sign up or login with your details

Forgot password? Click here to reset