Distance approximation using Isolation Forests

10/27/2019
by   David Cortes, et al.

This work briefly explores the possibility of approximating spatial distance (alternatively, similarity) between data points using the Isolation Forest method, which was originally designed for outlier detection. The logic is similar to that of isolation: the more similar or closer two points are, the more random splits it takes to separate them. The separation depth between two points can be standardized in the same way as the isolation depth, transforming it into a distance metric that is limited in range, centered, and compliant with the axioms of distance. This metric has some desirable properties that other metrics such as Euclidean or Mahalanobis distance lack, such as being invariant to the scales of variables and being able to account for non-linear relationships between variables. Extensions to the Isolation Forest method are also proposed for handling categorical variables and missing values, resulting in a more generalizable and robust metric.
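The separation-depth idea can be illustrated with a minimal Python sketch. This is not the paper's implementation: the function names (`build_tree`, `build_forest`, `separation_depth`, `approx_distance`), the parameter defaults, and the normalizing constant `norm` are all assumptions made for illustration, and the mapping `2 ** (-depth / norm)` only mimics the general shape of the isolation-depth standardization rather than reproducing the paper's exact formula. Categorical variables and missing values are not handled.

```python
import random

def build_tree(points, rng, depth=0, max_depth=10):
    """Grow one isolation tree: random feature, uniform random threshold."""
    if len(points) <= 1 or depth >= max_depth:
        return None  # leaf node
    dim = rng.randrange(len(points[0]))
    lo = min(p[dim] for p in points)
    hi = max(p[dim] for p in points)
    if lo == hi:
        return None  # simplification: stop if the chosen feature is constant
    thr = rng.uniform(lo, hi)
    left = [p for p in points if p[dim] <= thr]
    right = [p for p in points if p[dim] > thr]
    return (dim, thr,
            build_tree(left, rng, depth + 1, max_depth),
            build_tree(right, rng, depth + 1, max_depth))

def build_forest(points, n_trees=100, sample_size=64, seed=0):
    """Fit several trees, each on a random subsample of the data."""
    rng = random.Random(seed)
    size = min(sample_size, len(points))
    return [build_tree(rng.sample(points, size), rng) for _ in range(n_trees)]

def separation_depth(tree, a, b):
    """Count the splits that a and b pass through together in one tree."""
    depth = 0
    while tree is not None:
        dim, thr, left, right = tree
        a_left, b_left = a[dim] <= thr, b[dim] <= thr
        if a_left != b_left:
            break  # this split separates the two points
        depth += 1
        tree = left if a_left else right
    return depth

def approx_distance(forest, a, b, norm=2.0):
    """Map the average separation depth to (0, 1]; `norm` is a placeholder."""
    avg = sum(separation_depth(t, a, b) for t in forest) / len(forest)
    return 2.0 ** (-avg / norm)

# Hypothetical data: 200 points drawn uniformly from the unit square.
data_rng = random.Random(42)
data = [(data_rng.random(), data_rng.random()) for _ in range(200)]
forest = build_forest(data)

d_near = approx_distance(forest, (0.50, 0.50), (0.51, 0.50))
d_far = approx_distance(forest, (0.05, 0.05), (0.95, 0.95))
```

In this sketch, points that are close together tend to share many splits before being separated, so `d_near` comes out smaller than `d_far`. Because each split depends only on the ordering of values along one feature, rescaling a variable does not change the result in expectation, which is the scale-invariance property mentioned above.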


research
11/23/2021

Isolation forests: looking beyond tree depth

The isolation forest algorithm for outlier detection exploits a simple y...
research
09/20/2023

Distribution and volume based scoring for Isolation Forests

We make two contributions to the Isolation Forest method for anomaly and...
research
12/05/2022

AIDA: Analytic Isolation and Distance-based Anomaly Detection Algorithm

We combine the metrics of distance and isolation to develop the Analytic...
research
09/01/2021

Tukey's Depth for Object Data

We develop a novel exploratory tool for non-Euclidean object data based ...
research
10/26/2021

Revisiting randomized choices in isolation forests

Isolation forest or "iForest" is an intuitive and widely used algorithm ...
research
08/13/2017

Mahalanobis Distance Informed by Clustering

A fundamental question in data analysis, machine learning and signal pro...
research
05/15/2023

A Sweep-plane Algorithm for Calculating the Isolation of Mountains

One established metric to classify the significance of a mountain peak i...
