Approximate Principal Direction Trees

06/18/2012
by   Mark McCartin-Lim, et al.
0

We introduce a new spatial data structure for high dimensional data called the approximate principal direction tree (APD tree) that adapts to the intrinsic dimension of the data. Our algorithm ensures vector-quantization accuracy similar to that of computationally-expensive PCA trees with similar time-complexity to that of lower-accuracy RP trees. APD trees use a small number of power-method iterations to find splitting planes for recursively partitioning the data. As such they provide a natural trade-off between the running-time and accuracy achieved by RP and PCA trees. Our theoretical results establish a) strong performance guarantees regardless of the convergence rate of the power-method and b) that O( d) iterations suffice to establish the guarantee of PCA trees when the intrinsic dimension is d. We demonstrate this trade-off and the efficacy of our data structure on both the CPU and GPU.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/09/2012

Which Spatial Partition Trees are Adaptive to Intrinsic Dimension?

Recent theory work has found that a special type of spatial partition tr...
research
11/17/2012

Data Clustering via Principal Direction Gap Partitioning

We explore the geometrical interpretation of the PCA based clustering al...
research
10/19/2010

Random Projection Trees Revisited

The Random Projection Tree structures proposed in [Freund-Dasgupta STOC0...
research
07/02/2020

High Dimensional Bayesian Optimization Assisted by Principal Component Analysis

Bayesian Optimization (BO) is a surrogate-assisted global optimization t...
research
02/28/2021

Weighted Ancestors in Suffix Trees Revisited

The weighted ancestor problem is a well-known generalization of the pred...
research
02/25/2023

The Effect of Points Dispersion on the k-nn Search in Random Projection Forests

Partitioning trees are efficient data structures for k-nearest neighbor ...
research
07/11/2022

Shapley Computations Using Surrogate Model-Based Trees

Shapley-related techniques have gained attention as both global and loca...

Please sign up or login with your details

Forgot password? Click here to reset