Lightweight-Yet-Efficient: Revitalizing Ball-Tree for Point-to-Hyperplane Nearest Neighbor Search

02/21/2023
by   Qiang Huang, et al.
0

Finding the nearest neighbor to a hyperplane (or Point-to-Hyperplane Nearest Neighbor Search, simply P2HNNS) is a new and challenging problem with applications in many research domains. While existing state-of-the-art hashing schemes (e.g., NH and FH) are able to achieve sublinear time complexity without the assumption of the data being in a unit hypersphere, they require an asymmetric transformation, which increases the data dimension from d to Ω(d^2). This leads to considerable overhead for indexing and incurs significant distortion errors. In this paper, we investigate a tree-based approach for solving P2HNNS using the classical Ball-Tree index. Compared to hashing-based methods, tree-based methods usually require roughly linear costs for construction, and they provide different kinds of approximations with excellent flexibility. A simple branch-and-bound algorithm with a novel lower bound is first developed on Ball-Tree for performing P2HNNS. Then, a new tree structure named BC-Tree, which maintains the Ball and Cone structures in the leaf nodes of Ball-Tree, is described together with two effective strategies, i.e., point-level pruning and collaborative inner product computing. BC-Tree inherits both the low construction cost and lightweight property of Ball-Tree while providing a similar or more efficient search. Experimental results over 16 real-world data sets show that Ball-Tree and BC-Tree are around 1.1∼10× faster than NH and FH, and they can reduce the index size and indexing time by about 1∼3 orders of magnitudes on average. The code is available at <https://github.com/HuangQiang/BC-Tree>.

READ FULL TEXT
research
07/11/2023

Quantitative Comparison of Nearest Neighbor Search Algorithms

We compare the performance of three nearest neighbor search algorithms: ...
research
01/24/2019

Learning Sublinear-Time Indexing for Nearest Neighbor Search

Most of the efficient sublinear-time indexing algorithms for the high-di...
research
12/15/2022

Exact fixed-radius nearest neighbor search with an application to clustering

Fixed-radius nearest-neighbor search is a common database operation that...
research
05/28/2020

A Practical Index Structure Supporting Fréchet Proximity Queries Among Trajectories

We present a scalable approach for range and k nearest neighbor queries ...
research
11/07/2019

Efficient Spatial Nearest Neighbor Queries Based on Multi-layer Voronoi Diagrams

Nearest neighbor (NN) problem is an important scientific problem. The NN...
research
12/20/2021

The Cascading Metric Tree

This paper presents the Cascaded Metric Tree (CMT) for efficient satisfa...
research
03/02/2023

GBMST: An Efficient Minimum Spanning Tree Clustering Based on Granular-Ball Computing

Most of the existing clustering methods are based on a single granularit...

Please sign up or login with your details

Forgot password? Click here to reset