Scalable Nearest Neighbor Search based on kNN Graph

01/30/2017
by   Wan-Lei Zhao, et al.
Nearest neighbor search is a challenging problem that has been studied for several decades. It has recently become more pressing as big data problems arise in various fields. In this paper, a scalable solution based on a hill-climbing strategy supported by a k-nearest neighbor (kNN) graph is presented. Two major issues are considered. First, an efficient kNN graph construction method based on a two means tree is presented. Second, for the nearest neighbor search, an enhanced hill-climbing procedure is proposed, which shows a considerable performance boost over the original procedure. Furthermore, with the support of inverted indexing derived from residue vector quantization, our method achieves close to 100 speed efficiency in two state-of-the-art evaluation benchmarks. In addition, a comparative study of both compressional and traditional nearest neighbor search methods is presented. We show that our method achieves the best trade-off between search quality, efficiency and memory complexity.
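To make the idea of hill-climbing over a kNN graph concrete, here is a minimal Python sketch of the basic procedure only. It is not the enhanced procedure proposed in the paper. It assumes a brute-force graph construction in place of the two means tree, Euclidean distance, and random restarts, and it omits the inverted index entirely. The names `build_knn_graph`, `hill_climb_search`, and `n_restarts` are hypothetical and chosen for illustration.

```python
import math
import random


def build_knn_graph(points, k):
    """Brute-force kNN graph (an assumption; the paper uses a two means tree).

    Returns, for each point, the indices of its k closest other points.
    """
    graph = []
    for i, p in enumerate(points):
        others = sorted(
            (j for j in range(len(points)) if j != i),
            key=lambda j: math.dist(p, points[j]),
        )
        graph.append(others[:k])
    return graph


def hill_climb_search(points, graph, query, n_restarts=5, seed=0):
    """Basic greedy hill-climbing on the graph from several random start nodes.

    Returns (index, distance) of the best node found. The result is
    approximate: a walk can stop at a local optimum of the graph.
    """
    rng = random.Random(seed)
    best_idx, best_dist = -1, math.inf
    for _ in range(n_restarts):
        current = rng.randrange(len(points))
        current_dist = math.dist(query, points[current])
        while True:
            # Move to the neighbor closest to the query, if it improves.
            nb = min(graph[current], key=lambda j: math.dist(query, points[j]))
            nb_dist = math.dist(query, points[nb])
            if nb_dist >= current_dist:
                break  # no neighbor is closer: local optimum reached
            current, current_dist = nb, nb_dist
        if current_dist < best_dist:
            best_idx, best_dist = current, current_dist
    return best_idx, best_dist


# Usage on a small random 2-D set (illustrative data only).
data_rng = random.Random(42)
sample = [(data_rng.random(), data_rng.random()) for _ in range(200)]
sample_graph = build_knn_graph(sample, k=8)
found_idx, found_dist = hill_climb_search(sample, sample_graph, (0.5, 0.5))
```

Each walk only evaluates distances to the neighbors of the nodes it visits, which is where the savings over exhaustive search come from. The price is that the answer is not guaranteed to be the true nearest neighbor.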


research
04/09/2018

k-NN Graph Construction: a Generic Online Approach

Nearest neighbor search and k-nearest neighbor graph construction are tw...
research
07/11/2023

Quantitative Comparison of Nearest Neighbor Search Algorithms

We compare the performance of three nearest neighbor search algorithms: ...
research
06/24/2019

An Empirical Comparison of FAISS and FENSHSES for Nearest Neighbor Search in Hamming Space

In this paper, we compare the performances of FAISS and FENSHSES on near...
research
12/18/2018

Efficient Autotuning of Hyperparameters in Approximate Nearest Neighbor Search

Approximate nearest neighbor algorithms are used to speed up nearest nei...
research
10/17/2021

Low-Precision Quantization for Efficient Nearest Neighbor Search

Fast k-Nearest Neighbor search over real-valued vector spaces (KNN) is a...
research
10/08/2016

Boost K-Means

Due to its simplicity and versatility, k-means remains popular since it ...
research
06/16/2022

Unsupervised Space Partitioning for Nearest Neighbor Search

Approximate Nearest Neighbor Search (ANNS) in high dimensional spaces is...
