Supervised Learning Approach to Approximate Nearest Neighbor Search

10/18/2019
by   Ville Hyvönen, et al.
0

Approximate nearest neighbor search is a classic algorithmic problem where the goal is to design an efficient index structure for fast approximate nearest neighbor queries. We show that it can be framed as a classification problem and solved by training a suitable multi-label classifier and using it as an index. Compared to the existing algorithms, this supervised learning approach has several advantages: it enables adapting an index to the query distribution when the query distribution and the corpus distribution differ; it allows using training sets larger than the corpus; and in principle it enables using any multi-label classifier for approximate nearest neighbor search. We demonstrate these advantages on multiple synthetic and real-world data sets by using a random forest and an ensemble of random projection trees as the base classifiers.

READ FULL TEXT
research
12/06/2018

Confirmation Sampling for Exact Nearest Neighbor Search

Locality-sensitive hashing (LSH), introduced by Indyk and Motwani in STO...
research
02/12/2018

Revisiting the Vector Space Model: Sparse Weighted Nearest-Neighbor Method for Extreme Multi-Label Classification

Machine learning has played an important role in information retrieval (...
research
06/28/2019

PUFFINN: Parameterless and Universally Fast FInding of Nearest Neighbors

We present PUFFINN, a parameterless LSH-based index for solving the k-ne...
research
12/18/2018

Efficient Autotuning of Hyperparameters in Approximate Nearest Neighbor Search

Approximate nearest neighbor algorithms are used to speed up nearest nei...
research
07/19/2023

Fast Approximate Nearest Neighbor Search with a Dynamic Exploration Graph using Continuous Refinement

For approximate nearest neighbor search, graph-based algorithms have sho...
research
09/23/2015

Fast k-NN search

Efficient index structures for fast approximate nearest neighbor queries...
research
12/31/2018

K-nearest Neighbor Search by Random Projection Forests

K-nearest neighbor (kNN) search has wide applications in many areas, inc...

Please sign up or login with your details

Forgot password? Click here to reset