Scalable k-NN graph construction

07/30/2013
by   Jingdong Wang, et al.
0

The k-NN graph has played a central role in increasingly popular data-driven techniques for various learning and vision tasks; yet, finding an efficient and effective way to construct k-NN graphs remains a challenge, especially for large-scale high-dimensional data. In this paper, we propose a new approach to construct approximate k-NN graphs with emphasis in: efficiency and accuracy. We hierarchically and randomly divide the data points into subsets and build an exact neighborhood graph over each subset, achieving a base approximate neighborhood graph; we then repeat this process for several times to generate multiple neighborhood graphs, which are combined to yield a more accurate approximate neighborhood graph. Furthermore, we propose a neighborhood propagation scheme to further enhance the accuracy. We show both theoretical and empirical accuracy and efficiency of our approach to k-NN graph construction and demonstrate significant speed-up in dealing with large scale visual data.

READ FULL TEXT

page 5

page 14

research
08/02/2019

On the Merge of k-NN Graph

K-nearest neighbor graph is the fundamental data structure in many disci...
research
03/29/2021

Large-Scale Approximate k-NN Graph Construction on GPU

k-nearest neighbor graph is a key data structure in many disciplines suc...
research
04/02/2022

Graph-based Approximate NN Search: A Revisit

Nearest neighbor search plays a fundamental role in many disciplines suc...
research
06/19/2015

A general framework for the IT-based clustering methods

Previously, we proposed a physically inspired rule to organize the data ...
research
10/08/2019

Accurate and Fast Retrieval for Complex Non-metric Data via Neighborhood Graphs

We demonstrate that a graph-based search algorithm-relying on the constr...
research
12/11/2011

Graph Construction for Learning with Unbalanced Data

Unbalanced data arises in many learning tasks such as clustering of mult...
research
04/02/2021

A Comparison of Similarity Based Instance Selection Methods for Cross Project Defect Prediction

Context: Previous studies have shown that training data instance selecti...

Please sign up or login with your details

Forgot password? Click here to reset