On the Merge of k-NN Graph

08/02/2019
by   Peng-Cheng Lin, et al.
0

K-nearest neighbor graph is the fundamental data structure in many disciplines such as information retrieval, data-mining, pattern recognition and machine learning, etc. In the literature, considerable research has been focusing on how to efficiently build an approximate k-nearest neighbor graph (k-NN graph) for a fixed dataset. Unfortunately, a closely related issue to the approximate k-NN graph construction has been long overlooked. Namely, few literature covers about how to merge existing k-NN graphs. In this paper, we address the k-NN graph merge issue of two different scenarios. One one hand, we address the problem of merging two existing graphs into one by the proposed peer merge. This makes parallel approximate k-NN graph computation in large-scale become possible. On the other hand, the problem of merging a raw set into a built k-NN graph is also addressed by the proposed joint merge. It enables the approximate k-NN graph to be built incrementally. Thus it supports approximate k-NN graph construction for an open set. Moreover, deriving from joint merge, an hierarchical graph construction approach is presented. With the support of produced graph hierarchy, superior performance is observed on the large-scale NN search task across various data types and various data dimensions, under different distance measures.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/29/2021

Large-Scale Approximate k-NN Graph Construction on GPU

k-nearest neighbor graph is a key data structure in many disciplines suc...
research
07/30/2013

Scalable k-NN graph construction

The k-NN graph has played a central role in increasingly popular data-dr...
research
04/02/2022

Graph-based Approximate NN Search: A Revisit

Nearest neighbor search plays a fundamental role in many disciplines suc...
research
04/03/2019

A Comparative Study on Hierarchical Navigable Small World Graphs

Hierarchical navigable small world (HNSW) graphs get more and more popul...
research
07/01/2010

Survey of Nearest Neighbor Techniques

The nearest neighbor (NN) technique is very simple, highly efficient and...
research
06/11/2019

k-Nearest Neighbor Optimization via Randomized Hyperstructure Convex Hull

In the k-nearest neighbor algorithm (k-NN), the determination of classes...
research
09/02/2019

An Adjusted Nearest Neighbor Algorithm Maximizing the F-Measure from Imbalanced Data

In this paper, we address the challenging problem of learning from imbal...

Please sign up or login with your details

Forgot password? Click here to reset