Optimization of Indexing Based on k-Nearest Neighbor Graph for Proximity Search in High-dimensional Data

10/17/2018
by   Masajiro Iwasaki, et al.
0

Searching for high-dimensional vector data with high accuracy is an inevitable search technology for various types of data. Graph-based indexes are known to reduce the query time for high-dimensional data. To further improve the query time by using graphs, we focused on the indegrees and outdegrees of graphs. While a sufficient number of incoming edges (indegrees) are indispensable for increasing search accuracy, an excessive number of outgoing edges (outdegrees) should be suppressed so as to not increase the query time. Therefore, we propose three degree-adjustment methods: static degree adjustment of not only outdegrees but also indegrees, dynamic degree adjustment with which outdegrees are determined by the search accuracy users require, and path adjustment to remove edges that have alternative search paths to reduce outdegrees. We also show how to obtain optimal degree-adjustment parameters and that our methods outperformed previous methods for image and textual data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/13/2019

Satellite System Graph: Towards the Efficiency Up-Boundary of Graph-Based Approximate Nearest Neighbor Search

Approximate Nearest Neighbor Search (ANNS) in high dimensional space is ...
research
12/04/2021

Revisiting k-Nearest Neighbor Graph Construction on High-Dimensional Data : Experiments and Analyses

The k-nearest neighbor graph (KNNG) on high-dimensional data is a data s...
research
07/27/2023

Robust graphs for graph-based methods

Graph-based two-sample tests and graph-based change-point detection that...
research
02/15/2012

The Future of Search and Discovery in Big Data Analytics: Ultrametric Information Spaces

Consider observation data, comprised of n observation vectors with value...
research
08/17/2021

Limiting distributions of graph-based test statistics

Two-sample tests utilizing a similarity graph on observations are useful...
research
12/05/2021

Learning Query Expansion over the Nearest Neighbor Graph

Query Expansion (QE) is a well established method for improving retrieva...
research
10/04/2019

Targeted sampling from massive Blockmodel graphs with personalized PageRank

This paper provides statistical theory and intuition for Personalized Pa...

Please sign up or login with your details

Forgot password? Click here to reset