General and Practical Tuning Method for Off-the-Shelf Graph-Based Index: SISAP Indexing Challenge Report by Team UTokyo

09/01/2023
by   Yutaro Oguri, et al.
0

Despite the efficacy of graph-based algorithms for Approximate Nearest Neighbor (ANN) searches, the optimal tuning of such systems remains unclear. This study introduces a method to tune the performance of off-the-shelf graph-based indexes, focusing on the dimension of vectors, database size, and entry points of graph traversal. We utilize a black-box optimization algorithm to perform integrated tuning to meet the required levels of recall and Queries Per Second (QPS). We applied our approach to Task A of the SISAP 2023 Indexing Challenge and got second place in the 10M and 30M tracks. It improves performance substantially compared to brute force methods. This research offers a universally applicable tuning method for graph-based indexes, extending beyond the specific conditions of the competition to broader uses.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/01/2019

Graph-based Nearest Neighbor Search: From Practice to Theory

Graph-based approaches are empirically shown to be very successful for a...
research
05/20/2021

FreshDiskANN: A Fast and Accurate Graph-Based ANN Index for Streaming Similarity Search

Approximate nearest neighbor search (ANNS) is a fundamental building blo...
research
06/22/2022

FINGER: Fast Inference for Graph-based Approximate Nearest Neighbor Search

Approximate K-Nearest Neighbor Search (AKNNS) has now become ubiquitous ...
research
06/22/2022

Proximity Graph Maintenance for Fast Online Nearest Neighbor Search

Approximate Nearest Neighbor (ANN) search is a fundamental technique for...
research
07/19/2023

Fast Approximate Nearest Neighbor Search with a Dynamic Exploration Graph using Continuous Refinement

For approximate nearest neighbor search, graph-based algorithms have sho...
research
05/07/2023

Scaling Graph-Based ANNS Algorithms to Billion-Size Datasets: A Comparative Analysis

Algorithms for approximate nearest-neighbor search (ANNS) have been the ...
research
06/04/2020

Sample Efficient Graph-Based Optimization with Noisy Observations

We study sample complexity of optimizing "hill-climbing friendly" functi...

Please sign up or login with your details

Forgot password? Click here to reset