Revisiting the Inverted Indices for Billion-Scale Approximate Nearest Neighbors

02/07/2018
by   Dmitry Baranchuk, et al.
0

This work addresses the problem of billion-scale nearest neighbor search. The state-of-the-art retrieval systems for billion-scale databases are currently based on the inverted multi-index, the recently proposed generalization of the inverted index structure. The multi-index provides a very fine-grained partition of the feature space that allows extracting concise and accurate short-lists of candidates for the search queries. In this paper, we argue that the potential of the simple inverted index was not fully exploited in previous works and advocate its usage both for the highly-entangled deep descriptors and relatively disentangled SIFT descriptors. We introduce a new retrieval system that is based on the inverted index and outperforms the multi-index by a large margin for the same memory consumption and construction complexity. For example, our system achieves the state-of-the-art recall rates up to six times faster on the dataset of one billion deep descriptors compared to the efficient implementation of the inverted multi-index from the FAISS library.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/19/2016

Large scale near-duplicate image retrieval using Triples of Adjacent Ranked Features (TARF) with embedded geometric information

Most approaches to large-scale image retrieval are based on the construc...
research
11/05/2021

SPANN: Highly-efficient Billion-scale Approximate Nearest Neighbor Search

The in-memory algorithms for approximate nearest neighbor search (ANNS) ...
research
05/20/2021

FreshDiskANN: A Fast and Accurate Graph-Based ANN Index for Streaming Similarity Search

Approximate nearest neighbor search (ANNS) is a fundamental building blo...
research
10/22/2022

OOD-DiskANN: Efficient and Scalable Graph ANNS for Out-of-Distribution Queries

State-of-the-art algorithms for Approximate Nearest Neighbor Search (ANN...
research
06/15/2018

Efficient Nearest Neighbors Search for Large-Scale Landmark Recognition

The problem of landmark recognition has achieved excellent results in sm...
research
09/27/2017

Effective Image Retrieval via Multilinear Multi-index Fusion

Multi-index fusion has demonstrated impressive performances in retrieval...
research
08/29/2023

CAPS: A Practical Partition Index for Filtered Similarity Search

With the surging popularity of approximate near-neighbor search (ANNS), ...

Please sign up or login with your details

Forgot password? Click here to reset