Similarity search in the blink of an eye with compressed indices

04/07/2023
by   Cecilia Aguerrebere, et al.
0

Nowadays, data is represented by vectors. Retrieving those vectors, among millions and billions, that are similar to a given query is a ubiquitous problem of relevance for a wide range of applications. In this work, we present new techniques for creating faster and smaller indices to run these searches. To this end, we introduce a novel vector compression method, Locally-adaptive Vector Quantization (LVQ), that simultaneously reduces memory footprint and improves search performance, with minimal impact on search accuracy. LVQ is designed to work optimally in conjunction with graph-based indices, reducing their effective bandwidth while enabling random-access-friendly fast similarity computations. Our experimental results show that LVQ, combined with key optimizations for graph-based indices in modern datacenter systems, establishes the new state of the art in terms of performance and memory footprint. For billions of vectors, LVQ outcompetes the second-best alternatives: (1) in the low-memory regime, by up to 20.7x in throughput with up to a 3x memory footprint reduction, and (2) in the high-throughput regime by 5.8x with 1.4x less memory.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/20/2021

FreshDiskANN: A Fast and Accurate Graph-Based ANN Index for Streaming Similarity Search

Approximate nearest neighbor search (ANNS) is a fundamental building blo...
research
09/11/2018

Zoom: SSD-based Vector Search for Optimizing Accuracy, Latency and Memory

With the advancement of machine learning and deep learning, vector searc...
research
10/22/2022

OOD-DiskANN: Efficient and Scalable Graph ANNS for Out-of-Distribution Queries

State-of-the-art algorithms for Approximate Nearest Neighbor Search (ANN...
research
06/22/2022

FINGER: Fast Inference for Graph-based Approximate Nearest Neighbor Search

Approximate K-Nearest Neighbor Search (AKNNS) has now become ubiquitous ...
research
12/13/2021

Does Fully Homomorphic Encryption Need Compute Acceleration?

Fully Homomorphic Encryption (FHE) allows arbitrarily complex computatio...
research
12/10/2014

Memory vectors for similarity search in high-dimensional spaces

We study an indexing architecture to store and search in a database of h...
research
03/03/2023

Extreme-scale many-against-many protein similarity search

Similarity search is one of the most fundamental computations that are r...

Please sign up or login with your details

Forgot password? Click here to reset