DESSERT: An Efficient Algorithm for Vector Set Search with Vector Set Queries

10/27/2022
by   Joshua Engels, et al.
0

We study the problem of vector set search with vector set queries. This task is analogous to traditional near-neighbor search, with the exception that both the query and each element in the collection are sets of vectors. We identify this problem as a core subroutine for many web applications and find that existing solutions are unacceptably slow. Towards this end, we present a new approximate search algorithm, DESSERT (DESSERT Effeciently Searches Sets of Embeddings via Retrieval Tables). DESSERT is a general tool with strong theoretical guarantees and excellent empirical performance. When we integrate DESSERT into ColBERT, a highly optimized state-of-the-art semantic search method, we find a 2-5x speedup on the MSMarco passage ranking task with minimal loss in recall, underscoring the effectiveness and practical applicability of our proposal.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/14/2021

Efficient Approximate Search for Sets of Vectors

We consider a similarity measure between two sets A and B of vectors, th...
research
08/29/2023

Vector Search with OpenAI Embeddings: Lucene Is All You Need

We provide a reproducible, end-to-end demonstration of vector search wit...
research
06/22/2021

Practical Near Neighbor Search via Group Testing

We present a new algorithm for the approximate near neighbor problem tha...
research
10/14/2021

Exposing Query Identification for Search Transparency

Search systems control the exposure of ranked content to searchers. In m...
research
07/16/2022

HQANN: Efficient and Robust Similarity Search for Hybrid Queries with Structured and Unstructured Constraints

The in-memory approximate nearest neighbor search (ANNS) algorithms have...
research
02/18/2019

RACE: Sub-Linear Memory Sketches for Approximate Near-Neighbor Search on Streaming Data

We demonstrate the first possibility of a sub-linear memory sketch for s...
research
12/10/2015

VRFP: On-the-fly Video Retrieval using Web Images and Fast Fisher Vector Products

VRFP is a real-time video retrieval framework based on short text input ...

Please sign up or login with your details

Forgot password? Click here to reset