Vector Search with OpenAI Embeddings: Lucene Is All You Need

08/29/2023
by   Jimmy Lin, et al.
0

We provide a reproducible, end-to-end demonstration of vector search with OpenAI embeddings using Lucene on the popular MS MARCO passage ranking test collection. The main goal of our work is to challenge the prevailing narrative that a dedicated vector store is necessary to take advantage of recent advances in deep neural networks as applied to search. Quite the contrary, we show that hierarchical navigable small-world network (HNSW) indexes in Lucene are adequate to provide vector search capabilities in a standard bi-encoder architecture. This suggests that, from a simple cost-benefit analysis, there does not appear to be a compelling reason to introduce a dedicated vector store into a modern "AI stack" for search, since such applications have already received substantial investments in existing, widely deployed infrastructure.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/27/2022

DESSERT: An Efficient Algorithm for Vector Set Search with Vector Set Queries

We study the problem of vector set search with vector set queries. This ...
research
05/23/2023

Multi-Stream Extension of Variational Bayesian HMM Clustering (MS-VBx) for Combined End-to-End and Vector Clustering-based Diarization

Combining end-to-end neural speaker diarization (EEND) with vector clust...
research
06/10/2017

Visual Search at eBay

In this paper, we propose a novel end-to-end approach for scalable visua...
research
12/23/2021

Customising Ranking Models for Enterprise Search on Bilingual Click-Through Dataset

In this work, we provide the details about the process of establishing a...
research
10/20/2016

Using Fast Weights to Attend to the Recent Past

Until recently, research on artificial neural networks was largely restr...
research
02/02/2021

The Hitachi-JHU DIHARD III System: Competitive End-to-End Neural Diarization and X-Vector Clustering Systems Combined by DOVER-Lap

This paper provides a detailed description of the Hitachi-JHU system tha...

Please sign up or login with your details

Forgot password? Click here to reset