Boosted Dense Retriever

12/14/2021
by   Patrick Lewis, et al.
0

We propose DrBoost, a dense retrieval ensemble inspired by boosting. DrBoost is trained in stages: each component model is learned sequentially and specialized by focusing only on retrieval mistakes made by the current ensemble. The final representation is the concatenation of the output vectors of all the component models, making it a drop-in replacement for standard dense retrievers at test time. DrBoost enjoys several advantages compared to standard dense retrieval models. It produces representations which are 4x more compact, while delivering comparable retrieval results. It also performs surprisingly well under approximate search with coarse quantization, reducing latency and bandwidth needs by another 4x. In practice, this can make the difference between serving indices from disk versus from memory, paving the way for much cheaper deployments.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/31/2023

Lexically-Accelerated Dense Retrieval

Retrieval approaches that score documents based on learned dense vectors...
research
08/13/2021

On Single and Multiple Representations in Dense Passage Retrieval

The advent of contextualised language models has brought gains in search...
research
10/12/2021

Learning Discrete Representations via Constrained Clustering for Effective and Efficient Dense Retrieval

Dense Retrieval (DR) has achieved state-of-the-art first-stage ranking e...
research
06/20/2022

A Dense Representation Framework for Lexical and Semantic Matching

Lexical and semantic matching capture different successful approaches to...
research
10/25/2022

Bridging the Training-Inference Gap for Dense Phrase Retrieval

Building dense retrievers requires a series of standard procedures, incl...
research
08/29/2022

LED: Lexicon-Enlightened Dense Retriever for Large-Scale Retrieval

Retrieval models based on dense representations in semantic space have b...
research
05/18/2023

Advancing Full-Text Search Lemmatization Techniques with Paradigm Retrieval from OpenCorpora

In this paper, we unveil a groundbreaking method to amplify full-text se...

Please sign up or login with your details

Forgot password? Click here to reset