Leveraging Semantic and Lexical Matching to Improve the Recall of Document Retrieval Systems: A Hybrid Approach

10/02/2020
by Saar Kuzi, et al.

Search engines often follow a two-phase paradigm: in the first stage (the retrieval stage), an initial set of documents is retrieved, and in the second stage (the re-ranking stage), those documents are re-ranked to produce the final result list. Although previous work has shown that deep neural networks improve the re-ranking stage, there is little literature on using them to improve the retrieval stage. In this paper, we study the merits of combining deep neural network models and lexical models for the retrieval stage. We propose a hybrid approach that leverages both semantic (deep neural network-based) and lexical (keyword matching-based) retrieval models. An empirical study on a publicly available TREC collection demonstrates the effectiveness of our approach and sheds light on the different characteristics of the semantic approach, the lexical approach, and their combination.
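
To make the idea of a hybrid retrieval stage concrete, the following is a minimal sketch, not the method from the paper. It assumes that a lexical retriever and a semantic retriever have each already produced a ranked list of document IDs, and it builds the candidate set for the re-ranking stage by taking the union of the top-k documents from both lists. The function names `merge_candidates` and `recall`, the rank-interleaving merge rule, and the toy document IDs are all hypothetical, chosen only for illustration.

```python
from itertools import zip_longest
from typing import Iterable, List, Set


def merge_candidates(lexical: List[str], semantic: List[str], k: int) -> List[str]:
    """Union of the top-k documents from each ranked list, interleaved by rank.

    Assumption: this simple merge rule is illustrative only; the abstract
    does not specify how the two result lists are combined.
    """
    merged: List[str] = []
    seen: Set[str] = set()
    for lex_doc, sem_doc in zip_longest(lexical[:k], semantic[:k]):
        for doc in (lex_doc, sem_doc):
            if doc is not None and doc not in seen:
                seen.add(doc)
                merged.append(doc)
    return merged


def recall(retrieved: Iterable[str], relevant: Set[str]) -> float:
    """Fraction of the relevant documents that appear in the retrieved set."""
    if not relevant:
        return 0.0
    return len(set(retrieved) & relevant) / len(relevant)


# Toy ranked lists (hypothetical document IDs).
lexical_run = ["d1", "d2", "d3"]    # e.g. from a keyword-matching model
semantic_run = ["d4", "d2", "d5"]   # e.g. from an embedding-based model
relevant_docs = {"d1", "d4", "d5"}

hybrid_run = merge_candidates(lexical_run, semantic_run, k=3)
print(hybrid_run)
print(recall(lexical_run, relevant_docs),
      recall(semantic_run, relevant_docs),
      recall(hybrid_run, relevant_docs))
```

In this toy setup each retriever misses relevant documents that the other finds, so the merged candidate set has higher recall than either list alone, at the cost of handing more documents to the re-ranker.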

research
04/29/2020

Complementing Lexical Retrieval with Semantic Residual Embedding

Information retrieval traditionally has relied on lexical matching signa...
research
06/28/2018

Beyond Precision: A Study on Recall of Initial Retrieval with Neural Representations

Vocabulary mismatch is a central problem in information retrieval (IR), ...
research
07/01/2023

Effective Matching of Patients to Clinical Trials using Entity Extraction and Neural Re-ranking

Clinical trials (CTs) often fail due to inadequate patient recruitment. ...
research
05/16/2023

Hybrid and Collaborative Passage Reranking

In passage retrieval system, the initial passage retrieval results may b...
research
06/29/2023

Exploring the Representation Power of SPLADE Models

The SPLADE (SParse Lexical AnD Expansion) model is a highly effective ap...
research
12/20/2022

HYRR: Hybrid Infused Reranking for Passage Retrieval

We present Hybrid Infused Reranking for Passages Retrieval (HYRR), a fra...
research
10/21/2022

An Analysis of Fusion Functions for Hybrid Retrieval

We study hybrid search in text retrieval where lexical and semantic sear...