A Modern Perspective on Query Likelihood with Deep Generative Retrieval Models

06/25/2021
by   Oleg Lesota, et al.
0

Existing neural ranking models follow the text matching paradigm, where document-to-query relevance is estimated through predicting the matching score. Drawing from the rich literature of classical generative retrieval models, we introduce and formalize the paradigm of deep generative retrieval models defined via the cumulative probabilities of generating query terms. This paradigm offers a grounded probabilistic view on relevance estimation while still enabling the use of modern neural architectures. In contrast to the matching paradigm, the probabilistic nature of generative rankers readily offers a fine-grained measure of uncertainty. We adopt several current neural generative models in our framework and introduce a novel generative ranker (T-PGN), which combines the encoding capacity of Transformers with the Pointer Generator Network model. We conduct an extensive set of evaluation experiments on passage retrieval, leveraging the MS MARCO Passage Re-ranking and TREC Deep Learning 2019 Passage Re-ranking collections. Our results show the significantly higher performance of the T-PGN model when compared with other generative models. Lastly, we demonstrate that exploiting the uncertainty information of deep generative rankers opens new perspectives to query/collection understanding, and significantly improves the cut-off prediction task.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/06/2020

Beyond [CLS] through Ranking by Generation

Generative models for Information Retrieval, where ranking of documents ...
research
05/10/2021

Not All Relevance Scores are Equal: Efficient Uncertainty and Calibration Modeling for Deep Retrieval Models

In any ranking system, the retrieval model outputs a single score for a ...
research
08/01/2023

Generative Query Reformulation for Effective Adhoc Search

Performing automatic reformulations of a user's query is a popular parad...
research
06/22/2023

On the Robustness of Generative Retrieval Models: An Out-of-Distribution Perspective

Recently, we have witnessed generative retrieval increasingly gaining at...
research
06/20/2023

Generative Retrieval as Dense Retrieval

Generative retrieval is a promising new neural retrieval paradigm that a...
research
07/11/2022

Topic-Grained Text Representation-based Model for Document Retrieval

Document retrieval enables users to find their required documents accura...
research
08/23/2016

Unsupervised, Efficient and Semantic Expertise Retrieval

We introduce an unsupervised discriminative model for the task of retrie...

Please sign up or login with your details

Forgot password? Click here to reset