Document Expansion by Query Prediction

04/17/2019
by Rodrigo Nogueira, et al.

One technique to improve the retrieval effectiveness of a search engine is to expand documents with terms that are related to or representative of the documents' content. From the perspective of a question answering system, a useful representation of a document might comprise the questions it can potentially answer. Following this observation, we propose a simple method that predicts which queries will be issued for a given document and then expands the document with those predictions. The predictions are made by a vanilla sequence-to-sequence model trained with supervised learning on a dataset of pairs of queries and relevant documents. By combining our method with a highly effective re-ranking component, we achieve the state of the art in two retrieval tasks. In a latency-critical regime, retrieval results alone (without the re-ranking component) approach the effectiveness of more computationally expensive neural re-rankers while taking only a fraction of the query latency.
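The expansion step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: `stub_predictor` is a hypothetical placeholder for the trained sequence-to-sequence model, `term_overlap` is a toy lexical score standing in for a real retrieval function, and the example document and queries are invented for illustration.

```python
from collections import Counter
from typing import Callable, List


def expand_document(doc: str, predict_queries: Callable[[str], List[str]]) -> str:
    """Append predicted queries to the document text before indexing."""
    predictions = predict_queries(doc)
    return " ".join([doc] + predictions)


def term_overlap(query: str, text: str) -> int:
    """Toy lexical score: total occurrences of the query terms in the text."""
    counts = Counter(text.lower().split())
    return sum(counts[term] for term in query.lower().split())


def stub_predictor(doc: str) -> List[str]:
    # Hypothetical placeholder: a real system would generate these queries
    # with a sequence-to-sequence model trained on (query, document) pairs.
    return ["what causes tides", "why does the sea level rise and fall"]


# Invented example data, used only to show the effect of expansion.
doc = "The gravitational pull of the moon moves ocean water twice a day."
expanded = expand_document(doc, stub_predictor)
query = "what causes tides"

# The original document shares no terms with the query; the expanded one does.
print(term_overlap(query, doc), term_overlap(query, expanded))
```

Because the expansion happens once at indexing time, query-time retrieval cost is unchanged apart from the slightly longer documents, which is consistent with the latency claim in the abstract.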


