Query expansion with artificially generated texts

12/16/2020
by   Vincent Claveau, et al.
0

A well-known way to improve the performance of document retrieval is to expand the user's query. Several approaches have been proposed in the literature, and some of them are considered as yielding state-of-the-art results in IR. In this paper, we explore the use of text generation to automatically expand the queries. We rely on a well-known neural generative model, GPT-2, that comes with pre-trained models for English but can also be fine-tuned on specific corpora. Through different experiments, we show that text generation is a very effective way to improve the performance of an IR system, with a large margin (+10 baselines also relying on query expansion (LM+RM3). This conceptually simple approach can easily be implemented on any IR system thanks to the availability of GPT code and models.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/03/2021

Unsupervised Document Expansion for Information Retrieval with Stochastic Text Generation

One of the challenges in information retrieval (IR) is the vocabulary mi...
research
08/13/2021

GQE-PRF: Generative Query Expansion with Pseudo-Relevance Feedback

Query expansion with pseudo-relevance feedback (PRF) is a powerful appro...
research
02/06/2018

Texygen: A Benchmarking Platform for Text Generation Models

We introduce Texygen, a benchmarking platform to support research on ope...
research
03/05/2020

RecipeGPT: Generative Pre-training Based Cooking Recipe Generation and Evaluation System

Interests in the automatic generation of cooking recipes have been growi...
research
09/02/2021

ConQX: Semantic Expansion of Spoken Queries for Intent Detection based on Conditioned Text Generation

Intent detection of spoken queries is a challenging task due to their no...
research
11/22/2022

Linear Interpolation In Parameter Space is Good Enough for Fine-Tuned Language Models

The simplest way to obtain continuous interpolation between two points i...
research
07/14/2023

QontSum: On Contrasting Salient Content for Query-focused Summarization

Query-focused summarization (QFS) is a challenging task in natural langu...

Please sign up or login with your details

Forgot password? Click here to reset