The Simplest Thing That Can Possibly Work: Pseudo-Relevance Feedback Using Text Classification

04/18/2019
by Jimmy Lin, et al.

Motivated by recent commentary that has questioned today's pursuit of ever-more complex models and mathematical formalisms in applied machine learning and whether meaningful empirical progress is actually being made, this paper tries to tackle the decades-old problem of pseudo-relevance feedback with "the simplest thing that can possibly work". I present a technique based on training a document relevance classifier for each information need using pseudo-labels from an initial ranked list and then applying the classifier to rerank the retrieved documents. Experiments demonstrate significant improvements across a number of newswire collections, with initial rankings supplied by "bag of words" BM25 as well as from a well-tuned query expansion model. While this simple technique draws elements from several well-known threads in the literature, to my knowledge this exact combination has not previously been proposed and evaluated.
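
The reranking idea described in the abstract can be sketched in a few lines of plain Python. This is a minimal illustration under stated assumptions, not the paper's implementation: the abstract does not say how pseudo-labels are chosen, which classifier is used, or how scores are combined. Here the top k documents of the initial list are treated as pseudo-positives and the bottom k as pseudo-negatives, the classifier is a small logistic regression over length-normalized bag-of-words features, and the final score is a linear interpolation (weight alpha) of the classifier probability and the min-max normalized initial score. All function names, parameters, and the toy documents are hypothetical.

```python
import math
from collections import Counter


def featurize(text):
    """Length-normalized bag-of-words term frequencies (assumed feature choice)."""
    counts = Counter(text.lower().split())
    total = sum(counts.values()) or 1
    return {term: n / total for term, n in counts.items()}


def sigmoid(z):
    # Clamp to keep math.exp well inside floating-point range.
    z = max(-30.0, min(30.0, z))
    return 1.0 / (1.0 + math.exp(-z))


def predict(model, feats):
    """Probability that a document is relevant under the trained model."""
    weights, bias = model
    return sigmoid(bias + sum(weights.get(t, 0.0) * v for t, v in feats.items()))


def train_logreg(examples, epochs=200, lr=1.0):
    """Plain SGD logistic regression over sparse dict features."""
    weights, bias = {}, 0.0
    for _ in range(epochs):
        for feats, label in examples:
            err = label - predict((weights, bias), feats)
            bias += lr * err
            for t, v in feats.items():
                weights[t] = weights.get(t, 0.0) + lr * err * v
    return weights, bias


def rerank(ranked, k=2, alpha=0.5):
    """Rerank one query's results.

    ranked: list of (doc_id, text, score) tuples, best first.
    k:      number of top / bottom documents used as pseudo-labels (assumption).
    alpha:  weight on the classifier score in the interpolation (assumption).
    """
    feats = [featurize(text) for _, text, _ in ranked]
    # Pseudo-labels: top k are "relevant", bottom k are "non-relevant".
    examples = [(f, 1.0) for f in feats[:k]] + [(f, 0.0) for f in feats[-k:]]
    model = train_logreg(examples)  # one classifier per information need

    scores = [s for _, _, s in ranked]
    lo, hi = min(scores), max(scores)
    span = (hi - lo) or 1.0
    combined = []
    for (doc_id, _, s), f in zip(ranked, feats):
        initial = (s - lo) / span  # min-max normalized initial score
        combined.append((alpha * predict(model, f) + (1 - alpha) * initial, doc_id))
    combined.sort(key=lambda pair: -pair[0])  # stable sort keeps ties in input order
    return [doc_id for _, doc_id in combined]


# Toy initial ranking for an ambiguous query ("jaguar"); scores are made up.
toy = [
    ("d1", "jaguar cat habitat rainforest", 10.0),
    ("d2", "jaguar cat prey habitat", 9.0),
    ("d3", "jaguar car dealer price", 8.0),
    ("d4", "jaguar cat rainforest prey", 7.0),
    ("d5", "jaguar car engine price", 6.0),
    ("d6", "jaguar car dealer engine", 5.0),
]
print(rerank(toy))
```

In the toy list, d3 shares its vocabulary with the pseudo-negatives and d4 with the pseudo-positives, so the classifier has a clear signal for moving d4 ahead of d3 despite its lower initial score. In a real setting the initial scores would come from a retrieval model such as BM25, and k and alpha would need tuning.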

research
04/25/2023

Generative Relevance Feedback with Large Language Models

Current query expansion models use pseudo-relevance feedback to improve ...
research
06/29/2023

Re-Rank - Expand - Repeat: Adaptive Query Expansion for Document Retrieval Using Words and Entities

Sparse and dense pseudo-relevance feedback (PRF) approaches perform poor...
research
08/13/2021

GQE-PRF: Generative Query Expansion with Pseudo-Relevance Feedback

Query expansion with pseudo-relevance feedback (PRF) is a powerful appro...
research
11/08/2018

Deep Neural Networks for Query Expansion using Word Embeddings

Query expansion is a method for alleviating the vocabulary mismatch prob...
research
08/25/2021

Pseudo Relevance Feedback with Deep Language Models and Dense Retrievers: Successes and Pitfalls

Pseudo Relevance Feedback (PRF) is known to improve the effectiveness of...
research
05/12/2022

How does Feedback Signal Quality Impact Effectiveness of Pseudo Relevance Feedback for Passage Retrieval?

Pseudo-Relevance Feedback (PRF) assumes that the top results retrieved b...
