Query Performance Prediction for Neural IR: Are We There Yet?

02/20/2023
by   Guglielmo Faggioli, et al.
0

Evaluation in Information Retrieval relies on post-hoc empirical procedures, which are time-consuming and expensive operations. To alleviate this, Query Performance Prediction (QPP) models have been developed to estimate the performance of a system without the need for human-made relevance judgements. Such models, usually relying on lexical features from queries and corpora, have been applied to traditional sparse IR methods - with various degrees of success. With the advent of neural IR and large Pre-trained Language Models, the retrieval paradigm has significantly shifted towards more semantic signals. In this work, we study and analyze to what extent current QPP models can predict the performance of such systems. Our experiments consider seven traditional bag-of-words and seven BERT-based IR approaches, as well as nineteen state-of-the-art QPPs evaluated on two collections, Deep Learning '19 and Robust '04. Our findings show that QPPs perform statistically significantly worse on neural IR systems. In settings where semantic signals are prominent (e.g., passage retrieval), their performance on neural models drops by as much as 10 scenarios, QPPs fail to predict performance for neural IR systems on those queries where they differ from traditional approaches the most.

READ FULL TEXT
research
09/03/2020

Multi-Perspective Semantic Information Retrieval

Information Retrieval (IR) is the task of obtaining pieces of data (such...
research
02/07/2018

To Phrase or Not to Phrase - Impact of User versus System Term Dependence Upon Retrieval

When submitting queries to information retrieval (IR) systems, users oft...
research
07/08/2022

An Efficiency Study for SPLADE Models

Latency and efficiency issues are often overlooked when evaluating IR mo...
research
06/15/2023

Prompt Performance Prediction for Generative IR

The ability to predict the performance of a query in Information Retriev...
research
05/24/2022

GraphQ IR: Unifying Semantic Parsing of Graph Query Language with Intermediate Representation

Subject to the semantic gap lying between natural and formal language, n...
research
11/25/2021

Evaluating the Robustness of Retrieval Pipelines with Query Variation Generators

Heavily pre-trained transformers for language modelling, such as BERT, h...
research
04/07/2019

Scalable Change Retrieval Using Deep 3D Neural Codes

We present a novel scalable framework for image change detection (ICD) f...

Please sign up or login with your details

Forgot password? Click here to reset