An Axiomatic Analysis of Diversity Evaluation Metrics: Introducing the Rank-Biased Utility Metric

05/07/2018
by   Enrique Amigó, et al.
0

Many evaluation metrics have been defined to evaluate the effectiveness ad-hoc retrieval and search result diversification systems. However, it is often unclear which evaluation metric should be used to analyze the performance of retrieval systems given a specific task. Axiomatic analysis is an informative mechanism to understand the fundamentals of metrics and their suitability for particular scenarios. In this paper, we define a constraint-based axiomatic framework to study the suitability of existing metrics in search result diversification scenarios. The analysis informed the definition of Rank-Biased Utility (RBU) -- an adaptation of the well-known Rank-Biased Precision metric -- that takes into account redundancy and the user effort associated to the inspection of documents in the ranking. Our experiments over standard diversity evaluation campaigns show that the proposed metric captures quality criteria reflected by different metrics, being suitable in the absence of knowledge about particular features of the scenario under study.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/25/2022

Offline Retrieval Evaluation Without Evaluation Metrics

Offline evaluation of information retrieval and recommendation has tradi...
research
02/13/2022

An Analysis of Variations in the Effectiveness of Query Performance Prediction

A query performance predictor estimates the retrieval effectiveness of a...
research
07/11/2018

A Formal Account of Effectiveness Evaluation and Ranking Fusion

This paper proposes a theoretical framework which models the information...
research
08/16/2021

Computational extraction of metrics and normative data on the alternative uses test on a set of 420 household objects

The Alternative Uses Test (AUT) is a classical test which has long been ...
research
04/05/2023

The Right Variety: Improving Expressive Range Analysis with Metric Selection Methods

Expressive Range Analysis (ERA), an approach for visualising the output ...
research
03/26/2023

An Evaluation of Memory Optimization Methods for Training Neural Networks

As models continue to grow in size, the development of memory optimizati...
research
06/13/2023

Best-Case Retrieval Evaluation: Improving the Sensitivity of Reciprocal Rank with Lexicographic Precision

Across a variety of ranking tasks, researchers use reciprocal rank to me...

Please sign up or login with your details

Forgot password? Click here to reset