Principled Multi-Aspect Evaluation Measures of Rankings

12/01/2022
by   Maria Maistro, et al.
0

Information Retrieval evaluation has traditionally focused on defining principled ways of assessing the relevance of a ranked list of documents with respect to a query. Several methods extend this type of evaluation beyond relevance, making it possible to evaluate different aspects of a document ranking (e.g., relevance, usefulness, or credibility) using a single measure (multi-aspect evaluation). However, these methods either are (i) tailor-made for specific aspects and do not extend to other types or numbers of aspects, or (ii) have theoretical anomalies, e.g. assign maximum score to a ranking where all documents are labelled with the lowest grade with respect to all aspects (e.g., not relevant, not credible, etc.). We present a theoretically principled multi-aspect evaluation method that can be used for any number, and any type, of aspects. A thorough empirical evaluation using up to 5 aspects and a total of 425 runs officially submitted to 10 TREC tracks shows that our method is more discriminative than the state-of-the-art and overcomes theoretical limitations of the state-of-the-art.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/12/2020

Fine-Grained Relevance Annotations for Multi-Task Document Ranking and Question Answering

There are many existing retrieval and question answering datasets. Howev...
research
11/01/2020

Cheap IR Evaluation: Fewer Topics, No Relevance Judgements, and Crowdsourced Assessments

To evaluate Information Retrieval (IR) effectiveness, a possible approac...
research
12/14/2019

Leveraging Multi-Method Evaluation for Multi-Stakeholder Settings

In this paper, we focus on recommendation settings with multiple stakeho...
research
04/26/2020

Choppy: Cut Transformer For Ranked List Truncation

Work in information retrieval has traditionally focused on ranking and r...
research
08/23/2017

Evaluation Measures for Relevance and Credibility in Ranked Lists

Recent discussions on alternative facts, fake news, and post truth polit...
research
02/22/2023

Recall as a Measure of Ranking Robustness

Researchers use recall to evaluate rankings across a variety of retrieva...
research
06/09/2020

Directional Multivariate Ranking

User-provided multi-aspect evaluations manifest users' detailed feedback...

Please sign up or login with your details

Forgot password? Click here to reset