Evaluation Metrics for Item Recommendation under Sampling

12/04/2019
by Steffen Rendle, et al.

The task of item recommendation requires ranking a large catalogue of items given a context. Item recommendation algorithms are evaluated using ranking metrics that depend on the positions of relevant items. To speed up the computation of metrics, recent work often uses sampled metrics, where only a smaller set of random items and the relevant items are ranked. This paper investigates sampled metrics in more detail and shows that they are inconsistent with their exact versions: sampled metrics do not preserve relative statements, e.g., 'algorithm A is better than B', not even in expectation. Moreover, the smaller the sampling size, the smaller the difference between metrics, and for very small sampling sizes, all metrics collapse to the AUC metric.
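
The idea of a sampled metric, and the collapse to AUC at the smallest sampling size, can be illustrated with a minimal Python sketch. It is not the paper's code. It assumes a single relevant item per evaluation, identifies items by their exact 1-based rank, and draws negatives uniformly without replacement; all function names are hypothetical. With one sampled negative, the sampled rank is either 1 or 2, so the expected value of any metric becomes an affine function of AUC.

```python
import math
import random


def recall_at_k(rank, k):
    """1.0 if the single relevant item is in the top k (1-based rank), else 0.0."""
    return 1.0 if rank <= k else 0.0


def ndcg_at_k(rank, k):
    """NDCG@k for a single relevant item at a 1-based rank."""
    return 1.0 / math.log2(rank + 1) if rank <= k else 0.0


def auc(rank, n_items):
    """Fraction of the other n_items - 1 items ranked below the relevant item."""
    return (n_items - rank) / (n_items - 1)


def sampled_rank(exact_rank, n_items, n_samples, rng):
    """Rank of the relevant item among itself and n_samples random other items.

    Assumption: negatives are sampled uniformly without replacement, and each
    item is identified by its exact rank in the full catalogue.
    """
    others = [r for r in range(1, n_items + 1) if r != exact_rank]
    sampled = rng.sample(others, n_samples)
    # Only sampled items that outrank the relevant item push it down.
    return 1 + sum(1 for r in sampled if r < exact_rank)


def expected_metric_one_negative(exact_rank, n_items, metric):
    """Exact expectation of a sampled metric when one negative is drawn."""
    others = [r for r in range(1, n_items + 1) if r != exact_rank]
    # With one negative, the sampled rank is 2 if the negative wins, else 1.
    values = [metric(2 if r < exact_rank else 1) for r in others]
    return sum(values) / len(values)


if __name__ == "__main__":
    n_items, exact_rank = 100, 10
    a = auc(exact_rank, n_items)
    exp_recall = expected_metric_one_negative(
        exact_rank, n_items, lambda r: recall_at_k(r, 1))
    exp_ndcg = expected_metric_one_negative(
        exact_rank, n_items, lambda r: ndcg_at_k(r, 2))
    print("AUC:", a)
    print("expected sampled Recall@1, one negative:", exp_recall)
    print("expected sampled NDCG@2, one negative:", exp_ndcg)
    print("one draw of the sampled rank, 20 negatives:",
          sampled_rank(exact_rank, n_items, 20, random.Random(0)))
```

In this sketch the expected sampled Recall@1 equals AUC exactly, and the expected sampled NDCG@2 equals M(2) + (M(1) - M(2)) * AUC, where M(1) and M(2) are the metric values at ranks 1 and 2. Both therefore order algorithms the same way AUC does, regardless of how the exact metrics would order them.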

research
07/27/2021

A Case Study on Sampling Strategies for Evaluating Neural Sequential Item Recommendation Models

At the present time, sequential item recommendation models are compared ...
research
02/07/2023

On the Theories Behind Hard Negative Sampling for Recommendation

Negative sampling has been heavily used to train recommender models on l...
research
11/28/2022

Towards Reliable Item Sampling for Recommendation Evaluation

Since Rendle and Krichene argued that commonly used sampling-based evalu...
research
03/02/2021

On Estimating Recommendation Evaluation Metrics under Sampling

Since the recent study (Krichene and Rendle 2020) done by Krichene and R...
research
10/09/2020

Revisiting Alternative Experimental Settings for Evaluating Top-N Item Recommendation Algorithms

Top-N item recommendation has been a widely studied task from implicit f...
research
07/27/2023

On (Normalised) Discounted Cumulative Gain as an Offline Evaluation Metric for Top-n Recommendation

Approaches to recommendation are typically evaluated in one of two ways:...
research
06/27/2019

User Validation of Recommendation Serendipity Metrics

Though it has been recognized that recommending serendipitous (i.e., sur...
