It's Time to Consider "Time" when Evaluating Recommender-System Algorithms [Proposal]

08/28/2017
by   Joeran Beel, et al.
0

In this position paper, we question the current practice of calculating evaluation metrics for recommender systems as single numbers (e.g. precision p=.28 or mean absolute error MAE = 1.21). We argue that single numbers express only average effectiveness over a usually rather long period (e.g. a year or even longer), which provides only a vague and static view of the data. We propose that recommender-system researchers should instead calculate metrics for time-series such as weeks or months, and plot the results in e.g. a line chart. This way, results show how algorithms' effectiveness develops over time, and hence the results allow drawing more meaningful conclusions about how an algorithm will perform in the future. In this paper, we explain our reasoning, provide an example to illustrate our reasoning and present suggestions for what the community should do next.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/12/2023

Distributionally-Informed Recommender System Evaluation

Current practice for evaluating recommender systems typically focuses on...
research
09/10/2018

Off-line vs. On-line Evaluation of Recommender Systems in Small E-commerce

In this paper, we present our work towards comparing on-line and off-lin...
research
06/29/2019

One Size Does Not Fit All: Modeling Users' Personal Curiosity in Recommender Systems

Today's recommender systems are criticized for recommending items that a...
research
06/26/2022

Quality Metrics in Recommender Systems: Do We Calculate Metrics Consistently?

Offline evaluation is a popular approach to determine the best algorithm...
research
10/06/2021

Optimized Recommender Systems with Deep Reinforcement Learning

Recommender Systems have been the cornerstone of online retailers. Tradi...
research
07/21/2022

Efficient Graph-Friendly COCO Metric Computation for Train-Time Model Evaluation

Evaluating the COCO mean average precision (MaP) and COCO recall metrics...
research
07/19/2022

Group Validation in Recommender Systems: Framework for Multi-layer Performance Evaluation

Interpreting the performance results of models that attempt to realize u...

Please sign up or login with your details

Forgot password? Click here to reset