A Distributed and Accountable Approach to Offline Recommender Systems Evaluation

10/11/2018
by   Diego Monti, et al.
0

Different software tools have been developed with the purpose of performing offline evaluations of recommender systems. However, the results obtained with these tools may be not directly comparable because of subtle differences in the experimental protocols and metrics. Furthermore, it is difficult to analyze in the same experimental conditions several algorithms without disclosing their implementation details. For these reasons, we introduce RecLab, an open source software for evaluating recommender systems in a distributed fashion. By relying on consolidated web protocols, we created RESTful APIs for training and querying recommenders remotely. In this way, it is possible to easily integrate into the same toolkit algorithms realized with different technologies. In details, the experimenter can perform an evaluation by simply visiting a web interface provided by RecLab. The framework will then interact with all the selected recommenders and it will compute and display a comprehensive set of measures, each representing a different metric. The results of all experiments are permanently stored and publicly available in order to support accountability and comparative analyses.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/11/2018

Sequeval: A Framework to Assess and Benchmark Sequence-based Recommender Systems

In this paper, we present sequeval, a software tool capable of performin...
research
09/10/2018

The LKPY Package for Recommender Systems Experiments: Next-Generation Tools and Lessons Learned from the LensKit Project

Since 2010, we have built and maintained LensKit, an open-source toolkit...
research
07/27/2023

Widespread Flaws in Offline Evaluation of Recommender Systems

Even though offline evaluation is just an imperfect proxy of online perf...
research
11/02/2022

Where Do We Go From Here? Guidelines For Offline Recommender Evaluation

Various studies in recent years have pointed out large issues in the off...
research
09/19/2020

Modeling Online Behavior in Recommender Systems: The Importance of Temporal Context

Simulating online recommender system performance is notoriously difficul...
research
06/26/2022

Quality Metrics in Recommender Systems: Do We Calculate Metrics Consistently?

Offline evaluation is a popular approach to determine the best algorithm...
research
09/12/2019

How robust is MovieLens? A dataset analysis for recommender systems

Research publication requires public datasets. In recommender systems, s...

Please sign up or login with your details

Forgot password? Click here to reset