How robust is MovieLens? A dataset analysis for recommender systems

09/12/2019
by Anne-Marie Tousch, et al.

Research publication requires public datasets. In recommender systems, a few datasets are widely used to compare algorithms against a supposedly common benchmark. The problem is that, for various reasons, these datasets are heavily preprocessed, which makes results difficult to compare across papers. This paper makes explicit the variety of preprocessing and evaluation protocols in order to test the robustness of a dataset (or its lack of flexibility). While robustness is good for comparing results across papers, for flexible datasets we propose a method to select a preprocessing protocol and share results more transparently.
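To make the preprocessing concern concrete, here is a minimal sketch of how two protocols can turn the same raw ratings into different datasets. It is not the paper's method. The toy ratings, the helper names (`binarize`, `k_core`, `summarize`), and the two steps shown (a rating threshold followed by a k-core filter) are assumptions chosen for illustration only.

```python
from collections import Counter

# Toy (user, item, rating) triples, invented for illustration; not MovieLens data.
RATINGS = [
    ("u1", "i1", 5), ("u1", "i2", 3), ("u1", "i3", 4),
    ("u2", "i1", 4), ("u2", "i2", 2), ("u2", "i3", 5),
    ("u3", "i1", 5), ("u3", "i3", 1),
    ("u4", "i4", 4),
]


def binarize(ratings, threshold):
    """Keep only (user, item) pairs whose rating is >= threshold."""
    return [(u, i) for u, i, r in ratings if r >= threshold]


def k_core(pairs, k):
    """Drop users and items with fewer than k interactions until nothing changes."""
    pairs = list(pairs)
    while True:
        user_counts = Counter(u for u, _ in pairs)
        item_counts = Counter(i for _, i in pairs)
        kept = [
            (u, i) for u, i in pairs
            if user_counts[u] >= k and item_counts[i] >= k
        ]
        if len(kept) == len(pairs):
            return kept
        pairs = kept


def summarize(pairs):
    """Count distinct users, distinct items, and interactions."""
    return {
        "users": len({u for u, _ in pairs}),
        "items": len({i for _, i in pairs}),
        "interactions": len(pairs),
    }


# Hypothetical protocol A: keep every rating, then apply a 2-core filter.
protocol_a = summarize(k_core(binarize(RATINGS, threshold=1), k=2))
# Hypothetical protocol B: keep ratings >= 4 only, then apply the same filter.
protocol_b = summarize(k_core(binarize(RATINGS, threshold=4), k=2))

print(protocol_a)
print(protocol_b)
```

On this toy data the two protocols keep different numbers of users and interactions, so metrics computed after each are not measured on the same benchmark.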

research
08/25/2022

Lib-SibGMU – A University Library Circulation Dataset for Recommender Systems Development

We opensource under CC BY 4.0 license Lib-SibGMU - a university library ...
research
08/15/2023

Impression-Aware Recommender Systems

Novel data sources bring new opportunities to improve the quality of rec...
research
09/12/2020

FuxiCTR: An Open Benchmark for Click-Through Rate Prediction

In many applications, such as recommender systems, online advertising, a...
research
02/04/2019

Recommender Systems Notation: Proposed Common Notation for Teaching and Research

As the field of recommender systems has developed, authors have used a m...
research
02/20/2023

Mysterious and Manipulative Black Boxes: A Qualitative Analysis of Perceptions on Recommender Systems

Recommender systems are used to provide relevant suggestions on various ...
research
10/11/2018

A Distributed and Accountable Approach to Offline Recommender Systems Evaluation

Different software tools have been developed with the purpose of perform...
research
05/04/2019

On the Difficulty of Evaluating Baselines: A Study on Recommender Systems

Numerical evaluations with comparisons to baselines play a central role ...
