Beyond NDCG: behavioral testing of recommender systems with RecList

11/18/2021
by   Patrick John Chia, et al.
0

As with most Machine Learning systems, recommender systems are typically evaluated through performance metrics computed over held-out data points. However, real-world behavior is undoubtedly nuanced: ad hoc error analysis and deployment-specific tests must be employed to ensure the desired quality in actual deployments. In this paper, we propose RecList, a behavioral-based testing methodology. RecList organizes recommender systems by use case and introduces a general plug-and-play procedure to scale up behavioral testing. We demonstrate its capabilities by analyzing known algorithms and black-box commercial systems, and we release RecList as an open source, extensible package for the community.

READ FULL TEXT
research
07/29/2019

Computing the Value of Data: Towards Applied Data Minimalism

We present an approach to compute the monetary value of individual data ...
research
04/14/2023

EvalRS 2023. Well-Rounded Recommender Systems For Real-World Deployments

EvalRS aims to bring together practitioners from industry and academia t...
research
08/28/2023

RecRec: Algorithmic Recourse for Recommender Systems

Recommender systems play an essential role in the choices people make in...
research
08/21/2018

Who is Really Affected by Fraudulent Reviews? An analysis of shilling attacks on recommender systems in real-world scenarios

We present the results of an initial analysis conducted on a real-life s...
research
09/19/2021

Scaling Enterprise Recommender Systems for Decentralization

Within decentralized organizations, the local demand for recommender sys...
research
06/19/2020

Feature Interaction Interpretability: A Case for Explaining Ad-Recommendation Systems via Neural Interaction Detection

Recommendation is a prevalent application of machine learning that affec...
research
05/24/2022

recommenderlab: An R Framework for Developing and Testing Recommendation Algorithms

Algorithms that create recommendations based on observed data have signi...

Please sign up or login with your details

Forgot password? Click here to reset