Distributionally-Informed Recommender System Evaluation

09/12/2023
by Michael D. Ekstrand, et al.

Current practice for evaluating recommender systems typically focuses on point estimates of user-oriented effectiveness metrics or business metrics, sometimes combined with additional metrics for considerations such as diversity and novelty. In this paper, we argue that researchers and practitioners need to attend more closely to the various distributions that arise from a recommender system (or other information access system) and to the sources of uncertainty that lead to these distributions. One immediate implication of our argument is that both researchers and practitioners must more thoroughly report and examine the distribution of utility between and within different stakeholder groups. However, distributions of various forms arise in many more aspects of the recommender systems experimental process, and distributional thinking has substantial ramifications for how we design, evaluate, and present recommender systems evaluation and research results. Leveraging and emphasizing distributions in the evaluation of recommender systems is a necessary step to ensure that the systems provide appropriate and equitably distributed benefit to the people they affect.
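As an illustration of the contrast between a point estimate and a distributional report, the following is a minimal sketch, not the authors' method. It assumes per-user utility scores (for example, a per-user effectiveness metric) have already been computed and tagged with a stakeholder group; the function names, group labels, and scores are all hypothetical.

```python
import random
import statistics
from collections import defaultdict


def summarize(scores):
    """Describe a list of per-user utility scores by more than its mean."""
    q1, median, q3 = statistics.quantiles(scores, n=4, method="inclusive")
    return {
        "n": len(scores),
        "mean": statistics.fmean(scores),
        "q1": q1,
        "median": median,
        "q3": q3,
        "min": min(scores),
        "max": max(scores),
    }


def bootstrap_ci(scores, n_boot=1000, alpha=0.05, seed=0):
    """Percentile bootstrap interval for the mean (one source of uncertainty)."""
    rng = random.Random(seed)
    means = sorted(
        statistics.fmean(rng.choices(scores, k=len(scores)))
        for _ in range(n_boot)
    )
    lower = means[int((alpha / 2) * n_boot)]
    upper = means[int((1 - alpha / 2) * n_boot) - 1]
    return lower, upper


def summarize_by_group(records):
    """Group (group_label, score) pairs and summarize each group separately."""
    groups = defaultdict(list)
    for group, score in records:
        groups[group].append(score)
    return {group: summarize(scores) for group, scores in groups.items()}


# Hypothetical data: (stakeholder group, per-user utility score).
records = [
    ("A", 0.9), ("A", 0.8), ("A", 0.7),
    ("B", 0.1), ("B", 0.2), ("B", 0.9),
]

all_scores = [score for _, score in records]
overall_mean = statistics.fmean(all_scores)  # the usual single point estimate
overall_ci = bootstrap_ci(all_scores)        # uncertainty around that estimate
by_group = summarize_by_group(records)       # distribution within each group

print("overall mean:", overall_mean, "bootstrap CI:", overall_ci)
for group, summary in sorted(by_group.items()):
    print(group, summary)
```

In this toy data the overall mean hides the fact that the two groups receive quite different utility, and that scores within group B are widely spread. A real analysis would choose metrics, groups, and resampling units to match the experiment.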


