A Methodology for the Offline Evaluation of Recommender Systems in a User Interface with Multiple Carousels

05/13/2021
by   Nicolò Felicioni, et al.
0

Many video-on-demand and music streaming services provide the user with a page consisting of several recommendation lists, i.e. widgets or swipeable carousels, each built with a specific criterion (e.g. most recent, TV series, etc.). Finding efficient strategies to select which carousels to display is an active research topic of great industrial interest. In this setting, the overall quality of the recommendations of a new algorithm cannot be assessed by measuring solely its individual recommendation quality. Rather, it should be evaluated in a context where other recommendation lists are already available, to account for how they complement each other. This is not considered by traditional offline evaluation protocols. Hence, we propose an offline evaluation protocol for a carousel setting in which the recommendation quality of a model is measured by how much it improves upon that of an already available set of carousels. We report experiments on publicly available datasets on the movie domain and notice that under a carousel setting the ranking of the algorithms change. In particular, when a SLIM carousel is available, matrix factorization models tend to be preferred, while item-based models are penalized. We also propose to extend ranking metrics to the two-dimensional carousel layout in order to account for a known position bias, i.e. users will not explore the lists sequentially, but rather concentrate on the top-left corner of the screen.

READ FULL TEXT
research
05/14/2021

Measuring the User Satisfaction in a Recommendation Interface with Multiple Carousels

It is common for video-on-demand and music streaming services to adopt a...
research
01/08/2019

Using offline metrics and user behavior analysis to combine multiple systems for music recommendation

There are many offline metrics that can be used as a reference for evalu...
research
06/06/2022

Offline Evaluation of Ranked Lists using Parametric Estimation of Propensities

Search engines and recommendation systems attempt to continually improve...
research
07/31/2019

Sudden Death: A New Way to Compare Recommendation Diversification

This paper describes problems with the current way we compare the divers...
research
07/03/2014

Reducing Offline Evaluation Bias in Recommendation Systems

Recommendation systems have been integrated into the majority of large o...
research
11/07/2020

Do Offline Metrics Predict Online Performance in Recommender Systems?

Recommender systems operate in an inherently dynamical setting. Past rec...

Please sign up or login with your details

Forgot password? Click here to reset