KuaiRec: A Fully-observed Dataset for Recommender Systems

02/22/2022
by   Chongming Gao, et al.
0

Recommender systems are usually developed and evaluated on the historical user-item logs. However, most offline recommendation datasets are highly sparse and contain various biases, which hampers the evaluation of recommendation policies. Existing efforts aim to improve the data quality by collecting users' preferences on randomly selected items (e.g., Yahoo! and Coat). However, they still suffer from the high variance issue caused by the sparsely observed data. To fundamentally solve the problem, we present KuaiRec, a fully-observed dataset collected from the social video-sharing mobile App, Kuaishou. The feedback of 1,411 users on almost all of the 3,327 videos is explicitly observed. To the best of our knowledge, this is the first real-world fully-observed dataset with millions of user-item interactions in recommendation. To demonstrate the advantage of KuaiRec, we leverage it to explore the key questions in evaluating conversational recommender systems. The experimental results show that two factors in traditional partially-observed data – the data density and the exposure bias – greatly affect the evaluation results. This entails the significance of our fully-observed data in researching many directions in recommender systems, e.g., the unbiased recommendation, interactive/conversational recommendation, and evaluation. We release the dataset and the pipeline implementation for evaluation at https://chongminggao.github.io/KuaiRec/.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/08/2022

Towards Fair Conversational Recommender Systems

Conversational recommender systems have demonstrated great success. They...
research
04/26/2022

Cross Pairwise Ranking for Unbiased Item Recommendation

Most recommender systems optimize the model on observed interaction data...
research
06/24/2019

Query-based Interactive Recommendation by Meta-Path and Adapted Attention-GRU

Recently, interactive recommender systems are becoming increasingly popu...
research
06/15/2023

ReLoop2: Building Self-Adaptive Recommendation Models via Responsive Error Compensation Loop

Industrial recommender systems face the challenge of operating in non-st...
research
04/18/2021

The Simpson's Paradox in the Offline Evaluation of Recommendation Systems

Recommendation systems are often evaluated based on user's interactions ...
research
08/10/2022

DVR: Micro-Video Recommendation Optimizing Watch-Time-Gain under Duration Bias

Recommender systems are prone to be misled by biases in the data. Models...
research
07/03/2014

Reducing Offline Evaluation Bias in Recommendation Systems

Recommendation systems have been integrated into the majority of large o...

Please sign up or login with your details

Forgot password? Click here to reset