An Image Dataset for Benchmarking Recommender Systems with Raw Pixels

09/13/2023
by   Yu Cheng, et al.
0

Recommender systems (RS) have achieved significant success by leveraging explicit identification (ID) features. However, the full potential of content features, especially the pure image pixel features, remains relatively unexplored. The limited availability of large, diverse, and content-driven image recommendation datasets has hindered the use of raw images as item representations. In this regard, we present PixelRec, a massive image-centric recommendation dataset that includes approximately 200 million user-image interactions, 30 million users, and 400,000 high-quality cover images. By providing direct access to raw image pixels, PixelRec enables recommendation models to learn item representation directly from them. To demonstrate its utility, we begin by presenting the results of several classical pure ID-based baseline models, termed IDNet, trained on PixelRec. Then, to show the effectiveness of the dataset's image features, we substitute the itemID embeddings (from IDNet) with a powerful vision encoder that represents items using their raw image pixels. This new model is dubbed PixelNet.Our findings indicate that even in standard, non-cold start recommendation settings where IDNet is recognized as highly effective, PixelNet can already perform equally well or even better than IDNet. Moreover, PixelNet has several other notable advantages over IDNet, such as being more effective in cold-start and cross-domain recommendation scenarios. These results underscore the importance of visual features in PixelRec. We believe that PixelRec can serve as a critical resource and testing ground for research on recommendation models that emphasize image pixel content. The dataset, code, and leaderboard will be available at https://github.com/westlake-repl/PixelRec.

READ FULL TEXT
research
09/14/2023

NineRec: A Benchmark Dataset Suite for Evaluating Transferable Recommendation

Learning a recommender system model from an item's raw modality features...
research
03/24/2023

Where to Go Next for Recommender Systems? ID- vs. Modality-based Recommender Models Revisited

Recommendation models that utilize unique identities (IDs) to represent ...
research
10/13/2022

Tenrec: A Large-scale Multipurpose Benchmark Dataset for Recommender Systems

Existing benchmark datasets for recommender systems (RS) either are crea...
research
06/29/2023

Towards Personalized Cold-Start Recommendation with Prompts

Recommender systems play a crucial role in helping users discover inform...
research
07/27/2020

Towards Multi-Language Recipe Personalisation and Recommendation

Multi-language recipe personalisation and recommendation is an under-exp...
research
06/13/2022

TransRec: Learning Transferable Recommendation from Mixture-of-Modality Feedback

Learning big models and then transfer has become the de facto practice i...
research
07/29/2021

Sparse Feature Factorization for Recommender Systems with Knowledge Graphs

Deep Learning and factorization-based collaborative filtering recommenda...

Please sign up or login with your details

Forgot password? Click here to reset