Better Generalization with Semantic IDs: A case study in Ranking for Recommendations

06/13/2023
by Anima Singh, et al.

Training good representations for items is critical in recommender models. Typically, an item is assigned a unique, randomly generated ID and is represented by an embedding learned for that ID value. Although widely used, this approach has limitations when the number of items is large and item popularity follows a power-law distribution, both typical characteristics of real-world recommendation systems. The result is the item cold-start problem, where the model is unable to make reliable inferences for tail items and previously unseen items. Removing these ID features and their learned embeddings altogether to combat the cold-start issue severely degrades recommendation quality. Content-based item embeddings are more reliable, but they are expensive to store and use, particularly for sequences of users' past item interactions. In this paper, we use Semantic IDs: compact, discrete item representations learned from content embeddings using RQ-VAE that capture the hierarchy of concepts in items. We showcase how we use them as a replacement for item IDs in a resource-constrained ranking model used in an industrial-scale video-sharing platform. Moreover, we show how Semantic IDs improve the generalization ability of our system without sacrificing top-level metrics.
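To make the idea of a hierarchical discrete code more concrete, the following is a minimal sketch of the residual quantization step that underlies RQ-VAE, not the paper's implementation. It assumes fixed, randomly generated codebooks purely for illustration, whereas RQ-VAE learns its codebooks jointly with an encoder and decoder. The function name `residual_quantize`, the codebook sizes, and the embedding dimension are hypothetical choices.

```python
import numpy as np


def residual_quantize(embedding, codebooks):
    """Map a dense embedding to a tuple of code indices, one per level.

    Each level picks the nearest codeword to the current residual and
    subtracts it, so earlier codes capture coarse structure and later
    codes refine it.
    """
    residual = np.asarray(embedding, dtype=np.float64).copy()
    codes = []
    for codebook in codebooks:
        # Euclidean distance from the residual to every codeword at this level.
        distances = np.linalg.norm(codebook - residual, axis=1)
        index = int(np.argmin(distances))
        codes.append(index)
        residual = residual - codebook[index]
    return tuple(codes), residual


# Illustrative setup (assumed values): 3 levels, 8 codewords each, 4-dim embeddings.
rng = np.random.default_rng(0)
NUM_LEVELS, CODEBOOK_SIZE, DIM = 3, 8, 4
codebooks = [
    rng.normal(scale=1.0 / (2 ** level), size=(CODEBOOK_SIZE, DIM))
    for level in range(NUM_LEVELS)
]

item_embedding = rng.normal(size=DIM)  # stand-in for a content embedding
semantic_id, leftover = residual_quantize(item_embedding, codebooks)
print("Semantic ID:", semantic_id)
```

Under this scheme, two items whose tuples share a prefix were assigned the same coarse codewords, which is one way to read the "hierarchy of concepts" described above. A ranking model could then learn embeddings for these codes rather than for a random ID per item.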

research
02/28/2023

Item Cold Start Recommendation via Adversarial Variational Auto-encoder Warm-up

The gap between the randomly initialized item ID embedding and the well-...
research
09/23/2022

M2TRec: Metadata-aware Multi-task Transformer for Large-scale and Cold-start free Session-based Recommendations

Session-based recommender systems (SBRSs) have shown superior performanc...
research
05/27/2022

Improving Item Cold-start Recommendation via Model-agnostic Conditional Variational Autoencoder

Embedding MLP has become a paradigm for modern large-scale recommend...
research
03/14/2023

CoMeta: Enhancing Meta Embeddings with Collaborative Information in Cold-start Problem of Recommendation

The cold-start problem is quite challenging for existing recommendation ...
research
05/31/2020

Content-aware Neural Hashing for Cold-start Recommendation

Content-aware recommendation approaches are essential for providing mean...
research
07/07/2019

Search-Based Serving Architecture of Embeddings-Based Recommendations

Over the past 10 years, many recommendation techniques have been based o...
research
02/21/2022

GIFT: Graph-guIded Feature Transfer for Cold-Start Video Click-Through Rate Prediction

Short video has witnessed rapid growth in China and shows a promising ma...
