Search-Based Serving Architecture of Embeddings-Based Recommendations

07/07/2019
by Sonya Liberman, et al.

Over the past 10 years, many recommendation techniques have been based on embedding users and items in latent vector spaces, where the inner product of a (user, item) pair of vectors represents the predicted affinity of the user to the item. A wealth of literature has focused on the various modeling approaches that result in embeddings, and has compared their quality metrics, learning complexity, etc. However, much less attention has been devoted to the issues surrounding productization of an embeddings-based, high-throughput, low-latency recommender system, in particular how the system might keep up with the changing embeddings as new models are learnt. This paper describes a reference architecture of a high-throughput, large-scale recommendation service which leverages a search engine as its runtime core. We describe how the search index and the query builder adapt to changes in the embeddings, which often happen at a different cadence than index builds. We provide solutions for both id-based and feature-based embeddings, as well as for batch indexing and incremental indexing setups. The described system is at the core of a Web content discovery service that serves tens of billions of recommendations per day in response to billions of user requests.
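
As a rough illustration of the scoring rule the abstract mentions (predicted affinity as the inner product of a user vector and an item vector), the sketch below ranks items for one user by brute force. It is not the search-engine-based architecture the paper describes; the function names, the dictionary layout, and the toy vectors are all assumptions made for this example.

```python
import heapq


def inner_product(u, v):
    """Inner product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def top_k_items(user_vec, item_vecs, k):
    """Return the ids of the k items with the highest predicted affinity.

    item_vecs is a hypothetical {item_id: vector} mapping. Every item is
    scored, so this is a brute-force scan, not an index lookup.
    """
    scored = (
        (inner_product(user_vec, vec), item_id)
        for item_id, vec in item_vecs.items()
    )
    return [item_id for _, item_id in heapq.nlargest(k, scored)]


# Toy data, invented for illustration only.
user = [1.0, 0.0]
items = {"a": [0.9, 0.1], "b": [0.1, 0.9], "c": [0.5, 0.5]}
print(top_k_items(user, items, 2))
```

A full scan like this costs time linear in the number of items per request, which is one reason a high-throughput service would rely on an index instead.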

research
07/25/2016

Meta-Prod2Vec - Product Embeddings Using Side-Information for Recommendation

We propose Meta-Prod2vec, a novel method to compute item similarities fo...
research
12/11/2018

Learning Item-Interaction Embeddings for User Recommendations

Industry-scale recommendation systems have become a cornerstone of the e...
research
10/14/2021

Reverse Maximum Inner Product Search: How to efficiently find users who would like to buy my item?

The MIPS (maximum inner product search), which finds the item with the h...
research
06/05/2017

To Index or Not to Index: Optimizing Maximum Inner Product Search

Making top-K predictions for state-of-the-art Matrix Factorization model...
research
06/13/2023

Better Generalization with Semantic IDs: A case study in Ranking for Recommendations

Training good representations for items is critical in recommender model...
research
08/10/2022

A Scalable Probabilistic Model for Reward Optimizing Slate Recommendation

We introduce Probabilistic Rank and Reward model (PRR), a scalable proba...
research
06/21/2019

Embedding models for recommendation under contextual constraints

Embedding models, which learn latent representations of users and items ...
