Exploration in two-stage recommender systems

09/01/2020
by   Jiri Hron, et al.
0

Two-stage recommender systems are widely adopted in industry due to their scalability and maintainability. These systems produce recommendations in two steps: (i) multiple nominators preselect a small number of items from a large pool using cheap-to-compute item embeddings; (ii) with a richer set of features, a ranker rearranges the nominated items and serves them to the user. A key challenge of this setup is that optimal performance of each stage in isolation does not imply optimal global performance. In response to this issue, Ma et al. (2020) proposed a nominator training objective importance weighted by the ranker's probability of recommending each item. In this work, we focus on the complementary issue of exploration. Modeled as a contextual bandit problem, we find LinUCB (a near optimal exploration strategy for single-stage systems) may lead to linear regret when deployed in two-stage recommenders. We therefore propose a method of synchronising the exploration strategies between the ranker and the nominators. Our algorithm only relies on quantities already computed by standard LinUCB at each stage and can be implemented in three lines of additional code. We end by demonstrating the effectiveness of our algorithm experimentally.

READ FULL TEXT
research
06/28/2021

On component interactions in two-stage recommender systems

Thanks to their scalability, two-stage recommenders are used by many of ...
research
05/30/2022

Fairness in the First Stage of Two-Stage Recommender Systems

Many large-scale recommender systems consist of two stages, where the fi...
research
03/17/2016

Cascading Bandits for Large-Scale Recommendation Problems

Most recommender systems recommend a list of items. The user examines th...
research
01/29/2023

Recommender system as an exploration coordinator: a bounded O(1) regret algorithm for large platforms

On typical modern platforms, users are only able to try a small fraction...
research
11/15/2015

Expressive recommender systems through normalized nonnegative models

We introduce normalized nonnegative models (NNM) for explorative data an...
research
07/23/2023

Scalable solution to crossed random effects model with random slopes

The crossed random-effects model is widely used in applied statistics, f...
research
10/06/2021

Two-level monotonic multistage recommender systems

A recommender system learns to predict the user-specific preference or i...

Please sign up or login with your details

Forgot password? Click here to reset