Lessons Learned Addressing Dataset Bias in Model-Based Candidate Generation at Twitter

05/13/2021
by Alim Virani, et al.

Traditionally, heuristic methods are used to generate candidates for large-scale recommender systems. Model-based candidate generation promises several potential advantages, chief among them that the candidate generator can explicitly optimize the same objective as the downstream ranking model. However, large-scale model-based candidate generation suffers from dataset bias, because it is infeasible to obtain representative data on highly irrelevant candidates. Popular techniques for correcting dataset bias, such as inverse propensity scoring, do not work well in the context of candidate generation. We first explore the dynamics of the dataset bias problem and then demonstrate how to use random sampling techniques to mitigate it. Finally, in a novel application of fine-tuning, we show performance gains when applying our candidate generation system to Twitter's home timeline.
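
The abstract does not spell out the sampling procedure, so the following is only a minimal sketch of one common way random sampling is used against this kind of bias: logged impressions are augmented with items drawn uniformly from the corpus and treated as negatives. The function name `add_random_negatives`, the `(user, item, label)` tuple layout, and the choice to label every random sample as a negative are assumptions made for illustration, not the authors' method.

```python
import random


def add_random_negatives(logged, corpus, n_random, seed=0):
    """Augment logged (user, item, label) examples with random negatives.

    Hypothetical helper: for each user, draw up to `n_random` items
    uniformly from `corpus` that the user was never shown, and append
    them with label 0. Treating unseen random items as negatives is an
    assumption of this sketch.
    """
    rng = random.Random(seed)

    # Collect the items each user was actually shown.
    seen = {}
    for user, item, _label in logged:
        seen.setdefault(user, set()).add(item)

    augmented = list(logged)
    for user, shown in seen.items():
        # Candidate pool: corpus items this user has no logged data for.
        pool = [item for item in corpus if item not in shown]
        for item in rng.sample(pool, min(n_random, len(pool))):
            augmented.append((user, item, 0))
    return augmented


logged = [("u1", "a", 1), ("u1", "b", 0), ("u2", "c", 1)]
corpus = ["a", "b", "c", "d", "e", "f"]
augmented = add_random_negatives(logged, corpus, n_random=2)
```

The logged examples only cover items that an upstream system chose to serve, so the uniformly drawn rows are the only ones in `augmented` that represent the largely irrelevant remainder of the corpus.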


research
09/12/2019

Candidate Generation with Binary Codes for Large-Scale Top-N Recommendation

Generating the Top-N recommendations from a large corpus is computationa...
research
05/20/2020

Contrastive Learning for Debiased Candidate Generation at Scale

Deep candidate generation has become an increasingly popular choice depl...
research
05/12/2022

kNN-Embed: Locally Smoothed Embedding Mixtures For Multi-interest Candidate Retrieval

Candidate generation is the first stage in recommendation systems, where...
research
05/20/2020

Contrastive Learning for Debiased Candidate Generation in Large-Scale Recommender Systems

Deep candidate generation (DCG) that narrows down the collection of rele...
research
02/27/2023

TwERC: High Performance Ensembled Candidate Generation for Ads Recommendation at Twitter

Recommendation systems are a core feature of social media companies with...
research
05/17/2023

BAD: BiAs Detection for Large Language Models in the context of candidate screening

Application Tracking Systems (ATS) have allowed talent managers, recruit...
research
10/10/2020

MS-Ranker: Accumulating Evidence from Potentially Correct Candidates for Answer Selection

As conventional answer selection (AS) methods generally match the questi...
