Asymptotically Unbiased Estimation for Delayed Feedback Modeling via Label Correction

02/14/2022
by   Yu Chen, et al.
5

Alleviating the delayed feedback problem is of crucial importance for the conversion rate(CVR) prediction in online advertising. Previous delayed feedback modeling methods using an observation window to balance the trade-off between waiting for accurate labels and consuming fresh feedback. Moreover, to estimate CVR upon the freshly observed but biased distribution with fake negatives, the importance sampling is widely used to reduce the distribution bias. While effective, we argue that previous approaches falsely treat fake negative samples as real negative during the importance weighting and have not fully utilized the observed positive samples, leading to suboptimal performance. In this work, we propose a new method, DElayed Feedback modeling with UnbiaSed Estimation, (DEFUSE), which aim to respectively correct the importance weights of the immediate positive, the fake negative, the real negative, and the delay positive samples at finer granularity. Specifically, we propose a two-step optimization approach that first infers the probability of fake negatives among observed negatives before applying importance sampling. To fully exploit the ground-truth immediate positives from the observed distribution, we further develop a bi-distribution modeling framework to jointly model the unbiased immediate positives and the biased delay conversions. Experimental results on both public and our industrial datasets validate the superiority of DEFUSE. Codes are available at https://github.com/ychen216/DEFUSE.git.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/29/2021

Real Negatives Matter: Continuous Training with Real Negatives for Delayed Feedback Modeling

One of the difficulties of conversion rate (CVR) prediction is that the ...
research
06/27/2023

Value-aware Importance Weighting for Off-policy Reinforcement Learning

Importance sampling is a central idea underlying off-policy prediction i...
research
10/04/2019

Dual Learning Algorithm for Delayed Feedback in Display Advertising

In display advertising, predicting the conversion rate, that is, the pro...
research
09/28/2020

Learning Classifiers under Delayed Feedback with a Time Window Assumption

We consider training a binary classifier under delayed feedback (DF Lear...
research
01/06/2021

Handling many conversions per click in modeling delayed feedback

Predicting the expected value or number of post-click conversions (purch...
research
09/14/2022

Improved proteasomal cleavage prediction with positive-unlabeled learning

Accurate in silico modeling of the antigen processing pathway is crucial...
research
07/27/2022

NICEST: Noisy Label Correction and Training for Robust Scene Graph Generation

Nearly all existing scene graph generation (SGG) models have overlooked ...

Please sign up or login with your details

Forgot password? Click here to reset