Addressing Delayed Feedback for Continuous Training with Neural Networks in CTR prediction

07/15/2019
by   Ira Ktena, et al.
1

One of the challenges in display advertising is that the distribution of features and click through rate (CTR) can exhibit large shifts over time due to seasonality, changes to ad campaigns and other factors. The predominant strategy to keep up with these shifts is to train predictive models continuously, on fresh data, in order to prevent them from becoming stale. However, in many ad systems positive labels are only observed after a possibly long and random delay. These delayed labels pose a challenge to data freshness in continuous training: fresh data may not have complete label information at the time they are ingested by the training algorithm. Naive strategies which consider any data point a negative example until a positive label becomes available tend to underestimate CTR, resulting in inferior user experience and suboptimal performance for advertisers. The focus of this paper is to identify the best combination of loss functions and models that enable large-scale learning from a continuous stream of data in the presence of delayed labels. In this work, we compare 5 different loss functions, 3 of them applied to this problem for the first time. We benchmark their performance in offline settings on both public and proprietary datasets in conjunction with shallow and deep model architectures. We also discuss the engineering cost associated with implementing each loss function in a production environment. Finally, we carried out online experiments with the top performing methods, in order to validate their performance in a continuous training scheme. While training on 668 million in-house data points offline, our proposed methods outperform previous state-of-the-art by 3 experiments, we observed 55 against naive log loss.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/05/2022

AugLoss: A Learning Methodology for Real-World Dataset Corruption

Deep Learning (DL) models achieve great successes in many domains. Howev...
research
04/29/2021

Real Negatives Matter: Continuous Training with Real Negatives for Delayed Feedback Modeling

One of the difficulties of conversion rate (CVR) prediction is that the ...
research
10/04/2019

Dual Learning Algorithm for Delayed Feedback in Display Advertising

In display advertising, predicting the conversion rate, that is, the pro...
research
06/29/2016

Non-linear Label Ranking for Large-scale Prediction of Long-Term User Interests

We consider the problem of personalization of online services from the v...
research
04/14/2021

Joint Negative and Positive Learning for Noisy Labels

Training of Convolutional Neural Networks (CNNs) with data with noisy la...
research
06/10/2019

Deep Spatio-Temporal Neural Networks for Click-Through Rate Prediction

Click-through rate (CTR) prediction is a critical task in online adverti...
research
07/08/2020

Unbiased Lift-based Bidding System

Conventional bidding strategies for online display ad auction heavily re...

Please sign up or login with your details

Forgot password? Click here to reset