Exploration in Online Advertising Systems with Deep Uncertainty-Aware Learning

11/25/2020
by   Chao Du, et al.
0

Modern online advertising systems inevitably rely on personalization methods, such as click-through rate (CTR) prediction. Recent progress in CTR prediction enjoys the rich representation capabilities of deep learning and achieves great success in large-scale industrial applications. However, these methods can suffer from lack of exploration. Another line of prior work addresses the exploration-exploitation trade-off problem with contextual bandit methods, which are less studied in the industry recently due to the difficulty in extending their flexibility with deep models. In this paper, we propose a novel Deep Uncertainty-Aware Learning (DUAL) method to learn deep CTR models based on Gaussian processes, which can provide efficient uncertainty estimations along with the CTR predictions while maintaining the flexibility of deep neural networks. By linking the ability to estimate predictive uncertainties of DUAL to well-known bandit algorithms, we further present DUAL-based Ad-ranking strategies to boost up long-term utilities such as the social welfare in advertising systems. Experimental results on several public datasets demonstrate the effectiveness of our methods. Remarkably, an online A/B test deployed in the Alibaba display advertising platform shows an 8.2% social welfare improvement and an 8.0% revenue lift.

READ FULL TEXT
research
05/25/2021

We Know What You Want: An Advertising Strategy Recommender System for Online Advertising

Advertising expenditures have become the major source of revenue for e-c...
research
12/21/2021

Adversarial Gradient Driven Exploration for Deep Click-Through Rate Prediction

Nowadays, data-driven deep neural models have already shown remarkable p...
research
05/20/2022

NMA: Neural Multi-slot Auctions with Externalities for Online Advertising

Online advertising driven by auctions brings billions of dollars in reve...
research
07/18/2021

GuideBoot: Guided Bootstrap for Deep Contextual Bandits

The exploration/exploitation (E E) dilemma lies at the core of interac...
research
04/02/2019

Operation-aware Neural Networks for User Response Prediction

User response prediction makes a crucial contribution to the rapid devel...
research
08/03/2020

Deep Bayesian Bandits: Exploring in Online Personalized Recommendations

Recommender systems trained in a continuous learning fashion are plagued...
research
09/20/2016

Deep CTR Prediction in Display Advertising

Click through rate (CTR) prediction of image ads is the core task of onl...

Please sign up or login with your details

Forgot password? Click here to reset