Adversarial Gradient Driven Exploration for Deep Click-Through Rate Prediction

12/21/2021
by   Kailun Wu, et al.
0

Nowadays, data-driven deep neural models have already shown remarkable progress on Click-through Rate (CTR) prediction. Unfortunately, the effectiveness of such models may fail when there are insufficient data. To handle this issue, researchers often adopt exploration strategies to examine items based on the estimated reward, e.g., UCB or Thompson Sampling. In the context of Exploitation-and-Exploration for CTR prediction, recent studies have attempted to utilize the prediction uncertainty along with model prediction as the reward score. However, we argue that such an approach may make the final ranking score deviate from the original distribution, and thereby affect model performance in the online system. In this paper, we propose a novel exploration method called Adversarial Gradient Driven Exploration (AGE). Specifically, we propose a Pseudo-Exploration Module to simulate the gradient updating process, which can approximate the influence of the samples of to-be-explored items for the model. In addition, for better exploration efficiency, we propose an Dynamic Threshold Unit to eliminate the effects of those samples with low potential CTR. The effectiveness of our approach was demonstrated on an open-access academic dataset. Meanwhile, AGE has also been deployed in a real-world display advertising platform and all online metrics have been significantly improved.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/25/2020

Exploration in Online Advertising Systems with Deep Uncertainty-Aware Learning

Modern online advertising systems inevitably rely on personalization met...
research
08/03/2022

Exploration with Model Uncertainty at Extreme Scale in Real-Time Bidding

In this work, we present a scalable and efficient system for exploring t...
research
11/11/2020

Proximal Policy Optimization via Enhanced Exploration Efficiency

Proximal policy optimization (PPO) algorithm is a deep reinforcement lea...
research
11/19/2019

Implicit Generative Modeling for Efficient Exploration

Efficient exploration remains a challenging problem in reinforcement lea...
research
11/03/2019

Regularized Adversarial Sampling and Deep Time-aware Attention for Click-Through Rate Prediction

Improving the performance of click-through rate (CTR) prediction remains...
research
03/13/2020

Action for Better Prediction

Good prediction is necessary for autonomous robotics to make informed de...
research
06/06/2023

COPR: Consistency-Oriented Pre-Ranking for Online Advertising

Cascading architecture has been widely adopted in large-scale advertisin...

Please sign up or login with your details

Forgot password? Click here to reset