Learning to Infer User Hidden States for Online Sequential Advertising

09/03/2020
by   Zhaoqing Peng, et al.

To drive purchases in online advertising, advertisers have a strong interest in optimizing their sequential advertising strategy, for which both performance and interpretability matter. The lack of interpretability in existing deep reinforcement learning methods makes the strategy hard to understand, diagnose, and further optimize. In this paper, we propose the Deep Intents Sequential Advertising (DISA) method to address these issues. The key to interpretability is understanding a consumer's purchase intent, which is unobservable (a hidden state). We model this intent as a latent variable and formulate the problem as a Partially Observable Markov Decision Process (POMDP), in which the underlying intents are inferred from observable behaviors. Large-scale industrial offline and online experiments demonstrate our method's superior performance over several baselines. We also analyze the inferred hidden states, and the results support the rationality of our inference.
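
To make the POMDP framing above concrete, the following is a minimal sketch of a textbook discrete belief update (a Bayes filter) over hidden intent states. It is not the DISA method, whose details are not given in the abstract. The function name `belief_update`, the two intent states, the two actions, and every probability in the tables are hypothetical values chosen only for illustration.

```python
def belief_update(belief, action, observation, T, O):
    """One step of a discrete Bayes filter over hidden states.

    belief[s]       : current probability of hidden state s
    T[a][s][s2]     : assumed P(next state s2 | state s, action a)
    O[a][s2][o]     : assumed P(observation o | next state s2, action a)
    """
    n = len(belief)
    # Prediction step: push the belief through the transition model.
    predicted = [
        sum(belief[s] * T[action][s][s2] for s in range(n))
        for s2 in range(n)
    ]
    # Correction step: weight each state by the observation likelihood.
    unnormalized = [predicted[s2] * O[action][s2][observation] for s2 in range(n)]
    total = sum(unnormalized)
    if total == 0.0:
        raise ValueError("observation has zero probability under the model")
    return [u / total for u in unnormalized]


# Hypothetical setup: states 0 = low intent, 1 = high intent;
# actions 0 = no ad, 1 = show ad; observations 0 = no click, 1 = click.
T = [
    [[0.9, 0.1], [0.2, 0.8]],  # action 0
    [[0.6, 0.4], [0.1, 0.9]],  # action 1
]
O = [
    [[0.8, 0.2], [0.3, 0.7]],  # action 0
    [[0.7, 0.3], [0.2, 0.8]],  # action 1
]

belief = [0.5, 0.5]
new_belief = belief_update(belief, action=1, observation=1, T=T, O=O)
print(new_belief)
```

Under these made-up numbers, observing a click after showing an ad shifts the belief toward the high-intent state. A learned model would replace the hand-written tables with estimated transition and observation distributions.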


research
05/08/2023

Goal-oriented inference of environment from redundant observations

The agent learns to organize decision behavior to achieve a behavioral g...
research
08/19/2019

Learning to Advertise for Organic Traffic Maximization in E-Commerce Product Feeds

Most e-commerce product feeds provide blended results of advertised prod...
research
06/29/2020

Dynamic Knapsack Optimization Towards Efficient Multi-Channel Sequential Advertising

In E-commerce, advertising is essential for merchants to reach their tar...
research
07/07/2022

Multi-objective Optimization of Notifications Using Offline Reinforcement Learning

Mobile notification systems play a major role in a variety of applicatio...
research
12/10/2021

Blockwise Sequential Model Learning for Partially Observable Reinforcement Learning

This paper proposes a new sequential model learning architecture to solv...
research
07/24/2023

Analyzing the Strategy of Propaganda using Inverse Reinforcement Learning: Evidence from the 2022 Russian Invasion of Ukraine

The 2022 Russian invasion of Ukraine was accompanied by a large-scale, p...
research
02/24/2022

A Unified Framework for Campaign Performance Forecasting in Online Display Advertising

Advertisers usually enjoy the flexibility to choose criteria like target...
