Marketing Budget Allocation with Offline Constrained Deep Reinforcement Learning

09/06/2023
by   Tianchi Cai, et al.
0

We study the budget allocation problem in online marketing campaigns that utilize previously collected offline data. We first discuss the long-term effect of optimizing marketing budget allocation decisions in the offline setting. To overcome the challenge, we propose a novel game-theoretic offline value-based reinforcement learning method using mixed policies. The proposed method reduces the need to store infinitely many policies in previous methods to only constantly many policies, which achieves nearly optimal policy efficiency, making it practical and favorable for industrial usage. We further show that this method is guaranteed to converge to the optimal policy, which cannot be achieved by previous value-based reinforcement learning methods for marketing budget allocation. Our experiments on a large-scale marketing campaign with tens-of-millions users and more than one billion budget verify the theoretical results and show that the proposed method outperforms various baseline methods. The proposed method has been successfully deployed to serve all the traffic of this marketing campaign.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/29/2021

A Policy Efficient Reduction Approach to Convex Constrained Deep Reinforcement Learning

Although well-established in general reinforcement learning (RL), value-...
research
06/03/2021

Towards Cost-Optimal Policies for DAGs to Utilize IaaS Clouds with Online Learning

Premier cloud service providers (CSPs) offer two types of purchase optio...
research
02/09/2023

An End-to-End Framework for Marketing Effectiveness Optimization under Budget Constraint

Online platforms often incentivize consumers to improve user engagement ...
research
06/18/2021

Active Offline Policy Selection

This paper addresses the problem of policy selection in domains with abu...
research
04/01/2019

Dynamically optimal treatment allocation using Reinforcement Learning

Consider a situation wherein a stream of individuals arrive sequentially...
research
07/28/2017

Learning to Teach Reinforcement Learning Agents

In this article we study the transfer learning model of action advice un...
research
08/23/2020

Spending Money Wisely: Online Electronic Coupon Allocation based on Real-Time User Intent Detection

Online electronic coupon (e-coupon) is becoming a primary tool for e-com...

Please sign up or login with your details

Forgot password? Click here to reset