Dynamic Learning of Sequential Choice Bandit Problem under Marketing Fatigue

03/19/2019
by   Junyu Cao, et al.
0

Motivated by the observation that overexposure to unwanted marketing activities leads to customer dissatisfaction, we consider a setting where a platform offers a sequence of messages to its users and is penalized when users abandon the platform due to marketing fatigue. We propose a novel sequential choice model to capture multiple interactions taking place between the platform and its user: Upon receiving a message, a user decides on one of the three actions: accept the message, skip and receive the next message, or abandon the platform. Based on user feedback, the platform dynamically learns users' abandonment distribution and their valuations of messages to determine the length of the sequence and the order of the messages, while maximizing the cumulative payoff over a horizon of length T. We refer to this online learning task as the sequential choice bandit problem. For the offline combinatorial optimization problem, we show that an efficient polynomial-time algorithm exists. For the online problem, we propose an algorithm that balances exploration and exploitation, and characterize its regret bound. Lastly, we demonstrate how to extend the model with user contexts to incorporate personalization.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/22/2020

Fatigue-aware Bandits for Dependent Click Models

As recommender systems send a massive amount of content to keep users en...
research
01/05/2021

Sequential Choice Bandits with Feedback for Personalizing users' experience

In this work, we study sequential choice bandits with feedback. We propo...
research
01/23/2019

Thompson Sampling for a Fatigue-aware Online Recommendation System

In this paper we consider an online recommendation setting, where a plat...
research
11/30/2021

Online Learning for Receding Horizon Control with Provable Regret Guarantees

We address the problem of learning to control an unknown linear dynamica...
research
04/04/2023

Online Joint Assortment-Inventory Optimization under MNL Choices

We study an online joint assortment-inventory optimization problem, in w...
research
10/04/2018

Tight Information Theoretic Converse Results for some Pliable Index Coding Problems

This paper studies the Pliable Index CODing problem (PICOD), which model...
research
07/07/2021

POSLAN: Disentangling Chat with Positional and Language encoded Post Embeddings

Most online message threads inherently will be cluttered and any new use...

Please sign up or login with your details

Forgot password? Click here to reset