Should I send this notification? Optimizing push notifications decision making by modeling the future

02/17/2022
by Conor O'Brien, et al.

Most recommender systems are myopic, that is, they optimize based on the immediate response of the user. This may be misaligned with the true objective, such as creating long-term user satisfaction. In this work we focus on mobile push notifications, where the long-term effects of recommender system decisions can be particularly strong. For example, sending too many notifications, or irrelevant ones, may annoy a user and cause them to disable notifications. However, a myopic system will always choose to send a notification, since the negative effects occur only in the future. This is typically mitigated using heuristics. However, heuristics can be hard to reason about or improve, require retuning each time the system is changed, and may be suboptimal. To counter these drawbacks, there is significant interest in recommender systems that optimize directly for long-term value (LTV). Here, we describe a method for maximising LTV by using model-based reinforcement learning (RL) to make decisions about whether to send push notifications. We model the effects of sending a notification on the user's future behavior. Much of the prior work applying RL to maximise LTV in recommender systems has focused on session-based optimization, while the time horizon for notification decision making in this work extends over several days. We test this approach in an A/B test on a major social network. We show that by optimizing decisions about push notifications we are able to send fewer notifications and obtain a higher open rate than the baseline system, while generating the same level of user engagement on the platform as the existing heuristic-based system.
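
The contrast between a myopic send rule and a long-term-value send rule can be illustrated with a minimal sketch. This is not the method from the paper: the `Prediction` fields, the discount factor, and all numbers below are hypothetical, chosen only to show why a rule that looks at immediate reward alone always sends, while a rule that also compares predicted future engagement can decide to hold back.

```python
from dataclasses import dataclass


@dataclass
class Prediction:
    """Hypothetical model outputs for one candidate notification."""
    p_open: float             # predicted probability the user opens it now
    future_value_send: float  # predicted future engagement if we send
    future_value_hold: float  # predicted future engagement if we do not send


def myopic_should_send(pred: Prediction) -> bool:
    # Only the immediate reward is considered. An open probability is never
    # negative, so sending never looks worse than holding: always True.
    return pred.p_open >= 0.0


def ltv_should_send(pred: Prediction, discount: float = 0.9) -> bool:
    # Compare immediate reward plus discounted predicted future value
    # for both actions (assumed form of the comparison, for illustration).
    value_send = pred.p_open + discount * pred.future_value_send
    value_hold = 0.0 + discount * pred.future_value_hold
    return value_send > value_hold


# Hypothetical numbers: an irrelevant notification slightly harms future
# engagement, a relevant one barely does.
irrelevant = Prediction(p_open=0.02, future_value_send=3.0, future_value_hold=3.5)
relevant = Prediction(p_open=0.40, future_value_send=3.4, future_value_hold=3.5)

for name, pred in [("irrelevant", irrelevant), ("relevant", relevant)]:
    print(name, myopic_should_send(pred), ltv_should_send(pred))
```

With these made-up inputs, the myopic rule sends both notifications, while the long-term rule sends only the relevant one, which is the qualitative behavior the abstract describes.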


research
05/29/2019

Reinforcement Learning for Slate-based Recommender Systems: A Tractable Decomposition and Practical Methodology

Most practical recommender systems focus on estimating immediate user en...
research
02/13/2019

Reinforcement Learning to Optimize Long-term User Engagement in Recommender Systems

Recommender systems play a crucial role in our daily lives. Feed streami...
research
12/06/2022

PrefRec: Preference-based Recommender Systems for Reinforcing Long-term User Engagement

Current advances in recommender systems have been remarkably successful ...
research
05/23/2023

Optimizing Long-term Value for Auction-Based Recommender Systems via On-Policy Reinforcement Learning

Auction-based recommender systems are prevalent in online advertising pl...
research
04/25/2022

Long-run User Value Optimization in Recommender Systems through Content Creation Modeling

Content recommender systems are generally adept at maximizing immediate ...
research
06/22/2023

Don't Treat the Symptom, Find the Cause! Efficient Artificial-Intelligence Methods for (Interactive) Debugging

In the modern world, we are permanently using, leveraging, interacting w...
research
12/20/2020

Reinforcement Learning-based Product Delivery Frequency Control

Frequency control is an important problem in modern recommender systems....
