Reinforcing User Retention in a Billion Scale Short Video Recommender System

by Qingpeng Cai, et al.

Short video platforms have recently achieved rapid user growth by recommending interesting content to users. The objective of recommendation is to optimize user retention, thereby driving the growth of DAU (Daily Active Users). Retention is a long-term feedback signal that emerges only after multiple interactions between users and the system, and it is hard to attribute the retention reward to any single item or list of items. Traditional point-wise and list-wise models therefore cannot optimize retention. In this paper, we use reinforcement learning methods to optimize retention, as they are designed to maximize long-term performance. We formulate the problem as an infinite-horizon request-based Markov Decision Process, and our objective is to minimize the accumulated time interval between multiple sessions, which is equivalent to improving app open frequency and user retention. However, current reinforcement learning algorithms cannot be directly applied in this setting because of the uncertainty, bias, and long delay time inherent to user retention. We propose a novel method, dubbed RLUR, to address these challenges. Both offline and live experiments show that RLUR significantly improves user retention. RLUR has been fully launched in the Kuaishou app for a long time and achieves consistent improvements in user retention and DAU.
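To make the stated objective concrete, here is a minimal Python sketch of an "accumulated time interval between sessions" cost. It is an illustration under assumptions, not the RLUR method: each session is reduced to a single start timestamp in hours, and the discount factor `gamma` and the function names `return_intervals` and `discounted_return_time` are hypothetical choices that do not come from the abstract.

```python
from typing import Sequence


def return_intervals(session_starts: Sequence[float]) -> list[float]:
    """Gaps between consecutive session start times (assumed to be in hours)."""
    return [later - earlier
            for earlier, later in zip(session_starts, session_starts[1:])]


def discounted_return_time(session_starts: Sequence[float],
                           gamma: float = 0.9) -> float:
    """Discounted sum of the gaps between sessions.

    A lower value means the user comes back sooner and more often.
    `gamma` is an assumed discount factor used only for illustration.
    """
    intervals = return_intervals(session_starts)
    return sum((gamma ** i) * gap for i, gap in enumerate(intervals))


# A user who opens the app every 12 hours versus one who returns every 48 hours.
frequent_user = [0.0, 12.0, 24.0, 36.0]
rare_user = [0.0, 48.0, 96.0]

frequent_cost = discounted_return_time(frequent_user, gamma=0.5)
rare_cost = discounted_return_time(rare_user, gamma=0.5)
```

In this sketch the frequent user incurs the lower cost, which is the sense in which minimizing the accumulated interval corresponds to a higher app open frequency. A reinforcement learning agent would treat the negative of each gap as a delayed reward, and that delay is part of why the abstract describes the signal as hard to learn from.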




Two-Stage Constrained Actor-Critic for Short Video Recommendation

The wide popularity of short videos on social media poses new opportunit...

Constrained Reinforcement Learning for Short Video Recommendation

The wide popularity of short videos on social media poses new opportunit...

Enhancing the long-term performance of recommender system

Recommender system is a critically important tool in online commercial s...

Unknowable Manipulators: Social Network Curator Algorithms

For a social networking service to acquire and retain users, it must fin...

Reinforcement Learning to Optimize Lifetime Value in Cold-Start Recommendation

Recommender system plays a crucial role in modern E-commerce platform. D...

CAViaR: Context Aware Video Recommendations

Many recommendation systems rely on point-wise models, which score items...

JDRec: Practical Actor-Critic Framework for Online Combinatorial Recommender System

A combinatorial recommender (CR) system feeds a list of items to a user ...
