Reinforcement Learning to Optimize Long-term User Engagement in Recommender Systems

02/13/2019
by   Lixin Zou, et al.
1

Recommender systems play a crucial role in our daily lives. Feed streaming mechanism has been widely used in the recommender system, especially on the mobile Apps. The feed streaming setting provides users the interactive manner of recommendation in never-ending feeds. In such an interactive manner, a good recommender system should pay more attention to user stickiness, which is far beyond classical instant metrics, and typically measured by long-term user engagement. Directly optimizing the long-term user engagement is a non-trivial problem, as the learning target is usually not available for conventional supervised learning methods. Though reinforcement learning (RL) naturally fits the problem of maximizing the long term rewards, applying RL to optimize long-term user engagement is still facing challenges: user behaviors are versatile and difficult to model, which typically consists of both instant feedback ( clicks, ordering) and delayed feedback ( dwell time, revisit); in addition, performing effective off-policy learning is still immature, especially when combining bootstrapping and function approximation. To address these issues, in this work, we introduce a reinforcement learning framework --- FeedRec to optimize the long-term user engagement. FeedRec includes two components: 1) a Q-Network which designed in hierarchical LSTM takes charge of modeling complex user behaviors, and 2) an S-Network, which simulates the environment, assists the Q-Network and voids the instability of convergence in policy learning. Extensive experiments on synthetic data and a real-world large scale data show that FeedRec effectively optimizes the long-term user engagement and outperforms state-of-the-arts.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/06/2022

PrefRec: Preference-based Recommender Systems for Reinforcing Long-term User Engagement

Current advances in recommender systems have been remarkably successful ...
research
02/13/2023

On Modeling Long-Term User Engagement from Stochastic Feedback

An ultimate goal of recommender systems (RS) is to improve user engageme...
research
06/01/2022

ResAct: Reinforcing Long-term Engagement in Sequential Recommendation with Residual Actor

Long-term engagement is preferred over immediate engagement in sequentia...
research
02/17/2022

Should I send this notification? Optimizing push notifications decision making by modeling the future

Most recommender systems are myopic, that is they optimize based on the ...
research
12/20/2020

Reinforcement Learning-based Product Delivery Frequency Control

Frequency control is an important problem in modern recommender systems....
research
08/22/2023

Towards Validating Long-Term User Feedbacks in Interactive Recommendation Systems

Interactive Recommender Systems (IRSs) have attracted a lot of attention...
research
11/24/2022

Learning to Take a Break: Sustainable Optimization of Long-Term User Engagement

Optimizing user engagement is a key goal for modern recommendation syste...

Please sign up or login with your details

Forgot password? Click here to reset