On Modeling Long-Term User Engagement from Stochastic Feedback

02/13/2023
by   Guoxi Zhang, et al.
0

An ultimate goal of recommender systems (RS) is to improve user engagement. Reinforcement learning (RL) is a promising paradigm for this goal, as it directly optimizes overall performance of sequential recommendation. However, many existing RL-based approaches induce huge computational overhead, because they require not only the recommended items but also all other candidate items to be stored. This paper proposes an efficient alternative that does not require the candidate items. The idea is to model the correlation between user engagement and items directly from data. Moreover, the proposed approach consider randomness in user feedback and termination behavior, which are ubiquitous for RS but rarely discussed in RL-based prior work. With online A/B experiments on real-world RS, we confirm the efficacy of the proposed approach and the importance of modeling the two types of randomness.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/13/2019

Reinforcement Learning to Optimize Long-term User Engagement in Recommender Systems

Recommender systems play a crucial role in our daily lives. Feed streami...
research
06/02/2020

Maximizing Cumulative User Engagement in Sequential Recommendation: An Online Optimization Perspective

To maximize cumulative user engagement (e.g. cumulative clicks) in seque...
research
10/15/2021

Value Penalized Q-Learning for Recommender Systems

Scaling reinforcement learning (RL) to recommender systems (RS) is promi...
research
06/18/2020

Interactive Recommender System via Knowledge Graph-enhanced Reinforcement Learning

Interactive recommender system (IRS) has drawn huge attention because of...
research
12/12/2020

Learning over no-Preferred and Preferred Sequence of items for Robust Recommendation

In this paper, we propose a theoretically founded sequential strategy fo...
research
01/20/2023

Generative Slate Recommendation with Reinforcement Learning

Recent research has employed reinforcement learning (RL) algorithms to o...
research
07/14/2021

Plan-Based Relaxed Reward Shaping for Goal-Directed Tasks

In high-dimensional state spaces, the usefulness of Reinforcement Learni...

Please sign up or login with your details

Forgot password? Click here to reset