Model-Based Reinforcement Learning for Whole-Chain Recommendations

02/11/2019
by   Xiangyu Zhao, et al.
0

With the recent prevalence of Reinforcement Learning (RL), there have been tremendous interests in developing RL-based recommender systems. In practical recommendation sessions, users will sequentially access multiple scenarios, such as the entrance pages and the item detail pages, and each scenario has its own recommendation strategy. However, the majority of existing RL-based recommender systems focus on separately optimizing each strategy, which could lead to sub-optimal overall performance, because independently optimizing each scenario (i) overlooks the sequential correlation among scenarios, (ii) ignores users' behavior data from other scenarios, and (iii) only optimizes its own objective but neglects the overall objective of a session. Therefore, in this paper, we study the recommendation problem with multiple (consecutive) scenarios, i.e., whole-chain recommendations. We propose a multi-agent reinforcement learning based approach (DeepChain), which can capture the sequential correlation among different scenarios and jointly optimize multiple recommendation strategies. To be specific, all recommender agents share the same memory of users' historical behaviors, and they work collaboratively to maximize the overall reward of a session. Note that optimizing multiple recommendation strategies jointly faces two challenges - (i) it requires huge amounts of user behavior data, and (ii) the distribution of reward (users' feedback) are extremely unbalanced. In this paper, we introduce model-based reinforcement learning techniques to reduce the training data requirement and execute more accurate strategy updates. The experimental results based on data from a real e-commerce platform demonstrate the effectiveness of the proposed framework. Further experiments have been conducted to validate the importance of each component of DeepChain.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/27/2019

Toward Simulating Environments in Reinforcement Learning Based Recommendations

With the recent advances in Reinforcement Learning (RL), there have been...
research
09/09/2019

Deep Reinforcement Learning for Online Advertising in Recommender Systems

With the recent prevalence of Reinforcement Learning (RL), there have be...
research
06/15/2022

Rethinking Reinforcement Learning for Recommendation: A Prompt Perspective

Modern recommender systems aim to improve user experience. As reinforcem...
research
12/30/2017

Deep Reinforcement Learning for List-wise Recommendations

Recommender systems play a crucial role in mitigating the problem of inf...
research
09/17/2018

Learning to Collaborate: Multi-Scenario Ranking via Multi-Agent Reinforcement Learning

Ranking is a fundamental and widely studied problem in scenarios such as...
research
11/29/2020

Cluster Based Deep Contextual Reinforcement Learning for top-k Recommendations

Rapid advancements in the E-commerce sector over the last few decades ha...
research
02/07/2023

Multi-Task Recommendations with Reinforcement Learning

In recent years, Multi-task Learning (MTL) has yielded immense success i...

Please sign up or login with your details

Forgot password? Click here to reset