Toward Simulating Environments in Reinforcement Learning Based Recommendations

06/27/2019
by   Xiangyu Zhao, et al.
0

With the recent advances in Reinforcement Learning (RL), there have been tremendous interests in employing RL for recommender systems. RL-based recommender systems have two key advantages: (i) they can continuously update their recommendation strategies according to users' real-time feedback, and (ii) the optimal strategy maximizes the long-term reward from users, such as the total revenue of a recommendation session. However, directly training and evaluating a new RL-based recommendation algorithm needs to collect users' real-time feedback in the real system, which is time and efforts consuming and could negatively impact on users' experiences. Thus, it calls for a user simulator that can mimic real users' behaviors where we can pre-train and evaluate new recommendation algorithms. Simulating users' behaviors in a dynamic system faces immense challenges -- (i) the underlining item distribution is complex, and (ii) historical logs for each user are limited. In this paper, we develop a user simulator base on Generative Adversarial Network (GAN). To be specific, we design the generator to capture the underlining distribution of users' historical logs and generate realistic logs that can be considered as augmentations of real logs; while the discriminator is developed to not only distinguish real and fake logs but also predict users' behaviors. The experimental results based on real-world e-commerce data demonstrate the effectiveness of the proposed simulator. Further experiments have been conducted to understand the importance of each component in the simulator.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/11/2019

Model-Based Reinforcement Learning for Whole-Chain Recommendations

With the recent prevalence of Reinforcement Learning (RL), there have be...
research
09/09/2019

Deep Reinforcement Learning for Online Advertising in Recommender Systems

With the recent prevalence of Reinforcement Learning (RL), there have be...
research
02/19/2018

Recommendations with Negative Feedback via Pairwise Deep Reinforcement Learning

Recommender systems play a crucial role in mitigating the problem of inf...
research
07/06/2020

Understanding Echo Chambers in E-commerce Recommender Systems

Personalized recommendation benefits users in accessing contents of inte...
research
11/11/2020

Adaptive Neural Architectures for Recommender Systems

Deep learning has proved an effective means to capture the non-linear as...
research
12/18/2018

Reinforcement Learning for Online Information Seeking

Information seeking techniques, satisfying users' information needs by s...
research
02/23/2019

Behavioral Petri Net Mining and Automated Analysis for Human-Computer Interaction Recommendations in Multi-Application Environments

Process Mining is a famous technique which is frequently applied to Soft...

Please sign up or login with your details

Forgot password? Click here to reset