Rethinking Reinforcement Learning for Recommendation: A Prompt Perspective

06/15/2022
by   Xin Xin, et al.
0

Modern recommender systems aim to improve user experience. As reinforcement learning (RL) naturally fits this objective – maximizing an user's reward per session – it has become an emerging topic in recommender systems. Developing RL-based recommendation methods, however, is not trivial due to the offline training challenge. Specifically, the keystone of traditional RL is to train an agent with large amounts of online exploration making lots of `errors' in the process. In the recommendation setting, though, we cannot afford the price of making `errors' online. As a result, the agent needs to be trained through offline historical implicit feedback, collected under different recommendation policies; traditional RL algorithms may lead to sub-optimal policies under these offline training settings. Here we propose a new learning paradigm – namely Prompt-Based Reinforcement Learning (PRL) – for the offline training of RL-based recommendation agents. While traditional RL algorithms attempt to map state-action input pairs to their expected rewards (e.g., Q-values), PRL directly infers actions (i.e., recommended items) from state-reward inputs. In short, the agents are trained to predict a recommended item given the prior interactions and an observed reward value – with simple supervised learning. At deployment time, this historical (training) data acts as a knowledge base, while the state-reward pairs are used as a prompt. The agents are thus used to answer the question: Which item should be recommended given the prior interactions & the prompted reward value? We implement PRL with four notable recommendation models and conduct experiments on two real-world e-commerce datasets. Experimental results demonstrate the superior performance of our proposed methods.

READ FULL TEXT
research
05/18/2023

Contrastive State Augmentations for Reinforcement Learning-Based Recommender Systems

Learning reinforcement learning (RL)-based recommenders from historical ...
research
08/02/2018

RecoGym: A Reinforcement Learning Environment for the problem of Product Recommendation in Online Advertising

Recommender Systems are becoming ubiquitous in many settings and take ma...
research
07/26/2019

On the Value of Bandit Feedback for Offline Recommender System Evaluation

In academic literature, recommender systems are often evaluated on the t...
research
12/09/2020

Interactive Search Based on Deep Reinforcement Learning

With the continuous development of machine learning technology, major e-...
research
04/17/2023

Causal Decision Transformer for Recommender Systems via Offline Reinforcement Learning

Reinforcement learning-based recommender systems have recently gained po...
research
03/11/2023

User Retention-oriented Recommendation with Decision Transformer

Improving user retention with reinforcement learning (RL) has attracted ...
research
02/11/2019

Model-Based Reinforcement Learning for Whole-Chain Recommendations

With the recent prevalence of Reinforcement Learning (RL), there have be...

Please sign up or login with your details

Forgot password? Click here to reset