Locality-Sensitive Experience Replay for Online Recommendation

10/21/2021
by   Xiaocong Chen, et al.

Online recommendation requires handling rapidly changing user preferences. Deep reinforcement learning (DRL) is gaining interest as an effective means of capturing users' dynamic interests during interactions with recommender systems. However, training a DRL agent is challenging because of the large state space (e.g., the user-item rating matrix and user profiles), the large action space (e.g., candidate items), and sparse rewards. Existing studies encourage the agent to learn from past experience via experience replay (ER), but these methods adapt poorly to the complex environment of online recommender systems and are inefficient at determining an optimal strategy from past experience. To address these issues, we design a novel state-aware experience replay model, which uses locality-sensitive hashing to map high-dimensional data into low-dimensional representations and a prioritized, reward-driven strategy to replay more valuable experiences with higher probability. Our model can selectively pick the most relevant and salient experiences and guide the agent toward the optimal policy. Experiments on three online simulation platforms demonstrate our model's feasibility and its superiority to several existing experience replay methods.
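
To make the two ideas in the abstract concrete, the following is a minimal sketch and not the authors' implementation. It assumes random-hyperplane hashing (SimHash) as the locality-sensitive hash and reward-proportional sampling as the prioritization rule; the paper may use different choices. The class name `LSHReplayBuffer` and its parameters (`n_bits`, `eps`) are hypothetical and introduced only for illustration.

```python
import numpy as np


class LSHReplayBuffer:
    """Illustrative state-aware replay buffer (assumed design, not the paper's)."""

    def __init__(self, state_dim, n_bits=8, seed=0):
        self.rng = np.random.default_rng(seed)
        # Random hyperplanes: each one contributes one bit of the hash code.
        self.planes = self.rng.standard_normal((n_bits, state_dim))
        # Hash code -> list of (state, action, reward, next_state) transitions.
        self.buckets = {}

    def encode(self, state):
        """Map a high-dimensional state to a short integer code."""
        bits = (self.planes @ np.asarray(state, dtype=float)) >= 0
        return int("".join("1" if b else "0" for b in bits), 2)

    def add(self, state, action, reward, next_state):
        code = self.encode(state)
        self.buckets.setdefault(code, []).append(
            (state, action, reward, next_state)
        )

    def sample(self, state, batch_size, eps=1e-3):
        """Replay transitions from the bucket of `state`, favoring high reward."""
        candidates = self.buckets.get(self.encode(state))
        if not candidates:
            # Fall back to the whole buffer when the bucket is empty.
            candidates = [t for b in self.buckets.values() for t in b]
        if not candidates:
            return []
        rewards = np.array([t[2] for t in candidates], dtype=float)
        # Shift rewards so every priority is positive, then normalize.
        priorities = rewards - rewards.min() + eps
        probs = priorities / priorities.sum()
        idx = self.rng.choice(len(candidates), size=batch_size, p=probs)
        return [candidates[i] for i in idx]


# Usage: two transitions share a state; the high-reward one dominates replay.
buffer = LSHReplayBuffer(state_dim=16, n_bits=8, seed=0)
s = np.ones(16)
buffer.add(s, action=0, reward=0.0, next_state=s)
buffer.add(s, action=1, reward=1000.0, next_state=s)
batch = buffer.sample(s, batch_size=200)
```

In this sketch, states with a small angle between them tend to share a hash code, so sampling from the bucket of the current state retrieves experiences collected in similar states, and the reward-based priorities bias replay toward the more valuable ones.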

research
06/19/2019

Experience Replay Optimization

Experience replay enables reinforcement learning agents to memorize and ...
research
08/01/2022

A Maintenance Planning Framework using Online and Offline Deep Reinforcement Learning

Cost-effective asset management is an area of interest across several in...
research
09/17/2022

Intrinsically Motivated Reinforcement Learning based Recommendation with Counterfactual Data Augmentation

Deep reinforcement learning (DRL) has been proven its efficiency in capt...
research
02/17/2018

A Deep Q-Learning Agent for the L-Game with Variable Batch Training

We employ the Deep Q-Learning algorithm with Experience Replay to train ...
research
09/24/2019

Invariant Transform Experience Replay

Deep reinforcement learning (DRL) is a promising approach for adaptive r...
research
03/12/2023

AutoDenoise: Automatic Data Instance Denoising for Recommendations

Historical user-item interaction datasets are essential in training mode...
research
05/07/2017

Item Recommendation with Continuous Experience Evolution of Users using Brownian Motion

Online review communities are dynamic as users join and leave, adopt new...
