Large-scale Interactive Recommendation with Tree-structured Policy Gradient

11/14/2018
by   Haokun Chen, et al.
0

Reinforcement learning (RL) has recently been introduced to interactive recommender systems (IRS) because of its nature of learning from dynamic interactions and planning for long-run performance. As IRS is always with thousands of items to recommend (i.e., thousands of actions), most existing RL-based methods, however, fail to handle such a large discrete action space problem and thus become inefficient. The existing work that tries to deal with the large discrete action space problem by utilizing the deep deterministic policy gradient framework suffers from the inconsistency between the continuous action representation (the output of the actor network) and the real discrete action. To avoid such inconsistency and achieve high efficiency and recommendation effectiveness, in this paper, we propose a Tree-structured Policy Gradient Recommendation (TPGR) framework, where a balanced hierarchical clustering tree is built over the items and picking an item is formulated as seeking a path from the root to a certain leaf of the tree. Extensive experiments on carefully-designed environments based on two real-world datasets demonstrate that our model provides superior recommendation performance and significant efficiency improvement over state-of-the-art methods.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/14/2020

A Text-based Deep Reinforcement Learning Framework for Interactive Recommendation

Due to its nature of learning from dynamic interactions and planning for...
research
06/18/2020

Interactive Recommender System via Knowledge Graph-enhanced Reinforcement Learning

Interactive recommender system (IRS) has drawn huge attention because of...
research
08/10/2022

Deep Reinforcement Learning for Dynamic Recommendation with Model-agnostic Counterfactual Policy Synthesis

Recent advances in recommender systems have proved the potential of Rein...
research
01/17/2018

Reinforcement Learning based Recommender System using Biclustering Technique

A recommender system aims to recommend items that a user is interested i...
research
04/07/2023

Continuous Input Embedding Size Search For Recommender Systems

Latent factor models are the most popular backbones for today's recommen...
research
02/10/2020

Discrete Action On-Policy Learning with Action-Value Critic

Reinforcement learning (RL) in discrete action space is ubiquitous in re...
research
04/24/2022

Adaptive Task Planning for Large-Scale Robotized Warehouses

Robotized warehouses are deployed to automatically distribute millions o...

Please sign up or login with your details

Forgot password? Click here to reset