Model-agnostic Counterfactual Synthesis Policy for Interactive Recommendation

04/01/2022
by Siyu Wang, et al.

Interactive recommendation learns from the interactions between users and the system in order to keep up with users' dynamic interests. Recent work has shown that reinforcement learning, with its ability to handle dynamic processes, can be applied effectively to interactive recommendation. However, the sparsity of interaction data may hamper the performance of the system. We propose to train a Model-agnostic Counterfactual Synthesis Policy that generates counterfactual data and addresses the data sparsity problem by modelling both the observational and the counterfactual distributions. The proposed policy can identify and replace the trivial components of any state during training alongside other agents, and can therefore be deployed in any RL-based algorithm. The experimental results demonstrate the effectiveness and generality of the proposed policy.

research
08/10/2022

Deep Reinforcement Learning for Dynamic Recommendation with Model-agnostic Counterfactual Policy Synthesis

Recent advances in recommender systems have proved the potential of Rein...
research
04/14/2020

A Text-based Deep Reinforcement Learning Framework for Interactive Recommendation

Due to its nature of learning from dynamic interactions and planning for...
research
07/14/2022

Reinforced Path Reasoning for Counterfactual Explainable Recommendation

Counterfactual explanations interpret the recommendation mechanism via e...
research
10/19/2022

Data-Augmented Counterfactual Learning for Bundle Recommendation

Bundle Recommendation (BR) aims at recommending bundled items on online ...
research
09/11/2021

CauseRec: Counterfactual User Sequence Synthesis for Sequential Recommendation

Learning user representations based on historical behaviors lies at the ...
research
05/14/2019

Counterfactual Off-Policy Evaluation with Gumbel-Max Structural Causal Models

We introduce an off-policy evaluation procedure for highlighting episode...
research
12/01/2016

Large-scale Validation of Counterfactual Learning Methods: A Test-Bed

The ability to perform effective off-policy learning would revolutionize...
