Developing Multi-Task Recommendations with Long-Term Rewards via Policy Distilled Reinforcement Learning

01/27/2020
by   Xi Liu, et al.
13

With the explosive growth of online products and content, recommendation techniques have been considered as an effective tool to overcome information overload, improve user experience, and boost business revenue. In recent years, we have observed a new desideratum of considering long-term rewards of multiple related recommendation tasks simultaneously. The consideration of long-term rewards is strongly tied to business revenue and growth. Learning multiple tasks simultaneously could generally improve the performance of individual task due to knowledge sharing in multi-task learning. While a few existing works have studied long-term rewards in recommendations, they mainly focus on a single recommendation task. In this paper, we propose PoDiRe: a policy distilled recommender that can address long-term rewards of recommendations and simultaneously handle multiple recommendation tasks. This novel recommendation solution is based on a marriage of deep reinforcement learning and knowledge distillation techniques, which is able to establish knowledge sharing among different tasks and reduce the size of a learning model. The resulting model is expected to attain better performance and lower response latency for real-time recommendation services. In collaboration with Samsung Game Launcher, one of the world's largest commercial mobile game platforms, we conduct a comprehensive experimental study on large-scale real data with hundreds of millions of events and show that our solution outperforms many state-of-the-art methods in terms of several standard evaluation metrics.

READ FULL TEXT

page 1

page 18

page 21

page 24

page 26

research
09/01/2020

From Clicks to Conversions: Recommendation for long-term reward

Recommender systems are often optimised for short-term reward: a recomme...
research
10/29/2018

Deep Reinforcement Learning based Recommendation with Explicit User-Item Interactions Modeling

Recommendation is crucial in both academia and industry, and various tec...
research
08/09/2022

Multi-Task Fusion via Reinforcement Learning for Long-Term User Satisfaction in Recommender Systems

Recommender System (RS) is an important online application that affects ...
research
06/14/2020

Multi-Purchase Behavior: Modeling and Optimization

We study the problem of modeling purchase of multiple items and utilizin...
research
04/01/2019

Enhancing the long-term performance of recommender system

Recommender system is a critically important tool in online commercial s...
research
01/18/2022

Leaving No One Behind: A Multi-Scenario Multi-Task Meta Learning Approach for Advertiser Modeling

Advertisers play an essential role in many e-commerce platforms like Tao...
research
10/19/2020

Paid and hypothetical time preferences are the same: Lab, field and online evidence

The use of hypothetical instead of real decision-making incentives remai...

Please sign up or login with your details

Forgot password? Click here to reset