Edge-Compatible Reinforcement Learning for Recommendations

12/10/2021
by   James E. Kostas, et al.
0

Most reinforcement learning (RL) recommendation systems designed for edge computing must either synchronize during recommendation selection or depend on an unprincipled patchwork collection of algorithms. In this work, we build on asynchronous coagent policy gradient algorithms <cit.> to propose a principled solution to this problem. The class of algorithms that we propose can be distributed over the internet and run asynchronously and in real-time. When a given edge fails to respond to a request for data with sufficient speed, this is not a problem; the algorithm is designed to function and learn in the edge setting, and network issues are part of this setting. The result is a principled, theoretically grounded RL algorithm designed to be distributed in and learn in this asynchronous environment. In this work, we describe this algorithm and a proposed class of architectures in detail, and demonstrate that they work well in practice in the asynchronous setting, even as the network quality degrades.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/29/2015

Robotic Search & Rescue via Online Multi-task Reinforcement Learning

Reinforcement learning (RL) is a general and well-known method that a ro...
research
03/03/2019

Asynchronous Episodic Deep Deterministic Policy Gradient: Towards Continuous Control in Computationally Complex Environments

Deep Deterministic Policy Gradient (DDPG) has been proved to be a succes...
research
12/27/2018

Neural Model-Based Reinforcement Learning for Recommendation

There are great interests as well as many challenges in applying reinfor...
research
12/27/2018

Generative Adversarial User Model for Reinforcement Learning Based Recommendation System

There are great interests as well as many challenges in applying reinfor...
research
03/07/2018

Accelerated Methods for Deep Reinforcement Learning

Deep reinforcement learning (RL) has achieved many recent successes, yet...
research
06/29/2021

Structure-aware reinforcement learning for node-overload protection in mobile edge computing

Mobile Edge Computing (MEC) refers to the concept of placing computation...
research
02/22/2018

Asynchronous stochastic approximations with asymptotically biased errors and deep multi-agent learning

Asynchronous stochastic approximations are an important class of model-f...

Please sign up or login with your details

Forgot password? Click here to reset