Deep Reinforcement Learning: Framework, Applications, and Embedded Implementations

10/10/2017
by   Hongjia Li, et al.
0

The recent breakthroughs of deep reinforcement learning (DRL) technique in Alpha Go and playing Atari have set a good example in handling large state and actions spaces of complicated control problems. The DRL technique is comprised of (i) an offline deep neural network (DNN) construction phase, which derives the correlation between each state-action pair of the system and its value function, and (ii) an online deep Q-learning phase, which adaptively derives the optimal action and updates value estimates. In this paper, we first present the general DRL framework, which can be widely utilized in many applications with different optimization objectives. This is followed by the introduction of three specific applications: the cloud computing resource allocation problem, the residential smart grid task scheduling problem, and building HVAC system optimal control problem. The effectiveness of the DRL technique in these three cyber-physical applications have been validated. Finally, this paper investigates the stochastic computing-based hardware implementations of the DRL framework, which consumes a significant improvement in area efficiency and power consumption compared with binary-based implementation counterparts.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/26/2023

A Constraint Enforcement Deep Reinforcement Learning Framework for Optimal Energy Storage Systems Dispatch

The optimal dispatch of energy storage systems (ESSs) presents formidabl...
research
06/21/2018

A New Approach for Resource Scheduling with Deep Reinforcement Learning

With the rapid development of deep learning, deep reinforcement learning...
research
01/24/2020

Stacked Auto Encoder Based Deep Reinforcement Learning for Online Resource Scheduling in Large-Scale MEC Networks

An online resource scheduling framework is proposed for minimizing the s...
research
04/20/2023

TempoRL: laser pulse temporal shape optimization with Deep Reinforcement Learning

High Power Laser's (HPL) optimal performance is essential for the succes...
research
04/09/2022

MR-iNet Gym: Framework for Edge Deployment of Deep Reinforcement Learning on Embedded Software Defined Radio

Dynamic resource allocation plays a critical role in the next generation...
research
01/28/2018

Deep Reinforcement Learning for Dynamic Treatment Regimes on Medical Registry Data

This paper presents the first deep reinforcement learning (DRL) framewor...
research
05/21/2017

Shallow Updates for Deep Reinforcement Learning

Deep reinforcement learning (DRL) methods such as the Deep Q-Network (DQ...

Please sign up or login with your details

Forgot password? Click here to reset