An FPGA-Based On-Device Reinforcement Learning Approach using Online Sequential Learning

05/10/2020
by   Hirohisa Watanabe, et al.
20

DQN (Deep Q-Network) is a method to perform Q-learning for reinforcement learning using deep neural networks. DQNs require large buffers for experience reply and rely on backpropagation based iterative optimization, making them difficult to be implemented on resource-limited edge devices. In this paper, we propose a lightweight on-device reinforcement learning approach for low-cost FPGA devices. It exploits a recently proposed neural-network based on-device learning approach that does not rely on the backpropagation method but uses ELM (Extreme Learning Machine) and OS-ELM (Online Sequential ELM) based training algorithms. In addition, we propose a combination of L2 regularization and spectral normalization for the on-device reinforcement learning, so that output values of the neural networks can be fit into a certain range and the reinforcement learning becomes stable. The proposed reinforcement learning approach is designed for Xilinx PYNQ-Z1 board as a low-cost FPGA platform. The experiment results using OpenAI Gym demonstrate that the proposed algorithm and its FPGA implementation complete a CartPole-v0 task 29.76x and 126.06x faster than a conventional DQN-based approach when the number of hidden-layer nodes is 64.

READ FULL TEXT

page 8

page 10

page 11

page 12

research
02/27/2020

An On-Device Federated Learning Approach for Cooperative Anomaly Detection

Most edge AI focuses on prediction tasks on resource-limited edge device...
research
12/31/2020

Accelerating ODE-Based Neural Networks on Low-Cost FPGAs

ODENet is a deep neural network architecture in which a stacking structu...
research
04/27/2020

Demo: A Reinforcement Learning-based Flexible Duplex System for B5G with Sub-6 GHz

In this paper, we propose a reinforcement learning-based flexible duplex...
research
09/10/2023

Chasing the Intruder: A Reinforcement Learning Approach for Tracking Intruder Drones

Drones are becoming versatile in a myriad of applications. This has led ...
research
09/03/2020

Sparse Meta Networks for Sequential Adaptation and its Application to Adaptive Language Modelling

Training a deep neural network requires a large amount of single-task da...
research
03/02/2022

On-Device Learning: A Neural Network Based Field-Trainable Edge AI

In real-world edge AI applications, their accuracy is often affected by ...
research
04/11/2022

Implementing Online Reinforcement Learning with Temporal Neural Networks

A Temporal Neural Network (TNN) architecture for implementing efficient ...

Please sign up or login with your details

Forgot password? Click here to reset