Binarized P-Network: Deep Reinforcement Learning of Robot Control from Raw Images on FPGA

09/10/2021
by   Yuki Kadokawa, et al.
0

This paper explores a Deep Reinforcement Learning (DRL) approach for designing image-based control for edge robots to be implemented on Field Programmable Gate Arrays (FPGAs). Although FPGAs are more power-efficient than CPUs and GPUs, a typical DRL method cannot be applied since they are composed of many Logic Blocks (LBs) for high-speed logical operations but low-speed real-number operations. To cope with this problem, we propose a novel DRL algorithm called Binarized P-Network (BPN), which learns image-input control policies using Binarized Convolutional Neural Networks (BCNNs). To alleviate the instability of reinforcement learning caused by a BCNN with low function approximation accuracy, our BPN adopts a robust value update scheme called Conservative Value Iteration, which is tolerant of function approximation errors. We confirmed the BPN's effectiveness through applications to a visual tracking task in simulation and real-robot experiments with FPGA.

READ FULL TEXT

page 1

page 6

page 7

research
10/16/2019

Creativity in Robot Manipulation with Deep Reinforcement Learning

Deep Reinforcement Learning (DRL) has emerged as a powerful control tech...
research
09/01/2019

Approximating two value functions instead of one: towards characterizing a new family of Deep Reinforcement Learning algorithms

This paper makes one step forward towards characterizing a new family of...
research
12/01/2015

Efficient Edge Detection on Low-Cost FPGAs

Improving the efficiency of edge detection in embedded applications, suc...
research
09/30/2018

Deep Quality-Value (DQV) Learning

We introduce a novel Deep Reinforcement Learning (DRL) algorithm called ...
research
08/08/2019

Learning to Grasp from 2.5D images: a Deep Reinforcement Learning Approach

In this paper, we propose a deep reinforcement learning (DRL) solution t...
research
07/20/2020

A Deep Learning-Based FPGA Function Block Detection Method with Bitstream to Image Transformation

In the context of various application scenarios and/or for the sake of s...
research
02/25/2022

Consolidated Adaptive T-soft Update for Deep Reinforcement Learning

Demand for deep reinforcement learning (DRL) is gradually increased to e...

Please sign up or login with your details

Forgot password? Click here to reset