Memristor Hardware-Friendly Reinforcement Learning

01/20/2020
by Nan Wu, et al.

Recently, significant progress has been made in solving sophisticated problems across various domains by using reinforcement learning (RL), which allows machines or agents to learn from interactions with environments rather than from explicit supervision. As the end of Moore's law seems imminent, emerging technologies that enable high-performance neuromorphic hardware systems are attracting increasing attention. In particular, neuromorphic architectures that use memristors, programmable and nonvolatile two-terminal devices, as synaptic weights in hardware neural networks are candidates of choice for realizing such highly energy-efficient and complex nervous systems. However, one of the challenges for memristive hardware with integrated learning capabilities is the prohibitively large number of write cycles that might be required during the learning process, and this situation is exacerbated in RL settings. In this work, we propose a memristive neuromorphic hardware implementation for the actor-critic algorithm in RL. By introducing a two-fold training procedure (i.e., ex-situ pre-training and in-situ re-training) and several training techniques, the number of weight updates can be significantly reduced, making the approach suitable for efficient in-situ learning implementations. As a case study, we consider the task of balancing an inverted pendulum, a classical problem in both RL and control theory. We believe that this study shows the promise of using memristor-based hardware neural networks for handling complex tasks through in-situ reinforcement learning.
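
To make the write-cycle concern concrete, the following is a minimal sketch of a textbook one-step actor-critic update with linear function approximation, written in plain Python. It is not the hardware implementation or the training procedure proposed in the paper. The class name `LinearActorCritic`, the `write_count` counter, and all hyperparameter values are assumptions introduced here for illustration. The counter tallies every individual weight change, which is a rough software stand-in for a memristor write.

```python
import math


def softmax(prefs):
    """Numerically stable softmax over a list of action preferences."""
    m = max(prefs)
    exps = [math.exp(p - m) for p in prefs]
    total = sum(exps)
    return [e / total for e in exps]


class LinearActorCritic:
    """One-step actor-critic with linear features (illustrative sketch only)."""

    def __init__(self, n_features, n_actions,
                 alpha_actor=0.05, alpha_critic=0.1, gamma=0.9):
        # Actor weights: one row of feature weights per action.
        self.theta = [[0.0] * n_features for _ in range(n_actions)]
        # Critic weights: linear state-value estimate.
        self.w = [0.0] * n_features
        self.alpha_actor = alpha_actor
        self.alpha_critic = alpha_critic
        self.gamma = gamma
        # Hypothetical metric: number of individual weight writes so far.
        self.write_count = 0

    def value(self, x):
        return sum(wi * xi for wi, xi in zip(self.w, x))

    def policy(self, x):
        prefs = [sum(t * xi for t, xi in zip(row, x)) for row in self.theta]
        return softmax(prefs)

    def update(self, x, action, reward, x_next, done):
        """Apply one TD(0) actor-critic step and return the TD error."""
        target = reward + (0.0 if done else self.gamma * self.value(x_next))
        delta = target - self.value(x)
        probs = self.policy(x)  # computed before any weight changes

        # Critic update: w <- w + alpha_c * delta * x
        for i, xi in enumerate(x):
            step = self.alpha_critic * delta * xi
            if step != 0.0:
                self.w[i] += step
                self.write_count += 1

        # Actor update: theta_b <- theta_b + alpha_a * delta * (1[b == a] - pi(b)) * x
        for b, row in enumerate(self.theta):
            coeff = (1.0 if b == action else 0.0) - probs[b]
            for i, xi in enumerate(x):
                step = self.alpha_actor * delta * coeff * xi
                if step != 0.0:
                    row[i] += step
                    self.write_count += 1
        return delta
```

In this sketch every environment step can touch every nonzero-gradient weight, so `write_count` grows with both the number of steps and the network size. That growth is the kind of cost that a reduced-update training scheme would aim to cut.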

