Biologically Plausible Variational Policy Gradient with Spiking Recurrent Winner-Take-All Networks

10/21/2022
by   Zhile Yang, et al.
0

One stream of reinforcement learning research is exploring biologically plausible models and algorithms to simulate biological intelligence and fit neuromorphic hardware. Among them, reward-modulated spike-timing-dependent plasticity (R-STDP) is a recent branch with good potential in energy efficiency. However, current R-STDP methods rely on heuristic designs of local learning rules, thus requiring task-specific expert knowledge. In this paper, we consider a spiking recurrent winner-take-all network, and propose a new R-STDP method, spiking variational policy gradient (SVPG), whose local learning rules are derived from the global policy gradient and thus eliminate the need for heuristic designs. In experiments of MNIST classification and Gym InvertedPendulum, our SVPG achieves good training performance, and also presents better robustness to various kinds of noises than conventional methods.

READ FULL TEXT
research
12/17/2018

A Biologically Plausible Supervised Learning Method for Spiking Neural Networks Using the Symmetric STDP Rule

Spiking neural networks (SNNs) possess energy-efficient potential due to...
research
05/20/2022

Towards biologically plausible Dreaming and Planning

Humans and animals can learn new skills after practicing for a few hours...
research
10/27/2021

BioGrad: Biologically Plausible Gradient-Based Learning for Spiking Neural Networks

Spiking neural networks (SNN) are delivering energy-efficient, massively...
research
11/18/2021

Continuous learning of spiking networks trained with local rules

Artificial neural networks (ANNs) experience catastrophic forgetting (CF...
research
06/26/2020

Biologically Plausible Learning of Text Representation with Spiking Neural Networks

This study proposes a novel biologically plausible mechanism for generat...
research
11/01/2021

Learning Event-based Spatio-Temporal Feature Descriptors via Local Synaptic Plasticity: A Biologically-Plausible Perspective of Computer Vision

We present an optimization-based theory describing spiking cortical ense...
research
05/01/2021

Neko: a Library for Exploring Neuromorphic Learning Rules

The field of neuromorphic computing is in a period of active exploration...

Please sign up or login with your details

Forgot password? Click here to reset