A Spiking Neural Network Learning Markov Chain

09/20/2022
by   Mikhail Kiselev, et al.
0

In this paper, the question how spiking neural network (SNN) learns and fixes in its internal structures a model of external world dynamics is explored. This question is important for implementation of the model-based reinforcement learning (RL), the realistic RL regime where the decisions made by SNN and their evaluation in terms of reward/punishment signals may be separated by significant time interval and sequence of intermediate evaluation-neutral world states. In the present work, I formalize world dynamics as a Markov chain with unknown a priori state transition probabilities, which should be learnt by the network. To make this problem formulation more realistic, I solve it in continuous time, so that duration of every state in the Markov chain may be different and is unknown. It is demonstrated how this task can be accomplished by an SNN with specially designed structure and local synaptic plasticity rules. As an example, we show how this network motif works in the simple but non-trivial world where a ball moves inside a square box and bounces from its walls with a random new direction and velocity.

READ FULL TEXT
research
04/09/2022

A Spiking Neural Network Structure Implementing Reinforcement Learning

At present, implementation of learning mechanisms in spiking neural netw...
research
02/24/2022

Evolving-to-Learn Reinforcement Learning Tasks with Spiking Neural Networks

Inspired by the natural nervous system, synaptic plasticity rules are ap...
research
12/29/2017

Non-linear motor control by local learning in spiking neural networks

Learning weights in a spiking neural network with hidden neurons, using ...
research
01/16/2012

A Spiking Neural Learning Classifier System

Learning Classifier Systems (LCS) are population-based reinforcement lea...
research
07/16/2023

POMDP inference and robust solution via deep reinforcement learning: An application to railway optimal maintenance

Partially Observable Markov Decision Processes (POMDPs) can model comple...
research
10/23/2020

Quantum Superposition Spiking Neural Network

Quantum brain as a novel hypothesis states that some non-trivial mechani...
research
11/25/2021

Continuous-time Markov chain as a generic trait-based evolutionary model

More than ever, today we are left with the abundance of molecular data o...

Please sign up or login with your details

Forgot password? Click here to reset