Novelty Producing Synaptic Plasticity

02/10/2020
by   Anil Yaman, et al.
0

A learning process with the plasticity property often requires reinforcement signals to guide the process. However, in some tasks (e.g. maze-navigation), it is very difficult (or impossible) to measure the performance of an agent (i.e. a fitness value) to provide reinforcements since the position of the goal is not known. This requires finding the correct behavior among a vast number of possible behaviors without having the knowledge of the reinforcement signals. In these cases, an exhaustive search may be needed. However, this might not be feasible especially when optimizing artificial neural networks in continuous domains. In this work, we introduce novelty producing synaptic plasticity (NPSP), where we evolve synaptic plasticity rules to produce as many novel behaviors as possible to find the behavior that can solve the problem. We evaluate the NPSP on maze-navigation on deceptive maze environments that require complex actions and the achievement of subgoals to complete. Our results show that the search heuristic used with the proposed NPSP is indeed capable of producing much more novel behaviors in comparison with a random search taken as baseline.

READ FULL TEXT

page 3

page 4

page 7

research
03/22/2019

Learning with Delayed Synaptic Plasticity

The plasticity property of biological neural networks allows them to per...
research
02/24/2022

Evolving-to-Learn Reinforcement Learning Tasks with Spiking Neural Networks

Inspired by the natural nervous system, synaptic plasticity rules are ap...
research
07/24/2022

A Parallel Novelty Search Metaheuristic Applied to a Wildfire Prediction System

Wildfires are a highly prevalent multi-causal environmental phenomenon. ...
research
04/18/2017

Discovering Evolutionary Stepping Stones through Behavior Domination

Behavior domination is proposed as a tool for understanding and harnessi...
research
04/25/2023

Leveraging Human Feedback to Evolve and Discover Novel Emergent Behaviors in Robot Swarms

Robot swarms often exhibit emergent behaviors that are fascinating to ob...
research
03/26/2015

An Evolutionary Algorithm for Error-Driven Learning via Reinforcement

Although different learning systems are coordinated to afford complex be...

Please sign up or login with your details

Forgot password? Click here to reset