Short-term plasticity as cause-effect hypothesis testing in distal reward learning

02/04/2014
by   Andrea Soltoggio, et al.
0

Asynchrony, overlaps and delays in sensory-motor signals introduce ambiguity as to which stimuli, actions, and rewards are causally related. Only the repetition of reward episodes helps distinguish true cause-effect relationships from coincidental occurrences. In the model proposed here, a novel plasticity rule employs short and long-term changes to evaluate hypotheses on cause-effect relationships. Transient weights represent hypotheses that are consolidated in long-term memory only when they consistently predict or cause future rewards. The main objective of the model is to preserve existing network topologies when learning with ambiguous information flows. Learning is also improved by biasing the exploration of the stimulus-response space towards actions that in the past occurred before rewards. The model indicates under which conditions beliefs can be consolidated in long-term memory, it suggests a solution to the plasticity-stability dilemma, and proposes an interpretation of the role of short-term plasticity.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/07/2020

Predicting the Transition from Short-term to Long-term Memory based on Deep Neural Network

Memory is an essential element in people's daily life based on experienc...
research
12/22/2017

Learning Based on CC1 and CC4 Neural Networks

We propose that a general learning system should have three kinds of age...
research
07/21/2023

Improve Long-term Memory Learning Through Rescaling the Error Temporally

This paper studies the error metric selection for long-term memory learn...
research
07/19/2023

Impatient Bandits: Optimizing Recommendations for the Long-Term Without Delay

Recommender systems are a ubiquitous feature of online platforms. Increa...
research
09/01/2020

From Clicks to Conversions: Recommendation for long-term reward

Recommender systems are often optimised for short-term reward: a recomme...
research
02/24/2021

Synthetic Returns for Long-Term Credit Assignment

Since the earliest days of reinforcement learning, the workhorse method ...
research
08/17/2020

Online Multitask Learning with Long-Term Memory

We introduce a novel online multitask setting. In this setting each task...

Please sign up or login with your details

Forgot password? Click here to reset