A nonlinear hidden layer enables actor-critic agents to learn multiple paired association navigation

06/25/2021
by   M Ganesh Kumar, et al.
4

Navigation to multiple cued reward locations has been increasingly used to study rodent learning. Though deep reinforcement learning agents have been shown to be able to learn the task, they are not biologically plausible. Biologically plausible classic actor-critic agents have been shown to learn to navigate to single reward locations, but which biologically plausible agents are able to learn multiple cue-reward location tasks has remained unclear. In this computational study, we show versions of classic agents that learn to navigate to a single reward location, and adapt to reward location displacement, but are not able to learn multiple paired association navigation. The limitation is overcome by an agent in which place cell and cue information are first processed by a feedforward nonlinear hidden layer with synapses to the actor and critic subject to temporal difference error-modulated plasticity. Faster learning is obtained when the feedforward layer is replaced by a recurrent reservoir network.

READ FULL TEXT

page 5

page 7

page 9

page 11

page 13

page 30

page 31

research
09/16/2009

A Convergent Online Single Time Scale Actor Critic Algorithm

Actor-Critic based approaches were among the first to address reinforcem...
research
10/28/2017

Diff-DAC: Distributed Actor-Critic for Multitask Deep Reinforcement Learning

We propose a multiagent distributed actor-critic algorithm for multitask...
research
06/07/2021

One-shot learning of paired associations by a reservoir computing model with Hebbian plasticity

One-shot learning can be achieved by algorithms and animals, but how the...
research
05/10/2023

Sequence-Agnostic Multi-Object Navigation

The Multi-Object Navigation (MultiON) task requires a robot to localize ...
research
11/27/2018

Target Driven Visual Navigation with Hybrid Asynchronous Universal Successor Representations

Being able to navigate to a target with minimal supervision and prior kn...
research
11/15/2018

Seq2Seq Mimic Games: A Signaling Perspective

We study the emergence of communication in multiagent adversarial settin...
research
10/01/2019

Augmenting learning using symmetry in a biologically-inspired domain

Invariances to translation, rotation and other spatial transformations a...

Please sign up or login with your details

Forgot password? Click here to reset