Neuronal Circuit Policies

03/22/2018
by   Mathias Lechner, et al.
0

We propose an effective way to create interpretable control agents, by re-purposing the function of a biological neural circuit model, to govern simulated and real world reinforcement learning (RL) test-beds. We model the tap-withdrawal (TW) neural circuit of the nematode, C. elegans, a circuit responsible for the worm's reflexive response to external mechanical touch stimulations, and learn its synaptic and neuronal parameters as a policy for controlling basic RL tasks. We also autonomously park a real rover robot on a pre-defined trajectory, by deploying such neuronal circuit policies learned in a simulated environment. For reconfiguration of the purpose of the TW neural circuit, we adopt a search-based RL algorithm. We show that our neuronal policies perform as good as deep neural network policies with the advantage of realizing interpretable dynamics at the cell level.

READ FULL TEXT
research
09/11/2018

Re-purposing Compact Neuronal Circuit Policies to Govern Reinforcement Learning Tasks

We propose an effective method for creating interpretable control agents...
research
11/09/2017

Worm-level Control through Search-based Reinforcement Learning

Through natural evolution, nervous systems of organisms formed near-opti...
research
06/10/2020

Reinforcement Learning from a Mixture of Interpretable Experts

Reinforcement learning (RL) has demonstrated its ability to solve high d...
research
03/30/2021

Learning Deep Neural Policies with Stability Guarantees

Reinforcement learning (RL) has been successfully used to solve various ...
research
08/11/2020

Hardware as Policy: Mechanical and Computational Co-Optimization using Deep Reinforcement Learning

Deep Reinforcement Learning (RL) has shown great success in learning com...
research
08/27/2020

Controlling Level of Unconsciousness by Titrating Propofol with Deep Reinforcement Learning

Reinforcement Learning (RL) can be used to fit a mapping from patient st...
research
09/26/2021

Finite State Machine Policies Modulating Trajectory Generator

Deep reinforcement learning (deep RL) has emerged as an effective tool f...

Please sign up or login with your details

Forgot password? Click here to reset