Re-purposing Compact Neuronal Circuit Policies to Govern Reinforcement Learning Tasks

09/11/2018
by   Ramin M. Hasani, et al.
2

We propose an effective method for creating interpretable control agents, by re-purposing the function of a biological neural circuit model, to govern simulated and real world reinforcement learning (RL) test-beds. Inspired by the structure of the nervous system of the soil-worm, C. elegans, we introduce Neuronal Circuit Policies (NCPs) as a novel recurrent neural network instance with liquid time-constants, universal approximation capabilities and interpretable dynamics. We theoretically show that they can approximate any finite simulation time of a given continuous n-dimensional dynamical system, with n output units and some hidden units. We model instances of the policies and learn their synaptic and neuronal parameters to control standard RL tasks and demonstrate its application for autonomous parking of a real rover robot on a pre-defined trajectory. For reconfiguration of the purpose of the neural circuit, we adopt a search-based RL algorithm. We show that our neuronal circuit policies perform as good as deep neural network policies with the advantage of realizing interpretable dynamics at the cell-level. We theoretically find bounds for the time-varying dynamics of the circuits, and introduce a novel way to reason about networks' dynamics.

READ FULL TEXT
research
03/22/2018

Neuronal Circuit Policies

We propose an effective way to create interpretable control agents, by r...
research
11/01/2018

Liquid Time-constant Recurrent Neural Networks as Universal Approximators

In this paper, we introduce the notion of liquid time-constant (LTC) rec...
research
11/09/2017

Worm-level Control through Search-based Reinforcement Learning

Through natural evolution, nervous systems of organisms formed near-opti...
research
03/30/2021

Learning Deep Neural Policies with Stability Guarantees

Reinforcement learning (RL) has been successfully used to solve various ...
research
02/04/2022

Learning Interpretable, High-Performing Policies for Continuous Control Problems

Gradient-based approaches in reinforcement learning (RL) have achieved t...
research
07/31/2023

Discovering Adaptable Symbolic Algorithms from Scratch

Autonomous robots deployed in the real world will need control policies ...
research
04/30/2020

GCN-RL Circuit Designer: Transferable Transistor Sizing with Graph Neural Networks and Reinforcement Learning

Automatic transistor sizing is a challenging problem in circuit design d...

Please sign up or login with your details

Forgot password? Click here to reset