Synthesizing Neural Network Controllers with Probabilistic Model based Reinforcement Learning

We present an algorithm for rapidly learning controllers for robotic systems. The algorithm follows the model-based reinforcement learning paradigm and improves upon two existing algorithms: Probabilistic Inference for Learning Control (PILCO) and a sample-based version of PILCO with neural network dynamics (Deep-PILCO). We propose training a neural network dynamics model using variational dropout with truncated log-normal noise. This yields a dynamics model with calibrated uncertainty, which can be used to simulate controller executions via rollouts. We also describe a set of techniques, inspired by viewing PILCO as a recurrent neural network model, that are crucial for improving the convergence of the method. We test our method on a variety of benchmark tasks, demonstrating data efficiency that is competitive with PILCO while being able to optimize complex neural network controllers. Finally, we assess the performance of the algorithm for learning motor controllers for a six-legged autonomous underwater vehicle. This demonstrates the algorithm's potential to scale to higher dimensionality and larger datasets in more complex control tasks.
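The rollout idea described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the function names, network shapes, noise parameters, truncation bounds, and quadratic cost are all assumptions chosen for illustration, and the weights are random rather than trained. Each particle draws one multiplicative noise mask whose logarithm is a truncated normal, keeps it for the whole rollout, and is pushed through a one-hidden-layer dynamics model under a simple tanh policy.

```python
import numpy as np


def sample_truncated_log_normal(rng, mu, sigma, low, high, size):
    """Draw noise whose logarithm is Normal(mu, sigma) truncated to [low, high].

    Uses plain rejection sampling; bounds and parameters are illustrative.
    """
    log_noise = rng.normal(mu, sigma, size)
    bad = (log_noise < low) | (log_noise > high)
    while bad.any():
        # Redraw only the entries that fell outside the truncation interval.
        log_noise[bad] = rng.normal(mu, sigma, int(bad.sum()))
        bad = (log_noise < low) | (log_noise > high)
    return np.exp(log_noise)


def rollout(rng, x0, policy_w, w1, w2, horizon=20, n_particles=50):
    """Simulate particles through a noisy dynamics model (hypothetical shapes).

    policy_w: (state_dim, action_dim)
    w1:       (state_dim + action_dim, hidden)
    w2:       (hidden, state_dim)
    """
    hidden = w1.shape[1]
    x = np.tile(x0, (n_particles, 1))
    # One noise mask per particle, held fixed over the rollout (an assumption).
    noise = sample_truncated_log_normal(
        rng, 0.0, 0.3, -1.0, 0.0, (n_particles, hidden)
    )
    costs = []
    for _ in range(horizon):
        u = np.tanh(x @ policy_w)  # controller output per particle
        h = np.tanh(np.concatenate([x, u], axis=1) @ w1) * noise
        x = x + h @ w2  # the model predicts the change in state
        # Illustrative cost: mean squared distance of particles to the origin.
        costs.append(np.mean(np.sum(x ** 2, axis=1)))
    return x, np.array(costs)
```

In a policy-search loop, the summed per-step costs would serve as the objective to minimize with respect to the controller parameters; the spread of the particles at each step reflects the model's uncertainty.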


