Lyapunov Design for Robust and Efficient Robotic Reinforcement Learning

08/13/2022
by   Tyler Westenbroek, et al.
0

Recent advances in the reinforcement learning (RL) literature have enabled roboticists to automatically train complex policies in simulated environments. However, due to the poor sample complexity of these methods, solving reinforcement learning problems using real-world data remains a challenging problem. This paper introduces a novel cost-shaping method which aims to reduce the number of samples needed to learn a stabilizing controller. The method adds a term involving a control Lyapunov function (CLF) – an `energy-like' function from the model-based control literature – to typical cost formulations. Theoretical results demonstrate the new costs lead to stabilizing controllers when smaller discount factors are used, which is well-known to reduce sample complexity. Moreover, the addition of the CLF term `robustifies' the search for a stabilizing controller by ensuring that even highly sub-optimal polices will stabilize the system. We demonstrate our approach with two hardware examples where we learn stabilizing controllers for a cartpole and an A1 quadruped with only seconds and a few minutes of fine-tuning data, respectively.

READ FULL TEXT
research
12/31/2021

Importance of Empirical Sample Complexity Analysis for Offline Reinforcement Learning

We hypothesize that empirically studying the sample complexity of offlin...
research
08/16/2022

A Walk in the Park: Learning to Walk in 20 Minutes With Model-Free Reinforcement Learning

Deep reinforcement learning is a promising approach to learning policies...
research
04/19/2023

Sample-efficient Model-based Reinforcement Learning for Quantum Control

We propose a model-based reinforcement learning (RL) approach for noisy ...
research
05/21/2018

Data-Efficient Hierarchical Reinforcement Learning

Hierarchical reinforcement learning (HRL) is a promising approach to ext...
research
11/07/2020

Leveraging Forward Model Prediction Error for Learning Control

Learning for model based control can be sample-efficient and generalize ...
research
03/06/2023

Value Guided Exploration with Sub-optimal Controllers for Learning Dexterous Manipulation

Recently, reinforcement learning has allowed dexterous manipulation skil...
research
01/11/2019

Low Level Control of a Quadrotor with Deep Model-Based Reinforcement learning

Generating low-level robot controllers often requires manual parameters ...

Please sign up or login with your details

Forgot password? Click here to reset