Configuration Path Control

04/05/2022
by   Sergey Pankov, et al.
0

Reinforcement learning methods often produce brittle policies – policies that perform well during training, but generalize poorly beyond their direct training experience, thus becoming unstable under small disturbances. To address this issue, we propose a method for stabilizing a control policy in the space of configuration paths. It is applied post-training and relies purely on the data produced during training, as well as on an instantaneous control-matrix estimation. The approach is evaluated empirically on a planar bipedal walker subjected to a variety of perturbations. The control policies obtained via reinforcement learning are compared against their stabilized counterparts. Across different experiments, we find two- to four-fold increase in stability, when measured in terms of the perturbation amplitudes. We also provide a zero-dynamics interpretation of our approach.

READ FULL TEXT
research
03/29/2019

Mesh-based Tools to Analyze Deep Reinforcement Learning Policies for Underactuated Biped Locomotion

In this paper, we present a mesh-based approach to analyze stability and...
research
11/02/2021

Koopman Q-learning: Offline Reinforcement Learning via Symmetries of Dynamics

Offline reinforcement learning leverages large datasets to train policie...
research
05/07/2020

Cascade Attribute Network: Decomposing Reinforcement Learning Control Policies using Hierarchical Neural Networks

Reinforcement learning methods have been developed to achieve great succ...
research
04/06/2020

Learning Stabilizing Control Policies for a Tensegrity Hopper with Augmented Random Search

In this paper, we consider tensegrity hopper - a novel tensegrity-based ...
research
10/24/2022

Understanding the Evolution of Linear Regions in Deep Reinforcement Learning

Policies produced by deep reinforcement learning are typically character...
research
08/08/2023

Online identification and control of PDEs via Reinforcement Learning methods

We focus on the control of unknown Partial Differential Equations (PDEs)...
research
08/01/2022

Model-based graph reinforcement learning for inductive traffic signal control

Most reinforcement learning methods for adaptive-traffic-signal-control ...

Please sign up or login with your details

Forgot password? Click here to reset