H_inf Model-free Reinforcement Learning with Robust Stability Guarantee

11/07/2019
by   Minghao Han, et al.
0

Reinforcement learning is showing great potentials in robotics applications, including autonomous driving, robot manipulation and locomotion. However, with complex uncertainties in the real-world environment, it is difficult to guarantee the successful generalization and sim-to-real transfer of learned policies theoretically. In this paper, we introduce and extend the idea of robust stability and H_∞ control to design policies with both stability and robustness guarantee. Specifically, a sample-based approach for analyzing the Lyapunov stability and performance robustness of a learning-based control system is proposed. Based on the theoretical results, a maximum entropy algorithm is developed for searching Lyapunov function and designing a policy with provable robust stability guarantee. Without any specific domain knowledge, our method can find a policy that is robust to various uncertainties and generalizes well to different test environments. In our experiments, we show that our method achieves better robustness to both large impulsive disturbances and parametric variations in the environment than the state-of-art results in both robust and generic RL, as well as classic control. Anonymous code is available to reproduce the experimental results at https://github.com/RobustStabilityGuaranteeRL/RobustStabilityGuaranteeRL.

READ FULL TEXT
research
11/07/2019

H_∞ Model-free Reinforcement Learning with Robust Stability Guarantee

Reinforcement learning is showing great potentials in robotics applicati...
research
07/07/2022

Retro-RL: Reinforcing Nominal Controller With Deep Reinforcement Learning for Tilting-Rotor Drones

Studies that broaden drone applications into complex tasks require a sta...
research
07/11/2021

Stabilizing Neural Control Using Self-Learned Almost Lyapunov Critics

The lack of stability guarantee restricts the practical use of learning-...
research
12/05/2020

Mixed robustness: Analysis of systems with uncertain deterministic and random parameters using the example of linear systems

Robustness of linear systems with constant coefficients is considered. T...
research
09/15/2022

Causal Coupled Mechanisms: A Control Method with Cooperation and Competition for Complex System

Complex systems are ubiquitous in the real world and tend to have compli...
research
07/14/2021

Model-free Reinforcement Learning for Robust Locomotion Using Trajectory Optimization for Exploration

In this work we present a general, two-stage reinforcement learning appr...
research
05/31/2022

Human-AI Shared Control via Frequency-based Policy Dissection

Human-AI shared control allows human to interact and collaborate with AI...

Please sign up or login with your details

Forgot password? Click here to reset