Robustifying Reinforcement Learning Policies with ℒ_1 Adaptive Control

06/04/2021
by   Yikun Cheng, et al.
1

A reinforcement learning (RL) policy trained in a nominal environment could fail in a new/perturbed environment due to the existence of dynamic variations. Existing robust methods try to obtain a fixed policy for all envisioned dynamic variation scenarios through robust or adversarial training. These methods could lead to conservative performance due to emphasis on the worst case, and often involve tedious modifications to the training environment. We propose an approach to robustifying a pre-trained non-robust RL policy with ℒ_1 adaptive control. Leveraging the capability of an ℒ_1 control law in the fast estimation of and active compensation for dynamic variations, our approach can significantly improve the robustness of an RL policy trained in a standard (i.e., non-robust) way, either in a simulator or in the real world. Numerical experiments are provided to validate the efficacy of the proposed approach.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/03/2021

Improving the Robustness of Reinforcement Learning Policies with ℒ_1 Adaptive Control

A reinforcement learning (RL) control policy trained in a nominal enviro...
research
01/27/2023

Single-Trajectory Distributionally Robust Reinforcement Learning

As a framework for sequential decision-making, Reinforcement Learning (R...
research
02/14/2022

Robust Policy Learning over Multiple Uncertainty Sets

Reinforcement learning (RL) agents need to be robust to variations in sa...
research
02/15/2022

User-Oriented Robust Reinforcement Learning

Recently, improving the robustness of policies across different environm...
research
06/18/2019

Robust Reinforcement Learning for Continuous Control with Model Misspecification

We provide a framework for incorporating robustness -- to perturbations ...
research
09/23/2022

Quantification before Selection: Active Dynamics Preference for Robust Reinforcement Learning

Training a robust policy is critical for policy deployment in real-world...
research
01/26/2023

Policy Optimization with Robustness Certificates

We present a policy optimization framework in which the learned policy c...

Please sign up or login with your details

Forgot password? Click here to reset