DOB-Net: Actively Rejecting Unknown Excessive Time-Varying Disturbances

07/10/2019
by Tianming Wang, et al.

This paper presents an observer-integrated Reinforcement Learning (RL) approach, called Disturbance OBserver Network (DOB-Net), for robots operating in environments where disturbances are unknown and time-varying, and may frequently exceed the robot's control capabilities. The DOB-Net integrates a disturbance dynamics observer network and a controller network. Originating from classical DOB mechanisms, the observer is built and enhanced via Recurrent Neural Networks (RNNs), which encode estimates of past values and predictions of future values of the unknown disturbances in the RNN hidden state. This encoding allows the controller to generate optimal control signals that actively reject disturbances, under the constraints of the robot's control capabilities. The observer and the controller are jointly learned within policy optimization by advantage actor-critic. Numerical simulations on position regulation tasks demonstrate that the proposed DOB-Net significantly outperforms a canonical feedback controller and classical RL algorithms.
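For readers unfamiliar with the classical DOB mechanism that the abstract says the observer originates from, the following is a minimal sketch of that baseline idea only, not of DOB-Net itself. It assumes a hypothetical 1-D point mass, a sinusoidal disturbance, PD gains, an actuator limit `u_max`, and a first-order low-pass filter as the observer; none of these values come from the paper. In DOB-Net, per the abstract, the hand-designed filter is replaced by an RNN whose hidden state feeds a learned controller.

```python
import math


def simulate(use_dob, steps=4000, dt=0.005, mass=1.0, u_max=2.0,
             kp=20.0, kd=8.0, alpha=0.1):
    """Regulate a 1-D point mass to x = 0 under a slowly varying disturbance.

    All parameters are illustrative assumptions. Returns the mean absolute
    position error over the second half of the run.
    """
    x, v = 0.5, 0.0   # initial position offset and velocity
    d_hat = 0.0       # observer's current disturbance estimate
    err_sum, count = 0.0, 0
    for k in range(steps):
        t = k * dt
        # Hypothetical disturbance force, unknown to the controller.
        d = 1.5 * math.sin(0.5 * t)
        # PD feedback, optionally compensated by the disturbance estimate,
        # then clipped to the actuator limit.
        u = -kp * x - kd * v - (d_hat if use_dob else 0.0)
        u = max(-u_max, min(u_max, u))
        a = (u + d) / mass
        # Observer: low-pass filter of the force residual m*a - u, assuming
        # the acceleration can be measured without noise.
        d_hat += alpha * ((mass * a - u) - d_hat)
        # Semi-implicit Euler integration of the plant.
        v += a * dt
        x += v * dt
        if k >= steps // 2:
            err_sum += abs(x)
            count += 1
    return err_sum / count


if __name__ == "__main__":
    print("PD only  :", simulate(use_dob=False))
    print("PD + DOB :", simulate(use_dob=True))
```

In this sketch the disturbance amplitude stays below `u_max`, so compensation is always feasible. The setting the abstract targets is the harder one, where the disturbance may exceed the actuator limit and subtracting the current estimate is no longer sufficient, which is the motivation given for encoding predictions of future disturbance values.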

research
12/18/2021

Model-Based Safe Reinforcement Learning with Time-Varying State and Control Constraints: An Application to Intelligent Vehicles

Recently, barrier function-based safe reinforcement learning (RL) with t...
research
07/23/2022

Epersist: A Self Balancing Robot Using PID Controller And Deep Reinforcement Learning

A two-wheeled self-balancing robot is an example of an inverse pendulum ...
research
10/23/2022

Active Learning of Discrete-Time Dynamics for Uncertainty-Aware Model Predictive Control

Model-based control requires an accurate model of the system dynamics fo...
research
08/19/2020

Kinematic Resolutions of Redundant Robot Manipulators using Integration-Enhanced RNNs

Recently, a time-varying quadratic programming (QP) framework that descr...
research
01/29/2019

Emergence of Hierarchy via Reinforcement Learning Using a Multiple Timescale Stochastic RNN

Although recurrent neural networks (RNNs) for reinforcement learning (RL...
research
01/11/2022

Learning Robust Policies for Generalized Debris Capture with an Automated Tether-Net System

Tether-net launched from a chaser spacecraft provides a promising method...
research
11/01/2019

A2: Extracting Cyclic Switchings from DOB-nets for Rejecting Excessive Disturbances

Reinforcement Learning (RL) is limited in practice by its gray-box natur...