Complementary Meta-Reinforcement Learning for Fault-Adaptive Control

09/26/2020
by   Ibrahim Ahmed, et al.
3

Faults are endemic to all systems. Adaptive fault-tolerant control maintains degraded performance when faults occur as opposed to unsafe conditions or catastrophic events. In systems with abrupt faults and strict time constraints, it is imperative for control to adapt quickly to system changes to maintain system operations. We present a meta-reinforcement learning approach that quickly adapts its control policy to changing conditions. The approach builds upon model-agnostic meta learning (MAML). The controller maintains a complement of prior policies learned under system faults. This "library" is evaluated on a system after a new fault to initialize the new policy. This contrasts with MAML, where the controller derives intermediate policies anew, sampled from a distribution of similar systems, to initialize a new policy. Our approach improves sample efficiency of the reinforcement learning process. We evaluate our approach on an aircraft fuel transfer system under abrupt faults.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/10/2020

Fault-Tolerant Control of Degrading Systems with On-Policy Reinforcement Learning

We propose a novel adaptive reinforcement learning control approach for ...
research
08/10/2020

Comparison of Model Predictive and Reinforcement Learning Methods for Fault Tolerant Control

A desirable property in fault-tolerant controllers is adaptability to sy...
research
12/10/2020

Performance-Weighed Policy Sampling for Meta-Reinforcement Learning

This paper discusses an Enhanced Model-Agnostic Meta-Learning (E-MAML) a...
research
01/19/2021

Meta-Reinforcement Learning for Adaptive Motor Control in Changing Robot Dynamics and Environments

This work developed a meta-learning approach that adapts the control pol...
research
01/10/2023

Imbalanced Classification In Faulty Turbine Data: New Proximal Policy Optimization

There is growing importance to detecting faults and implementing the bes...
research
10/12/2022

DQLAP: Deep Q-Learning Recommender Algorithm with Update Policy for a Real Steam Turbine System

In modern industrial systems, diagnosing faults in time and using the be...
research
02/11/2013

RIO: Minimizing User Interaction in Debugging of Knowledge Bases

The best currently known interactive debugging systems rely upon some me...

Please sign up or login with your details

Forgot password? Click here to reset