Log In Sign Up

Online Model-Free Reinforcement Learning for the Automatic Control of a Flexible Wing Aircraft

by   Mohammed Abouheaf, et al.

The control problem of the flexible wing aircraft is challenging due to the prevailing and high nonlinear deformations in the flexible wing system. This urged for new control mechanisms that are robust to the real-time variations in the wing's aerodynamics. An online control mechanism based on a value iteration reinforcement learning process is developed for flexible wing aerial structures. It employs a model-free control policy framework and a guaranteed convergent adaptive learning architecture to solve the system's Bellman optimality equation. A Riccati equation is derived and shown to be equivalent to solving the underlying Bellman equation. The online reinforcement learning solution is implemented using means of an adaptive-critic mechanism. The controller is proven to be asymptotically stable in the Lyapunov sense. It is assessed through computer simulations and its superior performance is demonstrated on two scenarios under different operating conditions.


Model-Free Robust Reinforcement Learning with Linear Function Approximation

This paper addresses the problem of model-free reinforcement learning fo...

Learning-based vs Model-free Adaptive Control of a MAV under Wind Gust

Navigation problems under unknown varying conditions are among the most ...

Responding to Illegal Activities Along the Canadian Coastlines Using Reinforcement Learning

This article elaborates on how machine learning (ML) can leverage the so...

Nonuniqueness and Convergence to Equivalent Solutions in Observer-based Inverse Reinforcement Learning

A key challenge in solving the deterministic inverse reinforcement learn...

Self-optimizing adaptive optics control with Reinforcement Learning for high-contrast imaging

Current and future high-contrast imaging instruments require extreme ada...

Learning Event-triggered Control from Data through Joint Optimization

We present a framework for model-free learning of event-triggered contro...