Reinforcement Meta-Learning for Interception of Maneuvering Exoatmospheric Targets with Parasitic Attitude Loop

04/18/2020
by Brian Gaudet, et al.

We use Reinforcement Meta-Learning to optimize an adaptive integrated guidance, navigation, and control system suitable for exoatmospheric interception of a maneuvering target. The system maps observations consisting of strapdown seeker angles and rate gyro measurements directly to thruster on-off commands. Using a high-fidelity six degree-of-freedom simulator, we demonstrate that the optimized policy can adapt to parasitic effects including seeker angle measurement lag, thruster control lag, the parasitic attitude loop resulting from scale factor errors and Gaussian noise on angle and rotational velocity measurements, and a time-varying center of mass caused by fuel consumption and slosh. Importantly, the optimized policy performs well over a wide range of challenging target maneuvers. Unlike previous work that enhances range observability by inducing line-of-sight oscillations, our system is optimized to use only measurements available from the seeker and rate gyros. Through extensive Monte Carlo simulation of randomized exoatmospheric interception scenarios, we demonstrate that the optimized policy achieves performance close to that of augmented proportional navigation with perfect knowledge of the full engagement state. The optimized system is computationally efficient, requires minimal memory, and should be compatible with today's flight processors.
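
For readers unfamiliar with the benchmark named in the abstract, the following is a minimal sketch of the textbook augmented proportional navigation (APN) law in 3D vector form. It is not the authors' implementation, and it is not the learned policy, which sees only seeker angles and rate gyro outputs. The function name `apn_command`, the argument names, and the default navigation gain of 3 are assumptions made for illustration; the law itself needs the relative position, relative velocity, and target acceleration, which is the "perfect knowledge of the full engagement state" the abstract refers to.

```python
import math


def cross(a, b):
    """Cross product of two 3-vectors given as tuples."""
    return (
        a[1] * b[2] - a[2] * b[1],
        a[2] * b[0] - a[0] * b[2],
        a[0] * b[1] - a[1] * b[0],
    )


def dot(a, b):
    """Dot product of two 3-vectors given as tuples."""
    return sum(x * y for x, y in zip(a, b))


def apn_command(r_rel, v_rel, a_target, nav_gain=3.0):
    """Textbook APN acceleration command (hypothetical helper, for illustration).

    r_rel    : target position minus interceptor position
    v_rel    : target velocity minus interceptor velocity
    a_target : target acceleration, assumed known exactly
    nav_gain : navigation gain N (assumed value; typical range is 3 to 5)
    """
    r2 = dot(r_rel, r_rel)
    r_norm = math.sqrt(r2)
    r_hat = tuple(c / r_norm for c in r_rel)

    # Line-of-sight rotation vector and closing speed.
    omega = tuple(c / r2 for c in cross(r_rel, v_rel))
    v_close = -dot(r_rel, v_rel) / r_norm

    # Proportional navigation term: N * Vc * (omega x r_hat).
    pn = tuple(nav_gain * v_close * c for c in cross(omega, r_hat))

    # Augmentation term: (N / 2) times target acceleration normal to the LOS.
    a_par = dot(a_target, r_hat)
    a_perp = tuple(a - a_par * h for a, h in zip(a_target, r_hat))

    return tuple(p + 0.5 * nav_gain * a for p, a in zip(pn, a_perp))


# Example engagement: target 1 km ahead, closing at 100 m/s, drifting and
# accelerating across the line of sight.
command = apn_command((1000.0, 0.0, 0.0), (-100.0, 10.0, 0.0), (0.0, 5.0, 0.0))
```

In this example the command lies entirely across the line of sight: the proportional navigation term contributes 3 m/s² and the augmentation term 7.5 m/s², both in the direction of the target's lateral motion.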

research
09/08/2021

Integrated and Adaptive Guidance and Control for Endoatmospheric Missiles via Reinforcement Learning

We apply the meta reinforcement learning framework to optimize an integr...
research
04/29/2022

Line of Sight Curvature for Missile Guidance using Reinforcement Meta-Learning

We use reinforcement meta learning to optimize a line of sight curvature...
research
07/13/2019

Seeker based Adaptive Guidance via Reinforcement Meta-Learning Applied to Asteroid Close Proximity Operations

Current practice for asteroid close proximity maneuvers requires extreme...
research
10/01/2021

Terminal Adaptive Guidance for Autonomous Hypersonic Strike Weapons via Reinforcement Learning

An adaptive guidance system suitable for the terminal phase trajectory o...
research
11/16/2019

Six Degree-of-Freedom Hovering using LIDAR Altimetry via Reinforcement Meta-Learning

We optimize a six degrees of freedom hovering policy using reinforcement...
research
07/30/2021

Adaptive Approach Phase Guidance for a Hypersonic Glider via Reinforcement Meta Learning

We use Reinforcement Meta Learning to optimize an adaptive guidance syst...
research
12/16/2021

Integrated Guidance and Control for Lunar Landing using a Stabilized Seeker

We develop an integrated guidance and control system that in conjunction...
