Line of Sight Curvature for Missile Guidance using Reinforcement Meta-Learning

04/29/2022
by Brian Gaudet, et al.

We use reinforcement meta-learning to optimize a line of sight curvature policy that increases the effectiveness of a guidance system against maneuvering targets. The policy is implemented as a recurrent neural network that maps navigation system outputs to an Euler 321 attitude representation. This attitude representation is used to construct a direction cosine matrix that biases the observed line of sight vector. The guidance system then maps the line of sight rotation rate derived from the biased line of sight to a commanded acceleration. By varying the bias as a function of navigation system outputs, the policy enhances accuracy against highly maneuvering targets. Importantly, our method does not require an estimate of target acceleration. In our experiments, we demonstrate that when our method is combined with proportional navigation, the system significantly outperforms augmented proportional navigation with perfect knowledge of target acceleration, achieving improved accuracy with less control effort against a wide range of target maneuvers.
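The signal path described in the abstract (Euler 321 angles, then a direction cosine matrix, then a biased line of sight, then a rotation rate, then a commanded acceleration) can be illustrated with a minimal NumPy sketch. This is one possible reading, not the paper's implementation: the function names `dcm_from_euler321` and `biased_pn_accel`, the navigation gain of 3, the use of relative position and velocity as inputs, and the choice to take closing speed along the unbiased line of sight are all assumptions made for illustration. In the paper the Euler angles come from the recurrent policy; here they are simply passed in.

```python
import numpy as np


def dcm_from_euler321(yaw, pitch, roll):
    """Direction cosine matrix for a 3-2-1 (yaw, pitch, roll) rotation sequence."""
    cy, sy = np.cos(yaw), np.sin(yaw)
    cp, sp = np.cos(pitch), np.sin(pitch)
    cr, sr = np.cos(roll), np.sin(roll)
    return np.array([
        [cp * cy, cp * sy, -sp],
        [sr * sp * cy - cr * sy, sr * sp * sy + cr * cy, sr * cp],
        [cr * sp * cy + sr * sy, cr * sp * sy - sr * cy, cr * cp],
    ])


def biased_pn_accel(r_rel, v_rel, euler321, nav_gain=3.0):
    """Hypothetical helper: proportional navigation on a DCM-biased line of sight.

    r_rel, v_rel: target-minus-missile position and velocity (3-vectors).
    euler321:     (yaw, pitch, roll) bias angles; assumed to be the policy output.
    """
    r_rel = np.asarray(r_rel, dtype=float)
    v_rel = np.asarray(v_rel, dtype=float)
    rng = np.linalg.norm(r_rel)
    los = r_rel / rng                              # observed line of sight unit vector
    los_b = dcm_from_euler321(*euler321) @ los     # biased line of sight
    omega = np.cross(los_b, v_rel) / rng           # rotation rate of the biased line of sight
    v_close = -np.dot(los, v_rel)                  # closing speed (assumed: unbiased LOS)
    return nav_gain * v_close * np.cross(omega, los_b)


# Zero bias reduces to ordinary proportional navigation.
a_pn = biased_pn_accel([1000.0, 0.0, 0.0], [-300.0, 20.0, 0.0], (0.0, 0.0, 0.0))
# A small yaw bias changes the command without any target acceleration estimate.
a_biased = biased_pn_accel([1000.0, 0.0, 0.0], [-300.0, 20.0, 0.0], (0.05, 0.0, 0.0))
```

With zero bias angles the direction cosine matrix is the identity, so `a_pn` is the standard proportional navigation command; for the inputs above it is 18 units along the y axis. A nonzero bias rotates the line of sight before the rate is computed, which is the quantity the policy is described as varying.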

research
04/18/2020

Reinforcement Meta-Learning for Interception of Maneuvering Exoatmospheric Targets with Parasitic Attitude Loop

We use Reinforcement Meta-Learning to optimize an adaptive integrated gu...
research
07/30/2021

Adaptive Approach Phase Guidance for a Hypersonic Glider via Reinforcement Meta Learning

We use Reinforcement Meta Learning to optimize an adaptive guidance syst...
research
01/12/2019

Adaptive Guidance with Reinforcement Meta-Learning

This paper proposes a novel adaptive guidance system developed using rei...
research
09/08/2021

Integrated and Adaptive Guidance and Control for Endoatmospheric Missiles via Reinforcement Learning

We apply the meta reinforcement learning framework to optimize an integr...
research
11/16/2019

Six Degree-of-Freedom Hovering using LIDAR Altimetry via Reinforcement Meta-Learning

We optimize a six degrees of freedom hovering policy using reinforcement...
research
10/01/2021

Terminal Adaptive Guidance for Autonomous Hypersonic Strike Weapons via Reinforcement Learning

An adaptive guidance system suitable for the terminal phase trajectory o...
research
04/25/2019

Faster and More Accurate Learning with Meta Trace Adaptation

Learning speed and accuracy are of universal interest for reinforcement ...
