Adaptive Guidance with Reinforcement Meta-Learning

01/12/2019
by Brian Gaudet, et al.

This paper proposes a novel adaptive guidance system developed using reinforcement meta-learning with a recurrent policy and value function approximator. The use of recurrent network layers allows the deployed policy to adapt in real time to environmental forces acting on the agent. We compare the performance of the DR/DV guidance law, an RL agent with a non-recurrent policy, and an RL agent with a recurrent policy on four difficult tasks with unknown but highly variable dynamics. These tasks include a safe Mars landing with random engine failure and a landing on an asteroid with unknown environmental dynamics. We also demonstrate the ability of a recurrent policy to navigate using only Doppler radar altimeter returns, thus integrating guidance and navigation.
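
To illustrate why recurrence can matter here, the following is a minimal sketch of a recurrent policy step, assuming a simple Elman-style cell written in NumPy. The class name `RecurrentPolicy`, the layer sizes, and the random untrained weights are all hypothetical and are not taken from the paper; the sketch only shows that the hidden state carries information from earlier observations, so the same observation can produce different actions depending on history.

```python
import numpy as np


class RecurrentPolicy:
    """Toy Elman-style recurrent policy (hypothetical, not the paper's network)."""

    def __init__(self, obs_dim, hidden_dim, act_dim, seed=0):
        rng = np.random.default_rng(seed)
        # Random, untrained weights: this sketch shows data flow only.
        self.W_in = rng.normal(0.0, 0.5, (hidden_dim, obs_dim))
        self.W_rec = rng.normal(0.0, 0.5, (hidden_dim, hidden_dim))
        self.W_out = rng.normal(0.0, 0.5, (act_dim, hidden_dim))
        self.hidden_dim = hidden_dim

    def initial_state(self):
        # Hidden state at the start of an episode.
        return np.zeros(self.hidden_dim)

    def step(self, obs, hidden):
        # The new hidden state mixes the current observation with the
        # previous hidden state, which summarizes the history so far.
        hidden = np.tanh(self.W_in @ obs + self.W_rec @ hidden)
        # Bounded action (e.g. a normalized thrust command, by assumption).
        action = np.tanh(self.W_out @ hidden)
        return action, hidden


policy = RecurrentPolicy(obs_dim=6, hidden_dim=16, act_dim=3)
obs = np.full(6, 0.1)  # the same observation is fed twice

h0 = policy.initial_state()
a1, h1 = policy.step(obs, h0)
a2, h2 = policy.step(obs, h1)  # history differs, so the action can differ

# Resetting the hidden state discards the history and reproduces a1.
a_reset, _ = policy.step(obs, policy.initial_state())
```

A non-recurrent policy corresponds to always calling `step` with the initial state, which is why it cannot distinguish two episodes that present the same observation under different dynamics.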


research
07/13/2019

Seeker based Adaptive Guidance via Reinforcement Meta-Learning Applied to Asteroid Close Proximity Operations

Current practice for asteroid close proximity maneuvers requires extreme...
research
04/15/2019

Learning to Guide: Guidance Law Based on Deep Meta-learning and Model Predictive Path Integral Control

In this paper, we present a novel guidance scheme based on model-based d...
research
11/16/2019

Six Degree-of-Freedom Hovering using LIDAR Altimetry via Reinforcement Meta-Learning

We optimize a six degrees of freedom hovering policy using reinforcement...
research
04/29/2022

Line of Sight Curvature for Missile Guidance using Reinforcement Meta-Learning

We use reinforcement meta learning to optimize a line of sight curvature...
research
03/17/2022

Meta Reinforcement Learning for Adaptive Control: An Offline Approach

Meta-learning is a branch of machine learning which trains neural networ...
research
11/23/2022

Stackelberg Meta-Learning for Strategic Guidance in Multi-Robot Trajectory Planning

Guided cooperation is a common task in many multi-agent teaming applicat...
research
10/01/2021

Terminal Adaptive Guidance for Autonomous Hypersonic Strike Weapons via Reinforcement Learning

An adaptive guidance system suitable for the terminal phase trajectory o...
