Six Degree-of-Freedom Hovering using LIDAR Altimetry via Reinforcement Meta-Learning

11/16/2019
by   Brian Gaudet, et al.
0

We optimize a six degrees of freedom hovering policy using reinforcement meta-learning. The policy maps flash LIDAR measurements directly to on/off spacecraft body-frame thrust commands, allowing hovering at a fixed position and attitude in the asteroid body-fixed reference frame. Importantly, the policy does not require position and velocity estimates, and can operate in environments with unknown dynamics, and without an asteroid shape model or navigation aids. Indeed, during optimization the agent is confronted with a new randomly generated asteroid for each episode, insuring that it does not learn an asteroid's shape, texture, or environmental dynamics. This allows the deployed policy to generalize well to novel asteroid characteristics, which we demonstrate in our experiments. The hovering controller has the potential to simplify mission planning by allowing asteroid body-fixed hovering immediately upon the spacecraft's arrival to an asteroid. This in turn simplifies shape model generation and allows resource mapping via remote sensing immediately upon arrival at the target asteroid.

READ FULL TEXT

page 3

page 4

page 8

research
07/13/2019

Seeker based Adaptive Guidance via Reinforcement Meta-Learning Applied to Asteroid Close Proximity Operations

Current practice for asteroid close proximity maneuvers requires extreme...
research
01/12/2019

Adaptive Guidance with Reinforcement Meta-Learning

This paper proposes a novel adaptive guidance system developed using rei...
research
04/18/2020

Reinforcement Meta-Learning for Interception of Maneuvering Exoatmospheric Targets with Parasitic Attitude Loop

We use Reinforcement Meta-Learning to optimize an adaptive integrated gu...
research
04/29/2022

Line of Sight Curvature for Missile Guidance using Reinforcement Meta-Learning

We use reinforcement meta learning to optimize a line of sight curvature...
research
10/09/2020

Characterizing Policy Divergence for Personalized Meta-Reinforcement Learning

Despite ample motivation from costly exploration and limited trajectory ...
research
07/16/2020

Collision Avoidance Robotics Via Meta-Learning (CARML)

This paper presents an approach to exploring a multi-objective reinforce...

Please sign up or login with your details

Forgot password? Click here to reset