Sample Efficient Learning of Path Following and Obstacle Avoidance Behavior for Quadrotors

06/28/2019
by Stefan Stevsic, et al.

In this paper, we propose an algorithm for training neural network control policies for quadrotors. The learned control policy computes control commands directly from sensor inputs and is hence computationally efficient. An imitation learning algorithm produces a policy that reproduces the behavior of a path-following control algorithm with collision avoidance. Due to the generalization ability of neural networks, the resulting policy performs local collision avoidance of unseen obstacles while following a global reference path. The algorithm uses a time-free model predictive path-following controller as a supervisor. The controller generates demonstrations by following a few example paths. This enables an easy-to-implement learning algorithm that is robust to errors in the model used by the model predictive controller. The policy is trained on the real quadrotor, which requires collision-free exploration around the example path. An adapted version of the supervisor is used to enable this exploration. Thus, the policy can be trained from a relatively small number of examples on the real quadrotor, making the training sample efficient.
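
The supervisor-driven imitation scheme described above can be illustrated with a toy sketch. It is not the paper's implementation. Every name in it is hypothetical, and it rests on strong simplifying assumptions: the state is a single lateral offset from the path rather than sensor inputs, a proportional controller stands in for the model predictive supervisor, and a linear least-squares fit stands in for the neural network policy. It shows only the general pattern: perturb the start state around the example path, let the supervisor label the visited states, and fit a policy to those labels.

```python
import random


def supervisor(offset, gain=0.8):
    """Hypothetical stand-in for the MPC supervisor: steer back toward the path."""
    return -gain * offset


def collect_demonstrations(num_rollouts=20, steps=15, spread=0.5, seed=0):
    """Roll out the supervisor from perturbed starts and record (state, command) pairs."""
    rng = random.Random(seed)
    data = []
    for _ in range(num_rollouts):
        # Exploration (assumed form): start at a random offset near the path.
        offset = rng.uniform(-spread, spread)
        for _ in range(steps):
            command = supervisor(offset)
            data.append((offset, command))
            # Toy dynamics (assumed): the command shifts the offset, plus a small disturbance.
            offset = offset + command + rng.gauss(0.0, 0.05)
    return data


def fit_linear_policy(data):
    """Least-squares fit of command = w * offset + b to the demonstrations."""
    n = len(data)
    mean_x = sum(x for x, _ in data) / n
    mean_u = sum(u for _, u in data) / n
    var_x = sum((x - mean_x) ** 2 for x, _ in data)
    cov_xu = sum((x - mean_x) * (u - mean_u) for x, u in data)
    w = cov_xu / var_x
    b = mean_u - w * mean_x
    return w, b


data = collect_demonstrations()
w, b = fit_linear_policy(data)


def policy(offset):
    """Learned policy: imitates the supervisor on the states it visited."""
    return w * offset + b
```

Because the stand-in supervisor is itself linear, the fitted policy recovers it almost exactly. A real setup would replace the offset with sensor readings and the linear fit with a neural network regressor.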
