User-Conditioned Neural Control Policies for Mobile Robotics

11/22/2022
by Leonard Bauersfeld, et al.

Recently, learning-based controllers have been shown to push mobile robotic systems to their limits and to provide the robustness needed for many real-world applications. However, only classical optimization-based control frameworks offer the inherent flexibility to be adjusted dynamically during execution, for example by setting target speeds or actuator limits. We present a framework that overcomes this shortcoming of neural controllers by conditioning them on an auxiliary input. This advance is enabled by including a feature-wise linear modulation (FiLM) layer. We use model-free reinforcement learning to train quadrotor control policies for the task of navigating through a sequence of waypoints in minimum time. By conditioning the policy on the maximum available thrust or on the viewing direction relative to the next waypoint, a user can regulate the aggressiveness of the quadrotor's flight during deployment. We demonstrate in simulation and in real-world experiments that a single control policy can achieve close to time-optimal flight performance across the entire performance envelope of the robot, reaching speeds of up to 60 km/h and accelerations of up to 4.5 g. The ability to guide a learned controller during task execution has implications beyond agile quadrotor flight, as conditioning the control policy on human intent helps bring learning-based systems safely out of the well-defined laboratory environment and into the wild.
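To make the conditioning mechanism concrete, the following is a minimal sketch of how a FiLM layer can inject a user input into a small policy network: the conditioning input is mapped to a per-feature scale (gamma) and shift (beta), which are applied to the hidden features. Everything here is an illustrative assumption rather than the authors' implementation: the class name `FiLMPolicy`, the layer sizes, the tanh activations, the random untrained weights, and the idea of passing a normalized thrust limit as a one-element condition vector are all hypothetical.

```python
import math
import random


def linear(weights, bias, x):
    """Affine map; `weights` holds one row of coefficients per output unit."""
    return [sum(w * xi for w, xi in zip(row, x)) + b
            for row, b in zip(weights, bias)]


def init_linear(n_out, n_in, rng, scale=0.1):
    """Small random weights and zero biases (untrained, for illustration only)."""
    weights = [[rng.gauss(0.0, scale) for _ in range(n_in)] for _ in range(n_out)]
    return weights, [0.0] * n_out


class FiLMPolicy:
    """Hypothetical two-layer policy whose hidden features are modulated
    by a user-supplied conditioning vector via FiLM."""

    def __init__(self, obs_dim, cond_dim, hidden_dim, act_dim, seed=0):
        rng = random.Random(seed)
        self.w_in, self.b_in = init_linear(hidden_dim, obs_dim, rng)
        self.w_gamma, self.b_gamma = init_linear(hidden_dim, cond_dim, rng)
        self.w_beta, self.b_beta = init_linear(hidden_dim, cond_dim, rng)
        self.w_out, self.b_out = init_linear(act_dim, hidden_dim, rng)
        # Assumption: bias gamma towards 1 so modulation starts near identity.
        self.b_gamma = [1.0] * hidden_dim

    def film(self, features, condition):
        """FiLM: scale and shift each feature using the conditioning input."""
        gamma = linear(self.w_gamma, self.b_gamma, condition)
        beta = linear(self.w_beta, self.b_beta, condition)
        return [g * f + b for g, f, b in zip(gamma, features, beta)]

    def act(self, observation, condition):
        """Map an observation and a user condition to bounded actions."""
        hidden = [math.tanh(v) for v in linear(self.w_in, self.b_in, observation)]
        hidden = [math.tanh(v) for v in self.film(hidden, condition)]
        return [math.tanh(v) for v in linear(self.w_out, self.b_out, hidden)]


# Same observation, two different user settings (hypothetical normalized thrust limits).
policy = FiLMPolicy(obs_dim=6, cond_dim=1, hidden_dim=8, act_dim=4)
obs = [0.1, -0.2, 0.3, 0.0, 0.5, -0.1]
cautious = policy.act(obs, [0.3])
aggressive = policy.act(obs, [1.0])
```

Because the condition only enters through gamma and beta, the same network weights serve every user setting. This is what would allow a single trained policy to be adjusted at deployment time without retraining.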

research
02/22/2022

A Benchmark Comparison of Learned Control Policies for Agile Quadrotor Flight

Quadrotors are highly nonlinear dynamical systems that require carefully...
research
09/18/2023

Contrastive Learning for Enhancing Robust Scene Transfer in Vision-based Agile Flight

Scene transfer for vision-based mobile robotics applications is a highly...
research
09/14/2019

Flight Controller Synthesis Via Deep Reinforcement Learning

Traditional control methods are inadequate in many deployment settings i...
research
05/04/2023

Guidance Control Networks for Time-Optimal Quadcopter Flight

Reaching fast and autonomous flight requires computationally efficient a...
research
08/11/2020

Learning Event-triggered Control from Data through Joint Optimization

We present a framework for model-free learning of event-triggered contro...
research
02/22/2023

Learning Agile Flights through Narrow Gaps with Varying Angles using Onboard Sensing

This paper addresses the problem of traversing through unknown, tilted, ...
research
07/15/2020

Developmental Reinforcement Learning of Control Policy of a Quadcopter UAV with Thrust Vectoring Rotors

In this paper, we present a novel developmental reinforcement learning-b...
