Weighted Maximum Likelihood for Controller Tuning

10/20/2022
by   Angel Romero, et al.
0

Recently, Model Predictive Contouring Control (MPCC) has arisen as the state-of-the-art approach for model-based agile flight. MPCC benefits from great flexibility in trading-off between progress maximization and path following at runtime without relying on globally optimized trajectories. However, finding the optimal set of tuning parameters for MPCC is challenging because (i) the full quadrotor dynamics are non-linear, (ii) the cost function is highly non-convex, and (iii) of the high dimensionality of the hyperparameter space. This paper leverages a probabilistic Policy Search method - Weighted Maximum Likelihood (WML)- to automatically learn the optimal objective for MPCC. WML is sample-efficient due to its closed-form solution for updating the learning parameters. Additionally, the data efficiency provided by the use of a model-based approach allows us to directly train in a high-fidelity simulator, which in turn makes our approach able to transfer zero-shot to the real world. We validate our approach in the real world, where we show that our method outperforms both the previous manually tuned controller and the state-of-the-art auto-tuning baseline reaching speeds of 75 km/h.

READ FULL TEXT

page 1

page 3

page 4

page 6

research
09/09/2021

Performance, Precision, and Payloads: Adaptive Nonlinear MPC for Quadrotors

Agile quadrotor flight in challenging environments has the potential to ...
research
08/30/2021

Model Predictive Contouring Control for Near-Time-Optimal Quadrotor Flight

We tackle the problem of flying time-optimal trajectories through multip...
research
04/15/2021

Auto-Tuned Sim-to-Real Transfer

Policies trained in simulation often fail when transferred to the real w...
research
08/15/2020

Crossing The Gap: A Deep Dive into Zero-Shot Sim-to-Real Transfer for Dynamics

Zero-shot sim-to-real transfer of tasks with complex dynamics is a highl...
research
03/06/2023

Learning Humanoid Locomotion with Transformers

We present a sim-to-real learning-based approach for real-world humanoid...
research
12/06/2022

DiffTune^+: Hyperparameter-Free Auto-Tuning using Auto-Differentiation

Controller tuning is a vital step to ensure the controller delivers its ...
research
03/07/2022

Tensor Programs V: Tuning Large Neural Networks via Zero-Shot Hyperparameter Transfer

Hyperparameter (HP) tuning in deep learning is an expensive process, pro...

Please sign up or login with your details

Forgot password? Click here to reset