Wasserstein Gradient Flows for Optimizing Gaussian Mixture Policies

05/17/2023
by   Hanna Ziesche, et al.
10

Robots often rely on a repertoire of previously-learned motion policies for performing tasks of diverse complexities. When facing unseen task conditions or when new task requirements arise, robots must adapt their motion policies accordingly. In this context, policy optimization is the de facto paradigm to adapt robot policies as a function of task-specific objectives. Most commonly-used motion policies carry particular structures that are often overlooked in policy optimization algorithms. We instead propose to leverage the structure of probabilistic policies by casting the policy optimization as an optimal transport problem. Specifically, we focus on robot motion policies that build on Gaussian mixture models (GMMs) and formulate the policy optimization as a Wassertein gradient flow over the GMMs space. This naturally allows us to constrain the policy updates via the L^2-Wasserstein distance between GMMs to enhance the stability of the policy optimization process. Furthermore, we leverage the geometry of the Bures-Wasserstein manifold to optimize the Gaussian distributions of the GMM policy via Riemannian optimization. We evaluate our approach on common robotic settings: Reaching motions, collision-avoidance behaviors, and multi-goal tasks. Our results show that our method outperforms common policy optimization baselines in terms of task success rate and low-variance solutions.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/24/2020

Towards Coordinated Robot Motions: End-to-End Learning of Motion Policies on Transform Trees

Robotic tasks often require generation of motions that satisfy multiple ...
research
02/14/2019

Multi-Objective Policy Generation for Multi-Robot Systems Using Riemannian Motion Policies

In the multi-robot systems literature, control policies are typically ob...
research
12/04/2022

Hierarchical Policy Blending As Optimal Transport

We present hierarchical policy blending as optimal transport (HiPBOT). T...
research
09/05/2023

Task Generalization with Stability Guarantees via Elastic Dynamical System Motion Policies

Dynamical System (DS) based Learning from Demonstration (LfD) allows lea...
research
07/25/2020

RMPflow: A Geometric Framework for Generation of Multi-Task Motion Policies

Generating robot motion for multiple tasks in dynamic environments is ch...
research
05/19/2020

Riemannian Proximal Policy Optimization

In this paper, We propose a general Riemannian proximal optimization alg...
research
10/07/2019

Riemannian Motion Policy Fusion through Learnable Lyapunov Function Reshaping

RMPflow is a recently proposed policy-fusion framework based on differen...

Please sign up or login with your details

Forgot password? Click here to reset