Composable Energy Policies for Reactive Motion Generation and Reinforcement Learning

05/11/2021
by   Julen Urain, et al.
0

Reactive motion generation problems are usually solved by computing actions as a sum of policies. However, these policies are independent of each other and thus, they can have conflicting behaviors when summing their contributions together. We introduce Composable Energy Policies (CEP), a novel framework for modular reactive motion generation. CEP computes the control action by optimization over the product of a set of stochastic policies. This product of policies will provide a high probability to those actions that satisfy all the components and low probability to the others. Optimizing over the product of the policies avoids the detrimental effect of conflicting behaviors between policies choosing an action that satisfies all the objectives. Besides, we show that CEP naturally adapts to the Reinforcement Learning problem allowing us to integrate, in a hierarchical fashion, any distribution as prior, from multimodal distributions to non-smooth distributions and learn a new policy given them.

READ FULL TEXT

page 1

page 6

research
10/14/2022

Hierarchical Policy Blending as Inference for Reactive Robot Control

Motion generation in cluttered, dense, and dynamic environments is a cen...
research
11/16/2018

RMPflow: A Computational Graph for Automatic Motion Policy Generation

We develop a novel policy synthesis algorithm, RMPflow, based on geometr...
research
03/31/2016

Reactive Policies with Planning for Action Languages

We describe a representation in a high-level transition system for polic...
research
07/25/2020

RMPflow: A Geometric Framework for Generation of Multi-Task Motion Policies

Generating robot motion for multiple tasks in dynamic environments is ch...
research
09/10/2020

A framework for reinforcement learning with autocorrelated actions

The subject of this paper is reinforcement learning. Policies are consid...
research
02/22/2022

Reward-Free Policy Space Compression for Reinforcement Learning

In reinforcement learning, we encode the potential behaviors of an agent...
research
09/20/2022

Towards Task-Prioritized Policy Composition

Combining learned policies in a prioritized, ordered manner is desirable...

Please sign up or login with your details

Forgot password? Click here to reset