Causal Policy Gradient for Whole-Body Mobile Manipulation

05/04/2023
by   Jiaheng Hu, et al.
0

Developing the next generation of household robot helpers requires combining locomotion and interaction capabilities, which is generally referred to as mobile manipulation (MoMa). MoMa tasks are difficult due to the large action space of the robot and the common multi-objective nature of the task, e.g., efficiently reaching a goal while avoiding obstacles. Current approaches often segregate tasks into navigation without manipulation and stationary manipulation without locomotion by manually matching parts of the action space to MoMa sub-objectives (e.g. base actions for locomotion objectives and arm actions for manipulation). This solution prevents simultaneous combinations of locomotion and interaction degrees of freedom and requires human domain knowledge for both partitioning the action space and matching the action parts to the sub-objectives. In this paper, we introduce Causal MoMa, a new framework to train policies for typical MoMa tasks that makes use of the most favorable subspace of the robot's action space to address each sub-objective. Causal MoMa automatically discovers the causal dependencies between actions and terms of the reward function and exploits these dependencies in a causal policy learning procedure that reduces gradient variance compared to previous state-of-the-art policy gradient algorithms, improving convergence and results. We evaluate the performance of Causal MoMa on three types of simulated robots across different MoMa tasks and demonstrate success in transferring the policies trained in simulation directly to a real robot, where our agent is able to follow moving goals and react to dynamic obstacles while simultaneously and synergistically controlling the whole-body: base, arm, and head. More information at https://sites.google.com/view/causal-moma.

READ FULL TEXT

page 1

page 3

page 6

page 7

page 9

page 16

research
05/13/2019

Learning Novel Policies For Tasks

In this work, we present a reinforcement learning algorithm that can fin...
research
10/18/2022

Deep Whole-Body Control: Learning a Unified Policy for Manipulation and Locomotion

An attached arm can significantly increase the applicability of legged r...
research
09/18/2018

Supervised Autonomous Locomotion and Manipulation for Disaster Response with a Centaur-like Robot

Mobile manipulation tasks are one of the key challenges in the field of ...
research
11/28/2022

CLAS: Coordinating Multi-Robot Manipulation with Central Latent Action Spaces

Multi-robot manipulation tasks involve various control entities that can...
research
02/20/2021

Causal Policy Gradients

Policy gradient methods can solve complex tasks but often fail when the ...
research
06/22/2020

dm_control: Software and Tasks for Continuous Control

The dm_control software package is a collection of Python libraries and ...
research
12/15/2021

Omni-Roach: A legged robot capable of traversing multiple types of large obstacles and self-righting

Robots excel at avoiding obstacles but still struggle to traverse comple...

Please sign up or login with your details

Forgot password? Click here to reset