Adversarial Skill Chaining for Long-Horizon Robot Manipulation via Terminal State Regularization

11/15/2021
by   Youngwoon Lee, et al.
9

Skill chaining is a promising approach for synthesizing complex behaviors by sequentially combining previously learned skills. Yet, a naive composition of skills fails when a policy encounters a starting state never seen during its training. For successful skill chaining, prior approaches attempt to widen the policy's starting state distribution. However, these approaches require larger state distributions to be covered as more policies are sequenced, and thus are limited to short skill sequences. In this paper, we propose to chain multiple policies without excessively large initial state distributions by regularizing the terminal state distributions in an adversarial learning framework. We evaluate our approach on two complex long-horizon manipulation tasks of furniture assembly. Our results have shown that our method establishes the first model-free reinforcement learning algorithm to solve these tasks; whereas prior skill chaining approaches fail. The code and videos are available at https://clvrai.com/skill-chaining

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/22/2020

Accelerating Reinforcement Learning with Learned Skill Priors

Intelligent agents rely heavily on prior experience when learning a new ...
research
09/06/2022

Multi-skill Mobile Manipulation for Object Rearrangement

We study a modular approach to tackle long-horizon mobile manipulation t...
research
07/31/2023

Value-Informed Skill Chaining for Policy Learning of Long-Horizon Tasks with Surgical Robot

Reinforcement learning is still struggling with solving long-horizon sur...
research
11/04/2021

Value Function Spaces: Skill-Centric State Abstractions for Long-Horizon Reasoning

Reinforcement learning can train policies that effectively perform compl...
research
04/01/2023

Adaptive Skill Coordination for Robotic Mobile Manipulation

We present Adaptive Skill Coordination (ASC) - an approach for accomplis...
research
05/12/2021

Learning a Skill-sequence-dependent Policy for Long-horizon Manipulation Tasks

In recent years, the robotics community has made substantial progress in...
research
11/30/2017

Learning to Compose Skills

We present a differentiable framework capable of learning a wide variety...

Please sign up or login with your details

Forgot password? Click here to reset