Learning to Sequence and Blend Robot Skills via Differentiable Optimization

06/01/2022
by   Noémie Jaquier, et al.
0

In contrast to humans and animals who naturally execute seamless motions, learning and smoothly executing sequences of actions remains a challenge in robotics. This paper introduces a novel skill-agnostic framework that learns to sequence and blend skills based on differentiable optimization. Our approach encodes sequences of previously-defined skills as quadratic programs (QP), whose parameters determine the relative importance of skills along the task. Seamless skill sequences are then learned from demonstrations by exploiting differentiable optimization layers and a tailored loss formulated from the QP optimality conditions. Via the use of differentiable optimization, our work offers novel perspectives on multitask control. We validate our approach in a pick-and-place scenario with planar robots, a pouring experiment with a real humanoid robot, and a bimanual sweeping task with a human model.

READ FULL TEXT
research
03/30/2021

Inferring the Geometric Nullspace of Robot Skills from Human Demonstrations

In this paper we present a framework to learn skills from human demonstr...
research
03/26/2021

Robot Program Parameter Inference via Differentiable Shadow Program Inversion

Challenging manipulation tasks can be solved effectively by combining in...
research
03/26/2021

SKID RAW: Skill Discovery from Raw Trajectories

Integrating robots in complex everyday environments requires a multitude...
research
09/09/2021

Learning Forceful Manipulation Skills from Multi-modal Human Demonstrations

Learning from Demonstration (LfD) provides an intuitive and fast approac...
research
11/30/2017

Learning to Compose Skills

We present a differentiable framework capable of learning a wide variety...
research
03/01/2022

Capability-based Frameworks for Industrial Robot Skills: a Survey

The research community is puzzled with words like skill, action, atomic ...
research
10/28/2021

Similarity-Aware Skill Reproduction based on Multi-Representational Learning from Demonstration

Learning from Demonstration (LfD) algorithms enable humans to teach new ...

Please sign up or login with your details

Forgot password? Click here to reset