Dynamic Cloth Manipulation with Deep Reinforcement Learning

10/31/2019
by Rishabh Jangir, et al.

In this paper we present a Deep Reinforcement Learning approach to solving dynamic cloth manipulation tasks. Unlike with rigid objects, the trajectory followed (including its speed and acceleration) has a decisive influence on the final state of the cloth, which can vary greatly even when the grasped points reach the same positions. We explore how goal positions for non-grasped points can be attained by learning adequate trajectories for the grasped points. Our approach uses a few demonstrations to improve control policy learning, and a sparse reward to avoid engineering complex reward functions. Since perception of textiles is challenging, we also study different state representations to assess the minimum observation space required for learning to succeed. Finally, we compare different combinations of control policy encodings, demonstrations, and sparse reward learning techniques, and show that our proposed approach can learn dynamic cloth manipulation efficiently, i.e., using a reduced observation space, a few demonstrations, and a sparse reward.
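To make the sparse reward idea concrete, here is a minimal sketch of what such a reward could look like for goal positions of non-grasped cloth points. It is an illustration under stated assumptions, not the authors' implementation: the function name `sparse_reward`, the 0/-1 reward convention, the Euclidean distance test, and the `tolerance` value are all hypothetical choices that the abstract does not specify.

```python
import math


def sparse_reward(achieved_points, goal_points, tolerance=0.05):
    """Return 0.0 if every tracked cloth point is within `tolerance`
    of its goal position, and -1.0 otherwise.

    Assumptions (not from the paper): points are (x, y, z) tuples,
    distance is Euclidean, and the reward uses a 0 / -1 convention.
    """
    if len(achieved_points) != len(goal_points):
        raise ValueError("achieved_points and goal_points must have equal length")

    # The reward carries no shaping signal: it only reports success or failure.
    all_within = all(
        math.dist(achieved, goal) <= tolerance
        for achieved, goal in zip(achieved_points, goal_points)
    )
    return 0.0 if all_within else -1.0


# Hypothetical example: two non-grasped corner points of a cloth.
goals = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
close = [(0.0, 0.0, 0.01), (1.0, 0.0, 0.0)]
far = [(0.0, 0.0, 0.5), (1.0, 0.0, 0.0)]
print(sparse_reward(close, goals))
print(sparse_reward(far, goals))
```

Because a reward of this kind gives no gradient toward the goal, exploration alone rarely finds a successful trajectory, which is consistent with the abstract's use of a few demonstrations to improve policy learning.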

research
06/15/2021

Residual Reinforcement Learning from Demonstrations

Residual reinforcement learning (RL) has been proposed as a way to solve...
research
10/16/2019

Reinforcement Learning for Robotic Manipulation using Simulated Locomotion Demonstrations

Learning robot manipulation policies through reinforcement learning (RL)...
research
02/28/2023

Learning Sparse Control Tasks from Pixels by Latent Nearest-Neighbor-Guided Explorations

Recent progress in deep reinforcement learning (RL) and computer vision ...
research
06/06/2019

An Extensible Interactive Interface for Agent Design

In artificial intelligence, we often specify tasks through a reward func...
research
03/29/2023

Learning Excavation of Rigid Objects with Offline Reinforcement Learning

Autonomous excavation is a challenging task. The unknown contact dynamic...
research
04/12/2019

Extrapolating Beyond Suboptimal Demonstrations via Inverse Reinforcement Learning from Observations

A critical flaw of existing inverse reinforcement learning (IRL) methods...
research
09/06/2021

Hindsight Reward Tweaking via Conditional Deep Reinforcement Learning

Designing optimal reward functions has been desired but extremely diffic...
