Hierarchical Policy Learning for Mechanical Search

02/28/2022
by Oussama Zenkri, et al.

Retrieving objects from clutter is a complex task that requires multiple interactions with the environment before the target object can be extracted. These interactions involve executing action primitives such as grasping or pushing, as well as prioritizing which objects to manipulate and which actions to execute. Mechanical Search (MS) is a framework for object retrieval that uses a heuristic algorithm for pushing and rule-based algorithms for high-level planning. While rule-based policies benefit from human intuition, they often perform sub-optimally. Deep reinforcement learning (RL) has shown strong performance in complex tasks such as making decisions directly from pixel input, which makes it suitable for training policies for object retrieval. In this work, we first formulate the MS problem in a principled way as a hierarchical POMDP. Based on this formulation, we propose a hierarchical policy learning approach for the MS problem. For demonstration, we present two main parameterized sub-policies: a push policy and an action selection policy. When integrated into the hierarchical POMDP's policy, our proposed sub-policies improve on a baseline success rate of less than 32% for retrieving the target object, while reducing the computation time for push actions from multiple seconds to less than 10 milliseconds.
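
The hierarchical structure described in the abstract (a high-level action selection policy that delegates to sub-policies such as pushing) can be illustrated with a toy sketch. This is not the authors' implementation: the `Observation` fields, the fixed occlusion threshold, the rule-based selector standing in for a learned policy, and the assumption that each push reduces occlusion by 0.25 are all hypothetical choices made only to show the control flow.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Observation:
    # Hypothetical summary of a scene; a real system would work from images.
    target_visible: bool
    target_occlusion: float  # 0.0 = fully exposed, 1.0 = fully covered


def push_policy(obs: Observation) -> str:
    # Placeholder for a learned push sub-policy; returns a primitive name.
    return "push"


def grasp_policy(obs: Observation) -> str:
    # Placeholder for a grasp sub-policy; returns a primitive name.
    return "grasp_target"


SUB_POLICIES = {"push": push_policy, "grasp": grasp_policy}


def select_action(obs: Observation, threshold: float = 0.3) -> str:
    # Stand-in for the high-level action selection policy.
    # Assumption: grasp only once the target is visible and mostly exposed.
    if obs.target_visible and obs.target_occlusion < threshold:
        return "grasp"
    return "push"


def step(obs: Observation) -> Tuple[str, str]:
    # One hierarchical decision: pick a sub-policy, then query it.
    option = select_action(obs)
    return option, SUB_POLICIES[option](obs)


def run_episode(occlusion: float, max_steps: int = 10) -> List[str]:
    # Toy dynamics (assumed): every push lowers occlusion by 0.25.
    obs = Observation(occlusion < 1.0, occlusion)
    trace: List[str] = []
    for _ in range(max_steps):
        option, primitive = step(obs)
        trace.append(primitive)
        if option == "grasp":
            break  # target retrieved in this toy model
        new_occlusion = max(0.0, obs.target_occlusion - 0.25)
        obs = Observation(new_occlusion < 1.0, new_occlusion)
    return trace


if __name__ == "__main__":
    print(run_episode(0.75))
```

In this sketch a target that starts 75% occluded is pushed twice before the selector switches to grasping. In a learned version, `select_action` and `push_policy` would be replaced by trained networks, while the dispatch structure in `step` would stay the same.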

research
03/04/2019

Mechanical Search: Multi-Step Retrieval of a Target Object Occluded by Clutter

When operating in unstructured environments such as warehouses, homes, a...
research
08/13/2020

Visuomotor Mechanical Search: Learning to Retrieve Target Objects in Clutter

When searching for objects in cluttered environments, it is often necess...
research
11/22/2021

Bridging the gap between learning and heuristic based pushing policies

Non-prehensile pushing actions have the potential to singulate a target ...
research
09/18/2021

Hierarchical Policy for Non-prehensile Multi-object Rearrangement with Deep Reinforcement Learning and Monte Carlo Tree Search

Non-prehensile multi-object rearrangement is a robotic task of planning ...
research
08/11/2020

Hardware as Policy: Mechanical and Computational Co-Optimization using Deep Reinforcement Learning

Deep Reinforcement Learning (RL) has shown great success in learning com...
research
09/17/2019

Split Deep Q-Learning for Robust Object Singulation

Extracting a known target object from a pile of other objects in a clutt...
