End-to-end Trainable Deep Neural Network for Robotic Grasp Detection and Semantic Segmentation from RGB

by   Stefan Ainetter, et al.

In this work, we introduce a novel, end-to-end trainable CNN-based architecture that delivers high-quality results for both semantic segmentation and grasp detection suitable for a parallel-plate gripper. Building on this architecture, we propose a novel refinement module that takes advantage of the previously calculated grasp detection and semantic segmentation results to further increase grasp detection accuracy. Our proposed network delivers state-of-the-art accuracy on two popular grasp datasets, namely Cornell and Jacquard. As an additional contribution, we provide a novel dataset extension for the OCID dataset, making it possible to evaluate grasp detection in highly challenging scenes. Using this dataset, we show that semantic segmentation can additionally be used to assign grasp candidates to object classes, which in turn makes it possible to pick specific objects in the scene.
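The abstract does not spell out how grasp candidates are assigned to object classes, so the following is only a minimal sketch under stated assumptions: each grasp is an oriented rectangle, the segmentation output is a per-pixel grid of integer class ids, and a grasp takes the class found at its center pixel. The `Grasp` type, the `assign_grasps_to_classes` function, and the center-pixel rule are hypothetical illustrations, not the authors' method.

```python
from dataclasses import dataclass
from typing import Dict, List, Sequence


@dataclass(frozen=True)
class Grasp:
    """Oriented grasp rectangle for a parallel-plate gripper (hypothetical layout)."""
    cx: float      # center x in pixels (column)
    cy: float      # center y in pixels (row)
    width: float   # gripper opening in pixels
    height: float  # plate extent in pixels
    theta: float   # rotation in radians


def assign_grasps_to_classes(
    grasps: Sequence[Grasp],
    seg_mask: Sequence[Sequence[int]],
    background: int = 0,
) -> Dict[int, List[Grasp]]:
    """Group grasps by the class id found at each grasp center.

    Assumption: the center-pixel lookup stands in for whatever
    assignment rule the paper actually uses.
    """
    rows = len(seg_mask)
    cols = len(seg_mask[0]) if rows else 0
    by_class: Dict[int, List[Grasp]] = {}
    for g in grasps:
        r, c = int(round(g.cy)), int(round(g.cx))
        if not (0 <= r < rows and 0 <= c < cols):
            continue  # grasp center lies outside the image
        label = seg_mask[r][c]
        if label == background:
            continue  # center is not on any object
        by_class.setdefault(label, []).append(g)
    return by_class


# Toy 4x4 mask: class 1 in the top-left, class 2 in the bottom-right.
mask = [
    [1, 1, 0, 0],
    [1, 1, 0, 0],
    [0, 0, 2, 2],
    [0, 0, 2, 2],
]
candidates = [
    Grasp(0.6, 0.4, 2.0, 1.0, 0.0),    # center on class 1
    Grasp(3.0, 3.0, 2.0, 1.0, 1.57),   # center on class 2
    Grasp(0.0, 3.0, 2.0, 1.0, 0.0),    # center on background
    Grasp(10.0, 10.0, 2.0, 1.0, 0.0),  # center outside the image
]
result = assign_grasps_to_classes(candidates, mask)
```

To pick a specific object, one would then select from `result[class_id]`, for example the candidate with the highest detection score if the detector provides one.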



Seamless Scene Segmentation

In this work we introduce a novel, CNN-based architecture that can be tr...

Depth-aware Object Segmentation and Grasp Detection for Robotic Picking Tasks

In this paper, we present a novel deep neural network architecture for j...

Model-based inexact graph matching on top of CNNs for semantic scene understanding

Deep learning based pipelines for semantic segmentation often ignore str...

End-to-end training of a two-stage neural network for defect detection

Segmentation-based, two-stage neural network has shown excellent results...

Toward Stance-based Personas for Opinionated Dialogues

In the context of chit-chat dialogues it has been shown that endowing sy...

A Coarse-To-Fine (C2F) Representation for End-To-End 6-DoF Grasp Detection

We proposed an end-to-end grasp detection network, Grasp Detection Netwo...