Stereo Vision Based Single-Shot 6D Object Pose Estimation for Bin-Picking by a Robot Manipulator

05/28/2020
by   Yoshihiro Nakano, et al.
21

We propose a fast and accurate method of 6D object pose estimation for bin-picking of mechanical parts by a robot manipulator. We extend the single-shot approach to stereo vision by application of attention architecture. Our convolutional neural network model regresses to object locations and rotations from either a left image or a right image without depth information. Then, a stereo feature matching module, designated as Stereo Grid Attention, generates stereo grid matching maps. The important point of our method is only to calculate disparity of the objects found by the attention from stereo images, instead of calculating a point cloud over the entire image. The disparity value is then used to calculate the depth to the objects by the principle of triangulation. Our method also achieves a rapid processing speed of pose estimation by the single-shot architecture and it is possible to process a 1024 x 1024 pixels image in 75 milliseconds on the Jetson AGX Xavier implemented with half-float model. Weakly textured mechanical parts are used to exemplify the method. First, we create original synthetic datasets for training and evaluating of the proposed model. This dataset is created by capturing and rendering numerous 3D models of several types of mechanical parts in virtual space. Finally, we use a robotic manipulator with an electromagnetic gripper to pick up the mechanical parts in a cluttered state to verify the validity of our method in an actual scene. When a raw stereo image is used by the proposed method from our stereo camera to detect black steel screws, stainless screws, and DC motor parts, i.e., cases, rotor cores and commutator caps, the bin-picking tasks are successful with 76.3 probability, respectively.

READ FULL TEXT

page 1

page 3

page 4

page 5

page 6

research
11/03/2022

StereoPose: Category-Level 6D Transparent Object Pose Estimation from Stereo Images via Back-View NOCS

Most existing methods for category-level pose estimation rely on object ...
research
06/03/2022

End-to-End 3D Hand Pose Estimation from Stereo Cameras

This work proposes an end-to-end approach to estimate full 3D hand pose ...
research
11/28/2013

Glasgow's Stereo Image Database of Garments

To provide insight into cloth perception and manipulation with an active...
research
09/10/2020

Long Range Stereo Matching by Learning Depth and Disparity

Stereo matching generally involves computation of pixel correspondences ...
research
09/15/2014

Computing the Stereo Matching Cost with a Convolutional Neural Network

We present a method for extracting depth information from a rectified im...
research
12/12/2020

Uncalibrated Neural Inverse Rendering for Photometric Stereo of General Surfaces

This paper presents an uncalibrated deep neural network framework for th...
research
09/14/2022

FCDSN-DC: An Accurate and Lightweight Convolutional Neural Network for Stereo Estimation with Depth Completion

We propose an accurate and lightweight convolutional neural network for ...

Please sign up or login with your details

Forgot password? Click here to reset