ROBI: A Multi-View Dataset for Reflective Objects in Robotic Bin-Picking

by Jun Yang, et al.

In robotic bin-picking applications, the perception of texture-less, highly reflective parts is a valuable but challenging task. High glossiness can introduce fake edges in RGB images and inaccurate depth measurements, especially in heavily cluttered bin scenarios. In this paper, we present the ROBI (Reflective Objects in BIns) dataset, a public dataset for 6D object pose estimation and multi-view depth fusion in robotic bin-picking scenarios. The ROBI dataset includes a total of 63 bin-picking scenes captured with two active stereo cameras: a high-cost Ensenso sensor and a low-cost RealSense sensor. For each scene, the monochrome/RGB images and depth maps are captured from sampled view spheres around the scene, and are annotated with accurate 6D poses of visible objects and an associated visibility score. To evaluate the performance of depth fusion, we captured ground truth depth maps with the high-cost Ensenso camera after coating the objects in anti-reflective scanning spray. To show the utility of the dataset, we evaluated representative algorithms for 6D object pose estimation and multi-view depth fusion on the full dataset. The evaluation results demonstrate how difficult highly reflective objects are to perceive, especially in cases with degraded depth data, severe occlusions, and cluttered scenes. The ROBI dataset is available online at
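As a rough illustration of how a fused depth map might be scored against a ground truth depth map such as the spray-coated captures described above, the sketch below computes a mean absolute error and a completeness ratio. The function name `depth_error`, the choice of metrics, and the convention that a depth of 0 marks a missing measurement are all assumptions made for this example; they are not taken from the paper's evaluation protocol.

```python
from typing import List, Tuple

DepthMap = List[List[float]]  # row-major depth values in meters (assumed unit)


def depth_error(estimated: DepthMap,
                ground_truth: DepthMap,
                invalid: float = 0.0) -> Tuple[float, float]:
    """Return (mean absolute error, completeness).

    Only pixels with a valid ground truth depth are considered.
    Completeness is the fraction of those pixels where the estimate
    is also valid. `invalid` is the assumed marker for missing depth.
    """
    abs_errors = []
    gt_valid = 0
    for est_row, gt_row in zip(estimated, ground_truth):
        for est, gt in zip(est_row, gt_row):
            if gt == invalid:
                continue  # no reference depth at this pixel
            gt_valid += 1
            if est == invalid:
                continue  # sensor or fusion produced no depth here
            abs_errors.append(abs(est - gt))
    if gt_valid == 0 or not abs_errors:
        return float("nan"), 0.0
    return sum(abs_errors) / len(abs_errors), len(abs_errors) / gt_valid


# Toy 2x2 example with hypothetical values.
estimated = [[0.50, 0.0], [0.62, 0.71]]
ground_truth = [[0.52, 0.60], [0.60, 0.0]]
mae, completeness = depth_error(estimated, ground_truth)
print(f"MAE: {mae:.3f} m, completeness: {completeness:.2f}")
```

Reporting completeness alongside the error matters for reflective parts, since a method can look accurate simply by discarding the pixels it cannot measure.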





Probabilistic Multi-View Fusion of Active Stereo Depth Maps for Robotic Bin-Picking

The reliable fusion of depth maps from multiple viewpoints has become an...

T-LESS: An RGB-D Dataset for 6D Pose Estimation of Texture-less Objects

We introduce T-LESS, a new public dataset for estimating the 6D pose, i....

Active Perception with A Monocular Camera for Multiscopic Vision

We design a multiscopic vision system that utilizes a low-cost monocular...

Multi-view Fusion for Multi-level Robotic Scene Understanding

We present a system for multi-level scene awareness for robotic manipula...

KeyPose: Multi-view 3D Labeling and Keypoint Estimation for Transparent Objects

Estimating the 3D pose of desktop objects is crucial for applications su...

Depth Completion with RGB Prior

Depth cameras are a prominent perception system for robotics, especially...

Structure-From-Motion and RGBD Depth Fusion

This article describes a technique to augment a typical RGBD sensor by i...