Uncovering the Background-Induced bias in RGB based 6-DoF Object Pose Estimation

by   Elena Govi, et al.

In recent years, there has been a growing trend of using data-driven methods in industrial settings. These kinds of methods often process video images or parts, therefore the integrity of such images is crucial. Sometimes datasets, e.g. consisting of images, can be sophisticated for various reasons. It becomes critical to understand how the manipulation of video and images can impact the effectiveness of a machine learning method. Our case study aims precisely to analyze the Linemod dataset, considered the state of the art in 6D pose estimation context. That dataset presents images accompanied by ArUco markers; it is evident that such markers will not be available in real-world contexts. We analyze how the presence of the markers affects the pose estimation accuracy, and how this bias may be mitigated through data augmentation and other methods. Our work aims to show how the presence of these markers goes to modify, in the testing phase, the effectiveness of the deep learning method used. In particular, we will demonstrate, through the tool of saliency maps, how the focus of the neural network is captured in part by these ArUco markers. Finally, a new dataset, obtained by applying geometric tools to Linemod, will be proposed in order to demonstrate our hypothesis and uncovering the bias. Our results demonstrate the potential for bias in 6DOF pose estimation networks, and suggest methods for reducing this bias when training with markers.


page 2

page 4

page 6

page 7

page 8

page 11

page 13

page 14


On Object Symmetries and 6D Pose Estimation from Images

Objects with symmetries are common in our daily life and in industrial c...

SHREC 2020 track: 6D Object Pose Estimation

6D pose estimation is crucial for augmented reality, virtual reality, ro...

Large-scale 6D Object Pose Estimation Dataset for Industrial Bin-Picking

In this paper, we introduce a new public dataset for 6D object pose esti...

Learning a Disentangled Embedding for Monocular 3D Shape Retrieval and Pose Estimation

We propose a novel approach to jointly perform 3D object retrieval and p...

Learning Implicit Probability Distribution Functions for Symmetric Orientation Estimation from RGB Images Without Pose Labels

Object pose estimation is a necessary prerequisite for autonomous roboti...

Towards Deep Learning-based 6D Bin Pose Estimation in 3D Scans

An automated robotic system needs to be as robust as possible and fail-s...

A Method to Generate High Precision Mesh Model and RGB-D Datasetfor 6D Pose Estimation Task

Recently, 3D version has been improved greatly due to the development of...

Please sign up or login with your details

Forgot password? Click here to reset