MV6D: Multi-View 6D Pose Estimation on RGB-D Frames Using a Deep Point-wise Voting Network

08/01/2022
by Fabian Duffhauss, et al.

Estimating the 6D poses of objects is an essential computer vision task. However, most conventional approaches rely on camera data from a single perspective and therefore suffer from occlusions. We overcome this issue with our novel multi-view 6D pose estimation method, MV6D, which accurately predicts the 6D poses of all objects in a cluttered scene based on RGB-D images from multiple perspectives. We base our approach on the PVN3D network, which uses a single RGB-D image to predict keypoints of the target objects. We extend this approach by using a combined point cloud from multiple views and fusing the images from each view with a DenseFusion layer. In contrast to current multi-view pose detection networks such as CosyPose, MV6D can learn the fusion of multiple perspectives in an end-to-end manner and does not require multiple prediction stages or subsequent fine-tuning of the prediction. Furthermore, we present three novel photorealistic datasets of cluttered scenes with heavy occlusions. All of them contain RGB-D images from multiple perspectives and the ground truth for instance semantic segmentation and 6D pose estimation. MV6D significantly outperforms the state of the art in multi-view 6D pose estimation, even when the camera poses are known only inaccurately. Moreover, we show that our approach is robust to dynamic camera setups and that its accuracy increases incrementally with the number of perspectives.
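To make the idea of a combined point cloud from multiple views concrete, here is a minimal sketch of one common way to merge per-view point clouds into a shared world frame using known camera poses. It is an illustration only, not the MV6D implementation: the names `transform_point` and `fuse_views` are hypothetical, the poses are assumed to be 4x4 camera-to-world rigid transforms in row-major order, and no learned feature fusion (such as the DenseFusion layer mentioned in the abstract) is modeled.

```python
from typing import List, Sequence, Tuple

Point = Tuple[float, float, float]
Matrix4 = Sequence[Sequence[float]]  # assumed: row-major 4x4 camera-to-world transform


def transform_point(pose: Matrix4, p: Point) -> Point:
    """Apply a rigid 4x4 transform to a single 3D point (hypothetical helper)."""
    x, y, z = p
    # Rows 0..2 hold the rotation (columns 0..2) and translation (column 3).
    wx = pose[0][0] * x + pose[0][1] * y + pose[0][2] * z + pose[0][3]
    wy = pose[1][0] * x + pose[1][1] * y + pose[1][2] * z + pose[1][3]
    wz = pose[2][0] * x + pose[2][1] * y + pose[2][2] * z + pose[2][3]
    return (wx, wy, wz)


def fuse_views(clouds: Sequence[Sequence[Point]],
               poses: Sequence[Matrix4]) -> List[Point]:
    """Merge per-view point clouds into one cloud expressed in the world frame."""
    if len(clouds) != len(poses):
        raise ValueError("need exactly one camera pose per view")
    fused: List[Point] = []
    for cloud, pose in zip(clouds, poses):
        fused.extend(transform_point(pose, p) for p in cloud)
    return fused


# Two views observing a point 1 m in front of each camera.
# The second camera is shifted by 1 m along the world x-axis.
identity = [[1.0, 0.0, 0.0, 0.0],
            [0.0, 1.0, 0.0, 0.0],
            [0.0, 0.0, 1.0, 0.0],
            [0.0, 0.0, 0.0, 1.0]]
shifted = [[1.0, 0.0, 0.0, 1.0],
           [0.0, 1.0, 0.0, 0.0],
           [0.0, 0.0, 1.0, 0.0],
           [0.0, 0.0, 0.0, 1.0]]
merged = fuse_views([[(0.0, 0.0, 1.0)], [(0.0, 0.0, 1.0)]], [identity, shifted])
```

In this sketch, any error in a camera pose shifts that view's points in the merged cloud, which is one way to see why robustness to inaccurately known camera poses matters for a multi-view method.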


research
07/01/2023

SyMFM6D: Symmetry-aware Multi-directional Fusion for Multi-View 6D Object Pose Estimation

Detecting objects and estimating their 6D poses is essential for automat...
research
10/03/2022

Multi-view object pose estimation from correspondence distributions and epipolar geometry

In many automation tasks involving manipulation of rigid objects, the po...
research
08/21/2022

CenDerNet: Center and Curvature Representations for Render-and-Compare 6D Pose Estimation

We introduce CenDerNet, a framework for 6D pose estimation from multi-vi...
research
06/27/2020

Light Pose Calibration for Camera-light Vision Systems

Illuminating a scene with artificial light is a prerequisite for seeing ...
research
04/22/2021

H2O: Two Hands Manipulating Objects for First Person Interaction Recognition

We present, for the first time, a comprehensive framework for egocentric...
research
04/09/2020

MoreFusion: Multi-object Reasoning for 6D Pose Estimation from Volumetric Fusion

Robots and other smart devices need efficient object-based scene represe...
research
06/24/2021

Evaluation of deep lift pose models for 3D rodent pose estimation based on geometrically triangulated data

The assessment of laboratory animal behavior is of central interest in m...
