SO-Pose: Exploiting Self-Occlusion for Direct 6D Pose Estimation

08/18/2021
by Yan Di, et al.

Directly regressing all six degrees of freedom (6DoF) of an object's pose (i.e., its 3D rotation and translation) from a single RGB image of a cluttered scene is a challenging problem. While end-to-end methods have recently demonstrated promising results at high efficiency, their pose accuracy still falls short of elaborate PnP/RANSAC-based approaches. In this work, we address this shortcoming with a novel way of reasoning about self-occlusion, which yields a two-layer representation for 3D objects and considerably enhances the accuracy of end-to-end 6D pose estimation. Our framework, named SO-Pose, takes a single RGB image as input and generates 2D-3D correspondences and self-occlusion information using a shared encoder and two separate decoders, one per output. Both outputs are then fused to directly regress the 6DoF pose parameters. By incorporating cross-layer consistencies that align correspondences, self-occlusion, and 6D pose, we further improve accuracy and robustness, surpassing or rivaling all other state-of-the-art approaches on various challenging datasets.
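
The abstract does not spell out its consistency terms, so the following is only a generic illustration of what it means for 2D-3D correspondences and a 6D pose to agree: a 3D model point, transformed by the pose and projected through a pinhole camera, should land on the pixel it was matched to. This is a minimal sketch, not the SO-Pose loss. All names (`transform`, `project`, `reprojection_error`) and the intrinsics tuple `(fx, fy, cx, cy)` are hypothetical choices made for this example.

```python
import math

def transform(R, t, p_obj):
    """Map a 3D point from object to camera coordinates: R @ p + t."""
    return [sum(R[i][j] * p_obj[j] for j in range(3)) + t[i] for i in range(3)]

def project(K, p_cam):
    """Pinhole projection; K = (fx, fy, cx, cy) is an assumed intrinsics layout."""
    fx, fy, cx, cy = K
    x, y, z = p_cam
    return (fx * x / z + cx, fy * y / z + cy)

def reprojection_error(K, R, t, correspondences):
    """Mean pixel distance between each matched pixel and the projection
    of its 3D model point under the pose (R, t)."""
    total = 0.0
    for (u, v), p_obj in correspondences:
        pu, pv = project(K, transform(R, t, p_obj))
        total += math.hypot(pu - u, pv - v)
    return total / len(correspondences)

# Illustrative values only: identity rotation, object 2 units in front of the camera.
K = (500.0, 500.0, 320.0, 240.0)
R = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
t = [0.0, 0.0, 2.0]

consistent = [((345.0, 190.0), [0.1, -0.2, 0.0])]
shifted = [((348.0, 194.0), [0.1, -0.2, 0.0])]

err_consistent = reprojection_error(K, R, t, consistent)
err_shifted = reprojection_error(K, R, t, shifted)
```

A correspondence that agrees with the pose gives an error near zero, while a pixel that is offset from the projected point gives an error equal to that offset. PnP-based pipelines minimize a quantity of this kind over the pose, whereas a direct method regresses the pose and can use such agreement as a training signal.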


