Self6D: Self-Supervised Monocular 6D Object Pose Estimation

04/14/2020
by   Gu Wang, et al.

Estimating the 6D object pose is a fundamental problem in computer vision. Convolutional Neural Networks (CNNs) have recently proven capable of predicting reliable 6D pose estimates even from monocular images. However, CNNs are extremely data-driven, and acquiring adequate annotations is often very time-consuming and labor-intensive. To overcome this shortcoming, we propose monocular 6D pose estimation by means of self-supervised learning, which removes the need for real data with annotations. After training our proposed network fully supervised on synthetic RGB data, we leverage recent advances in neural rendering to further self-supervise the model on unannotated real RGB-D data, seeking a visually and geometrically optimal alignment. Extensive evaluations demonstrate that our proposed self-supervision significantly enhances the model's original performance, outperforming all other methods that rely on synthetic data or employ elaborate domain adaptation techniques.
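To make the idea of a "visually and geometrically optimal alignment" more concrete, the following is a minimal NumPy sketch of one way such an objective could be scored. It is not the paper's actual loss: the function names (`backproject`, `chamfer`, `alignment_loss`), the weight `w_geom`, the choice of a masked L1 color difference for the visual term, and the choice of a chamfer distance for the geometric term are all assumptions made for illustration. It also only evaluates a loss value; actual self-supervision would require a differentiable renderer and an autodiff framework.

```python
import numpy as np

def backproject(depth, K):
    """Lift a depth map (H, W) to an (N, 3) point cloud with pinhole intrinsics K.
    Pixels with depth <= 0 are treated as invalid and skipped."""
    v, u = np.nonzero(depth > 0)
    z = depth[v, u]
    x = (u - K[0, 2]) * z / K[0, 0]
    y = (v - K[1, 2]) * z / K[1, 1]
    return np.stack([x, y, z], axis=1)

def chamfer(a, b):
    """Symmetric chamfer distance between point sets a (N, 3) and b (M, 3)."""
    d = np.linalg.norm(a[:, None, :] - b[None, :, :], axis=2)  # (N, M) pairwise distances
    return d.min(axis=1).mean() + d.min(axis=0).mean()

def alignment_loss(rendered_rgb, rendered_depth, real_rgb, real_depth, K, w_geom=1.0):
    """Hypothetical combined loss: visual term + w_geom * geometric term.
    The rendered depth defines the object mask (assumption)."""
    mask = rendered_depth > 0
    if not mask.any():
        raise ValueError("rendered object mask is empty")
    # Visual term: mean absolute color difference inside the rendered mask.
    visual = np.abs(rendered_rgb[mask] - real_rgb[mask]).mean()
    # Geometric term: chamfer distance between the rendered and observed point clouds.
    real_pts = backproject(np.where(mask, real_depth, 0.0), K)
    if real_pts.shape[0] == 0:
        raise ValueError("no valid sensor depth inside the rendered mask")
    geometric = chamfer(backproject(rendered_depth, K), real_pts)
    return visual + w_geom * geometric
```

In this sketch a perfectly aligned render gives a loss of zero, and both a color mismatch and a depth offset increase it, which is the qualitative behavior an alignment objective of this kind would need.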

research
03/19/2022

Occlusion-Aware Self-Supervised Monocular 6D Object Pose Estimation

6D object pose estimation is a fundamental yet challenging problem in co...
research
02/14/2023

MSDA: Monocular Self-supervised Domain Adaptation for 6D Object Pose Estimation

Acquiring labeled 6D poses from real images is an expensive and time-con...
research
08/19/2023

Pseudo Flow Consistency for Self-Supervised 6D Object Pose Estimation

Most self-supervised 6D object pose estimation methods can only work wit...
research
02/28/2023

Markerless Camera-to-Robot Pose Estimation via Self-supervised Sim-to-Real Transfer

Solving the camera-to-robot pose is a fundamental requirement for vision...
research
04/14/2023

Tempo vs. Pitch: understanding self-supervised tempo estimation

Self-supervision methods learn representations by solving pretext tasks ...
research
08/21/2023

Polarimetric Information for Multi-Modal 6D Pose Estimation of Photometrically Challenging Objects with Limited Data

6D pose estimation pipelines that rely on RGB-only or RGB-D data show li...
research
03/23/2021

MonoRUn: Monocular 3D Object Detection by Reconstruction and Uncertainty Propagation

Object localization in 3D space is a challenging aspect in monocular 3D ...
