DSC-PoseNet: Learning 6DoF Object Pose Estimation via Dual-scale Consistency

04/08/2021
by Zongxin Yang, et al.

Compared to 2D object bounding-box labeling, annotating 3D object poses is very difficult for humans, especially when depth images of scenes are unavailable. This paper investigates whether object poses can be estimated effectively when only RGB images and 2D object annotations are given. To this end, we present a two-step pose estimation framework that attains 6DoF object poses from 2D object bounding boxes. In the first step, the framework learns to segment objects from real and synthetic data in a weakly-supervised fashion, and the segmentation masks act as a prior for pose estimation. In the second step, we design a dual-scale pose estimation network, namely DSC-PoseNet, to predict object poses by employing a differentiable renderer. Specifically, DSC-PoseNet first predicts object poses at the original image scale by comparing the segmentation masks with the rendered visible object masks. Then, we resize object regions to a fixed scale and estimate poses once again. In this fashion, we eliminate large scale variations and focus on rotation estimation, thus facilitating pose estimation. Moreover, we exploit the initial pose estimation to generate pseudo ground truth to train DSC-PoseNet in a self-supervised manner. The estimation results at these two scales are ensembled as our final pose estimation. Extensive experiments on widely used benchmarks demonstrate that our method outperforms state-of-the-art models trained on synthetic data by a large margin and is even on par with several fully-supervised methods.
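
As a rough illustration of two ingredients the abstract mentions (comparing a rendered object mask with a segmentation mask, and resizing an object region to a fixed scale), the following NumPy sketch may help. It is not the authors' implementation: the function names `soft_mask_iou_loss` and `crop_and_resize`, the choice of a soft-IoU loss, the nearest-neighbour resampling, and the 64-pixel target size are all assumptions made for this example. In a real pipeline the rendered mask would come from a renderer and the loss would be computed in an autodiff framework so that gradients can reach the pose.

```python
import numpy as np


def soft_mask_iou_loss(rendered, target, eps=1e-6):
    """Return 1 - soft IoU between two masks with values in [0, 1].

    Assumed loss for illustration; the abstract does not name the
    exact mask-comparison objective.
    """
    rendered = np.asarray(rendered, dtype=np.float64)
    target = np.asarray(target, dtype=np.float64)
    intersection = (rendered * target).sum()
    union = (rendered + target - rendered * target).sum()
    return 1.0 - intersection / (union + eps)


def crop_and_resize(image, box, size=64):
    """Crop box = (x0, y0, x1, y1) and resize it to size x size.

    Uses nearest-neighbour sampling to stay dependency-free; the
    fixed output size removes scale variation between objects.
    """
    x0, y0, x1, y1 = box
    crop = np.asarray(image)[y0:y1, x0:x1]
    h, w = crop.shape[:2]
    rows = np.arange(size) * h // size  # source row for each output row
    cols = np.arange(size) * w // size  # source column for each output column
    return crop[rows][:, cols]
```

A lower loss means the rendered mask overlaps the segmentation mask more closely, and the fixed-size crop is what a second, scale-normalized pose estimate would operate on under these assumptions.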


research
08/19/2023

Pseudo Flow Consistency for Self-Supervised 6D Object Pose Estimation

Most self-supervised 6D object pose estimation methods can only work wit...
research
04/01/2021

Wide-Depth-Range 6D Object Pose Estimation in Space

6D pose estimation in space poses unique challenges that are not commonl...
research
01/23/2019

DeepFashion2: A Versatile Benchmark for Detection, Pose Estimation, Segmentation and Re-Identification of Clothing Images

Understanding fashion images has been advanced by benchmarks with rich a...
research
08/09/2020

1-Point RANSAC-Based Method for Ground Object Pose Estimation

Solving Perspective-n-Point (PnP) problems is a traditional way of estim...
research
07/19/2022

PoserNet: Refining Relative Camera Poses Exploiting Object Detections

The estimation of the camera poses associated with a set of images commo...
research
08/07/2023

A Horse with no Labels: Self-Supervised Horse Pose Estimation from Unlabelled Images and Synthetic Prior

Obtaining labelled data to train deep learning methods for estimating an...
research
12/06/2020

Temporal-Aware Self-Supervised Learning for 3D Hand Pose and Mesh Estimation in Videos

Estimating 3D hand pose directly from RGB images is challenging but has g...
