GDR-Net: Geometry-Guided Direct Regression Network for Monocular 6D Object Pose Estimation

02/24/2021
by Gu Wang, et al.

6D pose estimation from a single RGB image is a fundamental task in computer vision. The current top-performing deep learning-based methods rely on an indirect strategy: they first establish 2D-3D correspondences between coordinates in the image plane and in the object coordinate system, and then apply a variant of the PnP/RANSAC algorithm. However, this two-stage pipeline is not end-to-end trainable and is therefore hard to employ in the many tasks that require differentiable poses. On the other hand, methods based on direct regression are currently inferior to geometry-based methods. In this work, we perform an in-depth investigation of both direct and indirect methods, and propose a simple yet effective Geometry-guided Direct Regression Network (GDR-Net) to learn the 6D pose in an end-to-end manner from dense correspondence-based intermediate geometric representations. Extensive experiments show that our approach remarkably outperforms state-of-the-art methods on the LM, LM-O and YCB-V datasets. Code is available at https://git.io/GDR-Net.
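To make the indirect strategy concrete, the following is a minimal NumPy sketch of its geometric step: recovering a rotation R and translation t from 2D-3D correspondences. It is not the GDR-Net code or the authors' pipeline. It assumes noise-free correspondences, at least six non-coplanar points, and a known intrinsic matrix K, and it uses a plain linear (DLT) solve with no RANSAC loop. The function names `project` and `pnp_dlt` are hypothetical and chosen only for this illustration.

```python
import numpy as np


def project(K, R, t, pts3d):
    """Pinhole projection of object-frame 3D points to pixel coordinates."""
    cam = pts3d @ R.T + t            # object frame -> camera frame
    uv = cam @ K.T                   # apply intrinsics
    return uv[:, :2] / uv[:, 2:3]    # perspective divide


def pnp_dlt(K, pts3d, pts2d):
    """Linear (DLT) pose from >= 6 non-coplanar 2D-3D correspondences.

    Illustrative only: no outlier rejection and no nonlinear refinement.
    """
    n = len(pts3d)
    # Remove the intrinsics so that the unknown is [R | t] up to scale.
    homog2d = np.hstack([pts2d, np.ones((n, 1))])
    xy = homog2d @ np.linalg.inv(K).T
    Xh = np.hstack([pts3d, np.ones((n, 1))])

    # Each correspondence contributes two linear equations in the 12 entries of P:
    #   p1.X - x * p3.X = 0   and   p2.X - y * p3.X = 0
    A = np.zeros((2 * n, 12))
    A[0::2, 0:4] = Xh
    A[0::2, 8:12] = -xy[:, 0:1] * Xh
    A[1::2, 4:8] = Xh
    A[1::2, 8:12] = -xy[:, 1:2] * Xh

    # The solution is the right singular vector with the smallest singular value.
    _, _, Vt = np.linalg.svd(A)
    P = Vt[-1].reshape(3, 4)

    # Resolve the sign ambiguity so the rotation part has positive determinant.
    if np.linalg.det(P[:, :3]) < 0:
        P = -P

    # Project the 3x3 block onto the nearest rotation and undo the scale.
    U, S, Vt2 = np.linalg.svd(P[:, :3])
    R = U @ Vt2
    t = P[:, 3] / S.mean()
    return R, t


# Synthetic check: build correspondences from a known pose, then recover it.
rng = np.random.default_rng(0)
K = np.array([[600.0, 0.0, 320.0], [0.0, 600.0, 240.0], [0.0, 0.0, 1.0]])
cz, sz = np.cos(0.3), np.sin(0.3)
cx, sx = np.cos(-0.5), np.sin(-0.5)
Rz = np.array([[cz, -sz, 0.0], [sz, cz, 0.0], [0.0, 0.0, 1.0]])
Rx = np.array([[1.0, 0.0, 0.0], [0.0, cx, -sx], [0.0, sx, cx]])
R_true = Rz @ Rx
t_true = np.array([0.1, -0.2, 3.0])
pts3d = rng.uniform(-0.5, 0.5, size=(8, 3))
pts2d = project(K, R_true, t_true, pts3d)
R_est, t_est = pnp_dlt(K, pts3d, pts2d)
```

In practice the correspondences come from a network and contain outliers, which is why indirect methods wrap a PnP solver in RANSAC rather than using a single linear solve as above.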


research
04/21/2022

DGECN: A Depth-Guided Edge Convolutional Network for End-to-End 6D Pose Estimation

Monocular 6D pose estimation is a fundamental task in computer vision. E...
research
12/22/2020

A Structure-Aware Method for Direct Pose Estimation

Estimating camera pose from a single image is a fundamental problem in c...
research
10/11/2022

DCL-Net: Deep Correspondence Learning Network for 6D Pose Estimation

Establishment of point correspondence between camera and object coordina...
research
11/01/2021

Learning Eye-in-Hand Camera Calibration from a Single Image

Eye-in-hand camera calibration is a fundamental and long-studied problem...
research
07/19/2020

AWR: Adaptive Weighting Regression for 3D Hand Pose Estimation

In this paper, we propose an adaptive weighting regression (AWR) method ...
research
11/21/2022

Simultaneous Multiple Object Detection and Pose Estimation using 3D Model Infusion with Monocular Vision

Multiple object detection and pose estimation are vital computer vision ...
research
03/24/2022

EPro-PnP: Generalized End-to-End Probabilistic Perspective-n-Points for Monocular Object Pose Estimation

Locating 3D objects from a single RGB image via Perspective-n-Points (Pn...
