KVN: Keypoints Voting Network with Differentiable RANSAC for Stereo Pose Estimation

07/21/2023
by Ivano Donadi, et al.

Object pose estimation is a fundamental computer vision task exploited in several robotics and augmented reality applications. Many established approaches rely on predicting 2D-3D keypoint correspondences using RANSAC (Random Sample Consensus) and estimating the object pose using the PnP (Perspective-n-Point) algorithm. Because RANSAC is non-differentiable, the correspondences cannot be learned directly in an end-to-end fashion. In this paper, we address the stereo image-based object pose estimation problem by (i) introducing a differentiable RANSAC layer into a well-known monocular pose estimation network and (ii) exploiting an uncertainty-driven multi-view PnP solver that can fuse information from multiple views. We evaluate our approach on a challenging public stereo object pose estimation dataset, achieving state-of-the-art results compared with other recent approaches. Furthermore, our ablation study shows that the differentiable RANSAC layer plays a significant role in the accuracy of the proposed method. We release an open-source implementation of our method with this paper.
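To illustrate why a differentiable RANSAC layer matters, the sketch below contrasts with the hard "pick the best hypothesis" step of classic RANSAC by replacing it with a soft, weighted selection. It is not the authors' implementation: the function name `soft_vote_keypoint`, the per-pixel direction-voting setup, and the parameters `threshold`, `beta`, and `temperature` are all assumptions chosen for illustration. The idea shown is only that a smooth inlier score followed by a softmax over hypotheses yields a keypoint estimate that varies continuously with the network's predictions.

```python
import math

def soft_vote_keypoint(pixels, directions, hypotheses,
                       threshold=0.99, beta=50.0, temperature=1.0):
    """Soft-select a 2D keypoint from candidate hypotheses (illustrative only).

    pixels:     list of (x, y) pixel positions
    directions: list of unit vectors, one per pixel, pointing at the keypoint
    hypotheses: list of candidate (x, y) keypoint locations
    """
    scores = []
    for hx, hy in hypotheses:
        score = 0.0
        for (px, py), (dx, dy) in zip(pixels, directions):
            vx, vy = hx - px, hy - py
            norm = math.hypot(vx, vy)
            if norm < 1e-9:
                continue  # pixel coincides with the hypothesis; it casts no vote
            cos = (vx * dx + vy * dy) / norm
            # Smooth inlier indicator replacing the hard test cos >= threshold.
            score += 1.0 / (1.0 + math.exp(-beta * (cos - threshold)))
        scores.append(score)

    # Softmax over hypothesis scores, shifted by the max for numerical stability.
    top = max(scores)
    exps = [math.exp((s - top) / temperature) for s in scores]
    total = sum(exps)
    weights = [e / total for e in exps]

    # Weighted average of hypotheses instead of a non-differentiable argmax.
    kx = sum(w * h[0] for w, h in zip(weights, hypotheses))
    ky = sum(w * h[1] for w, h in zip(weights, hypotheses))
    return (kx, ky), weights

# Toy usage: six pixels whose predicted directions all point at (5, 5).
true_kp = (5.0, 5.0)
pixels = [(0.0, 0.0), (10.0, 0.0), (0.0, 10.0), (10.0, 10.0), (5.0, 0.0), (0.0, 5.0)]
directions = []
for px, py in pixels:
    n = math.hypot(true_kp[0] - px, true_kp[1] - py)
    directions.append(((true_kp[0] - px) / n, (true_kp[1] - py) / n))
hypotheses = [(5.0, 5.0), (0.0, 0.0), (9.0, 2.0)]
keypoint, weights = soft_vote_keypoint(pixels, directions, hypotheses)
```

In this toy setup the hypothesis at the true keypoint should receive the largest weight. Lowering the hypothetical `temperature` parameter sharpens the selection toward a hard argmax, while higher values blend hypotheses more evenly.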

research
03/24/2022

EPro-PnP: Generalized End-to-End Probabilistic Perspective-n-Points for Monocular Object Pose Estimation

Locating 3D objects from a single RGB image via Perspective-n-Points (Pn...
research
04/21/2022

DGECN: A Depth-Guided Edge Convolutional Network for End-to-End 6D Pose Estimation

Monocular 6D pose estimation is a fundamental task in computer vision. E...
research
03/21/2023

Linear-Covariance Loss for End-to-End Learning of 6D Pose Estimation

Most modern image-based 6D object pose estimation methods learn to predi...
research
12/12/2016

PoseAgent: Budget-Constrained 6D Object Pose Estimation via Reinforcement Learning

State-of-the-art computer vision algorithms often achieve efficiency by ...
research
11/09/2022

A Solution for a Fundamental Problem of 3D Inference based on 2D Representations

3D inference from monocular vision using neural networks is an important...
research
05/04/2015

See the Difference: Direct Pre-Image Reconstruction and Pose Estimation by Differentiating HOG

The Histogram of Oriented Gradient (HOG) descriptor has led to many adva...
research
05/16/2023

Learning Correspondence Uncertainty via Differentiable Nonlinear Least Squares

We propose a differentiable nonlinear least squares framework to account...
