Deep Fusion Transformer Network with Weighted Vector-Wise Keypoints Voting for Robust 6D Object Pose Estimation

08/10/2023
by Jun Zhou, et al.

One critical challenge in 6D object pose estimation from a single RGBD image is the efficient integration of two different modalities, i.e., color and depth. In this work, we tackle this problem with a novel Deep Fusion Transformer (DFTr) block that aggregates cross-modality features to improve pose estimation. Unlike existing fusion methods, the proposed DFTr can better model cross-modality semantic correlation by leveraging the semantic similarity between color and depth features, so that globally enhanced features from the two modalities can be better integrated for improved information extraction. Moreover, to further improve robustness and efficiency, we introduce a novel weighted vector-wise voting algorithm that employs a non-iterative global optimization strategy for precise 3D keypoint localization while achieving near real-time inference. Extensive experiments show the effectiveness and strong generalization capability of our proposed 3D keypoint voting algorithm. Results on four widely used benchmarks also demonstrate that our method outperforms state-of-the-art methods by large margins.
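To make the idea of non-iterative, weighted vector-wise voting more concrete, the following is a minimal sketch and not the authors' algorithm, whose details the abstract does not give. It assumes each 3D point casts a vote as a direction vector pointing toward the keypoint, together with a scalar confidence weight, and it recovers the keypoint as the weighted least-squares intersection of the resulting lines by solving a single 3x3 linear system. The function names `vote_keypoint` and `solve3` and the input layout are hypothetical.

```python
import math


def solve3(A, b):
    """Solve a 3x3 linear system with Gaussian elimination and partial pivoting."""
    M = [row[:] + [rhs] for row, rhs in zip(A, b)]  # augmented matrix
    for col in range(3):
        pivot = max(range(col, 3), key=lambda r: abs(M[r][col]))
        if abs(M[pivot][col]) < 1e-12:
            raise ValueError("degenerate votes: directions are (nearly) parallel")
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, 3):
            f = M[r][col] / M[col][col]
            for c in range(col, 4):
                M[r][c] -= f * M[col][c]
    x = [0.0, 0.0, 0.0]
    for r in (2, 1, 0):  # back substitution
        s = M[r][3] - sum(M[r][c] * x[c] for c in range(r + 1, 3))
        x[r] = s / M[r][r]
    return x


def vote_keypoint(points, directions, weights):
    """Closed-form weighted least-squares intersection of 3D lines.

    Assumption (not from the paper): point p_i with direction d_i defines a
    line, and the keypoint k minimizes
        sum_i w_i * || (I - u_i u_i^T) (k - p_i) ||^2,  u_i = d_i / ||d_i||,
    which gives the normal equations A k = b accumulated below.
    """
    A = [[0.0] * 3 for _ in range(3)]
    b = [0.0] * 3
    for p, d, w in zip(points, directions, weights):
        norm = math.sqrt(sum(c * c for c in d))
        if norm == 0.0 or w <= 0.0:
            continue  # skip votes that carry no usable information
        u = [c / norm for c in d]
        for r in range(3):
            for c in range(3):
                # entry (r, c) of the projector orthogonal to the line direction
                m = (1.0 if r == c else 0.0) - u[r] * u[c]
                A[r][c] += w * m
                b[r] += w * m * p[c]
    return solve3(A, b)
```

Because the estimate comes from one linear solve rather than from repeated clustering or sampling rounds, its cost grows linearly with the number of votes, which is one way a voting step can stay close to real time. Low-confidence votes contribute less to `A` and `b`, so they pull the estimate less than high-confidence ones.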


research
11/11/2019

PVN3D: A Deep Point-wise 3D Keypoints Voting Network for 6DoF Pose Estimation

In this work, we present a novel data-driven method for robust 6DoF obje...
research
10/07/2022

KRF: Keypoint Refinement with Fusion Network for 6D Pose Estimation

Existing refinement methods gradually lose their ability to further impr...
research
10/14/2022

Keypoint Cascade Voting for Point Cloud Based 6DoF Pose Estimation

We propose a novel keypoint voting 6DoF object pose estimation method, w...
research
12/31/2018

PVNet: Pixel-wise Voting Network for 6DoF Pose Estimation

This paper addresses the challenge of 6DoF pose estimation from a single...
research
04/06/2021

Vote from the Center: 6 DoF Pose Estimation in RGB-D Images by Radial Keypoint Voting

We propose a novel keypoint voting scheme based on intersecting spheres,...
research
12/17/2016

A Fusion Method Based on Decision Reliability Ratio for Finger Vein Verification

Finger vein verification has developed a lot since its first proposal, b...
research
02/10/2020

6DoF Object Pose Estimation via Differentiable Proxy Voting Loss

Estimating a 6DOF object pose from a single image is very challenging du...
