P2-Net: Joint Description and Detection of Local Features for Pixel and Point Matching

03/01/2021
by   Bing Wang, et al.
0

Accurately describing and detecting 2D and 3D keypoints is crucial to establishing correspondences across images and point clouds. Despite a plethora of learning-based 2D or 3D local feature descriptors and detectors having been proposed, the derivation of a shared descriptor and joint keypoint detector that directly matches pixels and points remains under-explored by the community. This work takes the initiative to establish fine-grained correspondences between 2D images and 3D point clouds. In order to directly match pixels and points, a dual fully convolutional framework is presented that maps 2D and 3D inputs into a shared latent representation space to simultaneously describe and detect keypoints. Furthermore, an ultra-wide reception mechanism in combination with a novel loss function are designed to mitigate the intrinsic information variations between pixel and point local regions. Extensive experimental results demonstrate that our framework shows competitive performance in fine-grained matching between images and point clouds and achieves state-of-the-art results for the task of indoor visual localization. Our source code will be available at [no-name-for-blind-review].

READ FULL TEXT

page 3

page 6

research
03/06/2020

D3Feat: Joint Learning of Dense Detection and Description of 3D Local Features

A successful point cloud registration often lies on robust establishment...
research
03/18/2020

LRC-Net: Learning Discriminative Features on Point Clouds by EncodingLocal Region Contexts

Learning discriminative feature directly on point clouds is still challe...
research
03/18/2020

LRC-Net: Learning Discriminative Features on Point Clouds by Encoding Local Region Contexts

Learning discriminative feature directly on point clouds is still challe...
research
09/14/2023

EP2P-Loc: End-to-End 3D Point to 2D Pixel Localization for Large-Scale Visual Localization

Visual localization is the task of estimating a 6-DoF camera pose of a q...
research
05/09/2019

D2-Net: A Trainable CNN for Joint Detection and Description of Local Features

In this work we address the problem of finding reliable pixel-level corr...
research
10/26/2022

Learning a Task-specific Descriptor for Robust Matching of 3D Point Clouds

Existing learning-based point feature descriptors are usually task-agnos...
research
03/30/2019

Person-in-WiFi: Fine-grained Person Perception using WiFi

Fine-grained person perception such as body segmentation and pose estima...

Please sign up or login with your details

Forgot password? Click here to reset