CAP-Net: Correspondence-Aware Point-view Fusion Network for 3D Shape Analysis

09/03/2021
by   Xinwei He, et al.
0

Learning 3D representations by fusing point cloud and multi-view data has been proven to be fairly effective. While prior works typically focus on exploiting global features of the two modalities, in this paper we argue that more discriminative features can be derived by modeling "where to fuse". To investigate this, we propose a novel Correspondence-Aware Point-view Fusion Net (CAPNet). The core element of CAP-Net is a module named Correspondence-Aware Fusion (CAF) which integrates the local features of the two modalities based on their correspondence scores. We further propose to filter out correspondence scores with low values to obtain salient local correspondences, which reduces redundancy for the fusion process. In our CAP-Net, we utilize the CAF modules to fuse the multi-scale features of the two modalities both bidirectionally and hierarchically in order to obtain more informative features. Comprehensive evaluations on popular 3D shape benchmarks covering 3D object classification and retrieval show the superiority of the proposed framework.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/02/2018

PVRNet: Point-View Relation Neural Network for 3D Shape Recognition

Three-dimensional (3D) shape recognition has drawn much research attenti...
research
02/28/2020

MANet: Multimodal Attention Network based Point- View fusion for 3D Shape Recognition

3D shape recognition has attracted more and more attention as a task of ...
research
11/02/2020

Multi-View Adaptive Fusion Network for 3D Object Detection

3D object detection based on LiDAR-camera fusion is becoming an emerging...
research
11/23/2021

MFM-Net: Unpaired Shape Completion Network with Multi-stage Feature Matching

Unpaired 3D object completion aims to predict a complete 3D shape from a...
research
12/09/2021

PRA-Net: Point Relation-Aware Network for 3D Point Cloud Analysis

Learning intra-region contexts and inter-region relations are two effect...
research
10/10/2019

Adaptive and Azimuth-Aware Fusion Network of Multimodal Local Features for 3D Object Detection

This paper focuses on the construction of stronger local features and th...
research
08/26/2023

Central Similarity Multi-View Hashing for Multimedia Retrieval

Hash representation learning of multi-view heterogeneous data is the key...

Please sign up or login with your details

Forgot password? Click here to reset