3DViewGraph: Learning Global Features for 3D Shapes from A Graph of Unordered Views with Attention

05/17/2019
by   Zhizhong Han, et al.
0

Learning global features by aggregating information over multiple views has been shown to be effective for 3D shape analysis. For view aggregation in deep learning models, pooling has been applied extensively. However, pooling leads to a loss of the content within views, and the spatial relationship among views, which limits the discriminability of learned features. We propose 3DViewGraph to resolve this issue, which learns 3D global features by more effectively aggregating unordered views with attention. Specifically, unordered views taken around a shape are regarded as view nodes on a view graph. 3DViewGraph first learns a novel latent semantic mapping to project low-level view features into meaningful latent semantic embeddings in a lower dimensional space, which is spanned by latent semantic patterns. Then, the content and spatial information of each pair of view nodes are encoded by a novel spatial pattern correlation, where the correlation is computed among latent semantic patterns. Finally, all spatial pattern correlations are integrated with attention weights learned by a novel attention mechanism. This further increases the discriminability of learned features by highlighting the unordered view nodes with distinctive characteristics and depressing the ones with appearance ambiguity. We show that 3DViewGraph outperforms state-of-the-art methods under three large-scale benchmarks.

READ FULL TEXT
research
05/18/2019

Parts4Feature: Learning 3D Global Features from Generally Semantic Parts in Multiple Views

Deep learning has achieved remarkable results in 3D shape analysis by le...
research
08/08/2021

Hierarchical View Predictor: Unsupervised 3D Global Feature Learning through Hierarchical Prediction among Unordered Views

Unsupervised learning of global features for 3D shape analysis is an imp...
research
07/31/2019

ShapeCaptioner: Generative Caption Network for 3D Shapes by Learning a Mapping from Parts Detected in Multiple Views to Sentences

3D shape captioning is a challenging application in 3D shape understandi...
research
11/04/2022

GARNet: Global-Aware Multi-View 3D Reconstruction Network and the Cost-Performance Tradeoff

Deep learning technology has made great progress in multi-view 3D recons...
research
06/19/2023

Dual-view Correlation Hybrid Attention Network for Robust Holistic Mammogram Classification

Mammogram image is important for breast cancer screening, and typically ...
research
09/03/2018

Hierarchically Learned View-Invariant Representations for Cross-View Action Recognition

Recognizing human actions from varied views is challenging due to huge a...

Please sign up or login with your details

Forgot password? Click here to reset