Part2Word: Learning Joint Embedding of Point Clouds and Text by Matching Parts to Words

07/05/2021
by Chuan Tang, et al.

Learning a joint embedding of 3D shapes and text is important for shape understanding tasks such as shape-text matching, retrieval, and shape captioning. Current multi-view based methods learn a mapping from multiple rendered views to text. However, these methods cannot analyze 3D shapes well because of self-occlusion and the limitations of learning manifolds. To resolve this issue, we propose a method that learns a joint embedding of point clouds and text by matching parts from shapes to words from sentences in a common space. Specifically, we first learn a segmentation prior to segment point clouds into parts. We then map parts and words into an optimized space, where parts and words can be matched with each other. In this space, we represent a part by aggregating the features of all points within the part, and we represent each word together with its context information; we train the network to minimize a triplet ranking loss. Moreover, we introduce cross-modal attention to capture part-word relationships during matching, which enhances joint embedding learning. Experimental results show that our method outperforms the state of the art in multi-modal retrieval on the widely used benchmark.
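The matching pipeline described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify the aggregation operator, the similarity measure, the attention form, or the exact loss, so mean pooling, cosine similarity, softmax attention with a `temperature`, and a hinge-style loss with a `margin` are all assumptions, and every function name below is hypothetical.

```python
import math

def pool_parts(point_feats, part_ids):
    """Aggregate per-point features into one feature per part.
    Mean pooling is an assumption; the abstract only says 'aggregating'."""
    sums, counts = {}, {}
    for feat, pid in zip(point_feats, part_ids):
        acc = sums.setdefault(pid, [0.0] * len(feat))
        for i, v in enumerate(feat):
            acc[i] += v
        counts[pid] = counts.get(pid, 0) + 1
    return [[v / counts[pid] for v in sums[pid]] for pid in sorted(sums)]

def cosine(a, b):
    """Cosine similarity; returns 0.0 if either vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na > 0 and nb > 0 else 0.0

def shape_text_similarity(parts, words, temperature=0.1):
    """Each word attends over all parts (softmax over cosine scores),
    then is compared with its attended part vector. The shape-text
    score is the mean over words (an assumed pooling choice)."""
    scores = []
    for w in words:
        sims = [cosine(w, p) for p in parts]
        m = max(sims)  # subtract the max for numerical stability
        weights = [math.exp((s - m) / temperature) for s in sims]
        z = sum(weights)
        attended = [sum(wt / z * p[i] for wt, p in zip(weights, parts))
                    for i in range(len(w))]
        scores.append(cosine(w, attended))
    return sum(scores) / len(scores)

def triplet_ranking_loss(pos, neg_text, neg_shape, margin=0.2):
    """Hinge-style ranking loss: the matched pair should score higher
    than each mismatched pair by at least `margin` (assumed form)."""
    return (max(0.0, margin - pos + neg_text)
            + max(0.0, margin - pos + neg_shape))

# Toy example: three points in two parts, one matching and one mismatched word.
parts = pool_parts([[1.0, 0.0], [1.0, 0.0], [0.0, 1.0]], [0, 0, 1])
pos = shape_text_similarity(parts, [[1.0, 0.0]])
neg = shape_text_similarity(parts, [[-1.0, 0.0]])
loss = triplet_ranking_loss(pos, neg, neg)
```

In this toy setup the matched word aligns with one part, so the positive score exceeds the negative score by more than the margin and the loss is zero; a trained network would instead learn the part and word features that make this happen.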


