PartGlot: Learning Shape Part Segmentation from Language Reference Games

12/13/2021
by   Juil Koo, et al.
1

We introduce PartGlot, a neural framework and associated architectures for learning semantic part segmentation of 3D shape geometry, based solely on part referential language. We exploit the fact that linguistic descriptions of a shape can provide priors on the shape's parts – as natural language has evolved to reflect human perception of the compositional structure of objects, essential to their recognition and use. For training, we use the paired geometry / language data collected in the ShapeGlot work for their reference game, where a speaker creates an utterance to differentiate a target shape from two distractors and the listener has to find the target based on this utterance. Our network is designed to solve this target discrimination problem, carefully incorporating a Transformer-based attention module so that the output attention can precisely highlight the semantic part or parts described in the language. Furthermore, the network operates without any direct supervision on the 3D geometry itself. Surprisingly, we further demonstrate that the learned part information is generalizable to shape classes unseen during training. Our approach opens the possibility of learning 3D shape parts from language alone, without the need for large-scale part geometry annotations, thus facilitating annotation acquisition.

READ FULL TEXT

page 15

page 16

page 17

page 22

page 23

page 24

page 25

page 26

research
12/24/2021

iSeg3D: An Interactive 3D Shape Segmentation Tool

A large-scale dataset is essential for learning good features in 3D shap...
research
12/04/2020

Compositionally Generalizable 3D Structure Prediction

Single-image 3D shape reconstruction is an important and long-standing p...
research
11/03/2019

Leveraging Pretrained Image Classifiers for Language-Based Segmentation

Current semantic segmentation models cannot easily generalize to new obj...
research
12/09/2022

LADIS: Language Disentanglement for 3D Shape Editing

Natural language interaction is a promising direction for democratizing ...
research
02/16/2020

Learning to Group: A Bottom-Up Framework for 3D Part Discovery in Unseen Categories

We address the problem of discovering 3D parts for objects in unseen cat...
research
12/04/2018

Multiview Cross-supervision for Semantic Segmentation

This paper presents a semi-supervised learning framework for a customize...
research
05/08/2019

ShapeGlot: Learning Language for Shape Differentiation

In this work we explore how fine-grained differences between the shapes ...

Please sign up or login with your details

Forgot password? Click here to reset