CIGMO: Categorical invariant representations in a deep generative framework

05/27/2022
by   Haruo Hosoya, et al.
9

Data of general object images have two most common structures: (1) each object of a given shape can be rendered in multiple different views, and (2) shapes of objects can be categorized in such a way that the diversity of shapes is much larger across categories than within a category. Existing deep generative models can typically capture either structure, but not both. In this work, we introduce a novel deep generative model, called CIGMO, that can learn to represent category, shape, and view factors from image data. The model is comprised of multiple modules of shape representations that are each specialized to a particular category and disentangled from view representation, and can be learned using a group-based weakly supervised learning method. By empirical investigation, we show that our model can effectively discover categories of object shapes despite large view variation and quantitatively supersede various previous methods including the state-of-the-art invariant clustering algorithm. Further, we show that our approach using category-specialization can enhance the learned shape representation to better perform down-stream tasks such as one-shot object identification as well as shape-view disentanglement.

READ FULL TEXT

page 1

page 2

page 9

research
12/18/2016

3D Shape Induction from 2D Views of Multiple Objects

In this paper we investigate the problem of inducing a distribution over...
research
09/07/2018

A simple probabilistic deep generative model for learning generalizable disentangled representations from grouped data

The disentangling problem is to discover multiple complex factors of var...
research
02/02/2015

Adaptive Scene Category Discovery with Generative Learning and Compositional Sampling

This paper investigates a general framework to discover categories of un...
research
06/22/2014

3D ShapeNets: A Deep Representation for Volumetric Shapes

3D shape is a crucial but heavily underutilized cue in today's computer ...
research
06/09/2021

ClipGen: A Deep Generative Model for Clipart Vectorization and Synthesis

This paper presents a novel deep learning-based approach for automatical...
research
08/04/2020

PatchNets: Patch-Based Generalizable Deep Implicit 3D Shape Representations

Implicit surface representations, such as signed-distance functions, com...
research
08/05/2022

Learning to Generate 3D Shapes from a Single Example

Existing generative models for 3D shapes are typically trained on a larg...

Please sign up or login with your details

Forgot password? Click here to reset