Learning Structured Output Representations from Attributes using Deep Conditional Generative Models

04/30/2023
by   Mohamed Debbagh, et al.
0

Structured output representation is a generative task explored in computer vision that often times requires the mapping of low dimensional features to high dimensional structured outputs. Losses in complex spatial information in deterministic approaches such as Convolutional Neural Networks (CNN) lead to uncertainties and ambiguous structures within a single output representation. A probabilistic approach through deep Conditional Generative Models (CGM) is presented by Sohn et al. in which a particular model known as the Conditional Variational Auto-encoder (CVAE) is introduced and explored. While the original paper focuses on the task of image segmentation, this paper adopts the CVAE framework for the task of controlled output representation through attributes. This approach allows us to learn a disentangled multimodal prior distribution, resulting in more controlled and robust approach to sample generation. In this work we recreate the CVAE architecture and train it on images conditioned on various attributes obtained from two image datasets; the Large-scale CelebFaces Attributes (CelebA) dataset and the Caltech-UCSD Birds (CUB-200-2011) dataset. We attempt to generate new faces with distinct attributes such as hair color and glasses, as well as different bird species samples with various attributes. We further introduce strategies for improving generalized sample generation by applying a weighted term to the variational lower bound.

READ FULL TEXT

page 8

page 10

page 11

research
03/06/2016

Variational methods for Conditional Multimodal Deep Learning

In this paper, we address the problem of conditional modality learning, ...
research
02/23/2020

Assembling Semantically-Disentangled Representations for Predictive-Generative Models via Adaptation from Synthetic Domain

Deep neural networks can form high-level hierarchical representations of...
research
12/01/2016

CDVAE: Co-embedding Deep Variational Auto Encoder for Conditional Variational Generation

Problems such as predicting a new shading field (Y) for an image (X) are...
research
11/01/2017

Multi-View Data Generation Without View Supervision

The development of high-dimensional generative models has recently gaine...
research
05/30/2023

DualVAE: Controlling Colours of Generated and Real Images

Colour controlled image generation and manipulation are of interest to a...
research
09/17/2017

Multi-Entity Dependence Learning with Rich Context via Conditional Variational Auto-encoder

Multi-Entity Dependence Learning (MEDL) explores conditional correlation...
research
11/30/2019

Disentanglement Challenge: From Regularization to Reconstruction

The challenge of learning disentangled representation has recently attra...

Please sign up or login with your details

Forgot password? Click here to reset