Hybrid VAE: Improving Deep Generative Models using Partial Observations

11/30/2017
by   Sergey Tulyakov, et al.
0

Deep neural network models trained on large labeled datasets are the state-of-the-art in a large variety of computer vision tasks. In many applications, however, labeled data is expensive to obtain or requires a time consuming manual annotation process. In contrast, unlabeled data is often abundant and available in large quantities. We present a principled framework to capitalize on unlabeled data by training deep generative models on both labeled and unlabeled data. We show that such a combination is beneficial because the unlabeled data acts as a data-driven form of regularization, allowing generative models trained on few labeled samples to reach the performance of fully-supervised generative models trained on much larger datasets. We call our method Hybrid VAE (H-VAE) as it contains both the generative and the discriminative parts. We validate H-VAE on three large-scale datasets of different modalities: two face datasets: (MultiPIE, CelebA) and a hand pose dataset (NYU Hand Pose). Our qualitative visualizations further support improvements achieved by using partial observations.

READ FULL TEXT

page 2

page 6

research
07/09/2023

Score-based Conditional Generation with Fewer Labeled Data by Self-calibrating Classifier Guidance

Score-based Generative Models (SGMs) are a popular family of deep genera...
research
03/24/2018

Unsupervised Domain Adaptation: from Simulation Engine to the RealWorld

Large-scale labeled training datasets have enabled deep neural networks ...
research
05/22/2023

Phased data augmentation for training PixelCNNs with VQ-VAE-2 and limited data

With development of deep learning, researchers have developed generative...
research
10/09/2021

Harnessing Unlabeled Data to Improve Generalization of Biometric Gender and Age Classifiers

With significant advances in deep learning, many computer vision applica...
research
11/11/2015

Universum Prescription: Regularization using Unlabeled Data

This paper shows that simply prescribing "none of the above" labels to u...
research
06/11/2021

Generate, Annotate, and Learn: Generative Models Advance Self-Training and Knowledge Distillation

Semi-Supervised Learning (SSL) has seen success in many application doma...
research
03/03/2021

Comparing the Value of Labeled and Unlabeled Data in Method-of-Moments Latent Variable Estimation

Labeling data for modern machine learning is expensive and time-consumin...

Please sign up or login with your details

Forgot password? Click here to reset