DatasetGAN: Efficient Labeled Data Factory with Minimal Human Effort

04/13/2021
by   Yuxuan Zhang, et al.
40

We introduce DatasetGAN: an automatic procedure to generate massive datasets of high-quality semantically segmented images requiring minimal human effort. Current deep networks are extremely data-hungry, benefiting from training on large-scale datasets, which are time consuming to annotate. Our method relies on the power of recent GANs to generate realistic images. We show how the GAN latent code can be decoded to produce a semantic segmentation of the image. Training the decoder only needs a few labeled examples to generalize to the rest of the latent space, resulting in an infinite annotated dataset generator! These generated datasets can then be used for training any computer vision architecture just as real datasets are. As only a few images need to be manually segmented, it becomes possible to annotate images in extreme detail and generate datasets with rich object and part segmentations. To showcase the power of our approach, we generated datasets for 7 image segmentation tasks which include pixel-level labels for 34 human face parts, and 32 car parts. Our approach outperforms all semi-supervised baselines significantly and is on par with fully supervised methods, which in some cases require as much as 100x more annotated data as our method.

READ FULL TEXT

page 1

page 4

page 5

page 8

research
08/11/2023

DatasetDM: Synthesizing Data with Perception Annotations Using Diffusion Models

Current deep networks are very data-hungry and benefit from training on ...
research
02/24/2022

SLRNet: Semi-Supervised Semantic Segmentation Via Label Reuse for Human Decomposition Images

Semantic segmentation is a challenging computer vision task demanding a ...
research
11/06/2022

Learning to Annotate Part Segmentation with Gradient Matching

The success of state-of-the-art deep neural networks heavily relies on t...
research
12/06/2021

Semantic Segmentation In-the-Wild Without Seeing Any Segmentation Examples

Semantic segmentation is a key computer vision task that has been active...
research
12/09/2016

Automatic Model Based Dataset Generation for Fast and Accurate Crop and Weeds Detection

Selective weeding is one of the key challenges in the field of agricultu...
research
06/18/2020

Learning High-Resolution Domain-Specific Representations with a GAN Generator

In recent years generative models of visual data have made a great progr...
research
12/24/2022

HandsOff: Labeled Dataset Generation With No Additional Human Annotations

Recent work leverages the expressive power of generative adversarial net...

Please sign up or login with your details

Forgot password? Click here to reset