RareGAN: Generating Samples for Rare Classes

03/20/2022
by Zinan Lin, et al.

We study the problem of learning generative adversarial networks (GANs) for a rare class of an unlabeled dataset subject to a labeling budget. This problem is motivated by practical applications in domains including security (e.g., synthesizing packets for DNS amplification attacks), systems and networking (e.g., synthesizing workloads that trigger high resource usage), and machine learning (e.g., generating images from a rare class). Existing approaches are unsuitable: they either require fully labeled datasets or sacrifice the fidelity of the rare class for that of the common classes. We propose RareGAN, a novel synthesis of three key ideas: (1) extending conditional GANs to use labeled and unlabeled data for better generalization; (2) an active learning approach that requests the most useful labels; and (3) a weighted loss function to favor learning the rare class. We show that RareGAN achieves a better fidelity-diversity tradeoff on the rare class than prior work across different applications, budgets, rare class fractions, GAN losses, and architectures.
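As a rough illustration of ideas (2) and (3), the sketch below shows one generic way to weight a loss toward the rare class and to spend a labeling budget on the most uncertain samples. It is not the authors' formulation: the function names `weighted_bce` and `select_for_labeling`, the `rare_weight` parameter, and the choice of uncertainty sampling (picking predictions closest to 0.5) are all assumptions made for illustration.

```python
import math


def weighted_bce(probs, targets, is_rare, rare_weight=3.0):
    """Weighted mean binary cross-entropy (hypothetical helper).

    Each rare-class sample contributes `rare_weight` times as much as a
    common-class sample, so errors on the rare class dominate the loss.
    """
    eps = 1e-12  # keeps log() away from 0
    total = 0.0
    weight_sum = 0.0
    for p, t, rare in zip(probs, targets, is_rare):
        w = rare_weight if rare else 1.0
        p = min(max(p, eps), 1.0 - eps)
        total += -w * (t * math.log(p) + (1 - t) * math.log(1.0 - p))
        weight_sum += w
    return total / weight_sum


def select_for_labeling(rare_probs, budget):
    """Pick the `budget` unlabeled samples to label next (hypothetical helper).

    Assumes uncertainty sampling: samples whose predicted rare-class
    probability is closest to 0.5 are treated as the most useful to label.
    """
    ranked = sorted(range(len(rare_probs)), key=lambda i: abs(rare_probs[i] - 0.5))
    return ranked[:budget]


if __name__ == "__main__":
    # The same mistake costs more when it falls on a rare-class sample.
    probs, targets, is_rare = [0.1, 0.9], [1, 1], [True, False]
    print(weighted_bce(probs, targets, is_rare, rare_weight=3.0))
    print(weighted_bce(probs, targets, is_rare, rare_weight=1.0))

    # With a budget of 2, request labels for the two most uncertain samples.
    print(select_for_labeling([0.9, 0.52, 0.1, 0.45], budget=2))
```

Setting `rare_weight` to 1.0 recovers the ordinary unweighted mean, which makes the effect of the weighting easy to compare. In a real GAN the weighting would apply to the adversarial loss terms rather than to a standalone classifier loss.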


