Online Adaptative Curriculum Learning for GANs

07/31/2018
by   Thang Doan, et al.
20

Generative Adversarial Networks (GANs) can successfully learn a probability distribution and produce realistic samples. However, open questions such as sufficient convergence conditions and mode collapse still persist. In this paper, we build on existing work in the area by proposing a novel framework for training the generator against an ensemble of discriminator networks, which can be seen as a one-student/multiple-teachers setting. We formalize this problem within the non-stationary Multi-Armed Bandit (MAB) framework, where we evaluate the capability of a bandit algorithm to select discriminators for providing the generator with feedback during learning. To this end, we propose a reward function which reflects the amount of knowledge learned by the generator and dynamically selects the optimal discriminator network. Finally, we connect our algorithm to stochastic optimization methods and show that existing methods using multiple discriminators in literature can be recovered from our parametric model. Experimental results based on the Fréchet Inception Distance (FID) demonstrates faster convergence than existing baselines and show that our method learns a curriculum.

READ FULL TEXT

page 15

page 17

page 18

page 19

page 20

page 21

page 22

page 23

research
07/30/2018

Dropout-GAN: Learning from a Dynamic Ensemble of Discriminators

We propose to incorporate adversarial dropout in generative multi-advers...
research
11/14/2022

Shared Loss between Generators of GANs

Generative adversarial networks are generative models that are capable o...
research
12/19/2014

On distinguishability criteria for estimating generative models

Two recently introduced criteria for estimation of generative models are...
research
07/24/2018

Improved Training with Curriculum GANs

In this paper we introduce Curriculum GANs, a curriculum learning strate...
research
04/10/2017

Automated Curriculum Learning for Neural Networks

We introduce a method for automatically selecting the path, or syllabus,...
research
05/23/2022

Falsification of Multiple Requirements for Cyber-Physical Systems Using Online Generative Adversarial Networks and Multi-Armed Bandits

We consider the problem of falsifying safety requirements of Cyber-Physi...
research
02/04/2014

Online Stochastic Optimization under Correlated Bandit Feedback

In this paper we consider the problem of online stochastic optimization ...

Please sign up or login with your details

Forgot password? Click here to reset