On Training Sample Memorization: Lessons from Benchmarking Generative Modeling with a Large-scale Competition

06/06/2021
by   Ching-Yuan Bai, et al.
0

Many recent developments on generative models for natural images have relied on heuristically-motivated metrics that can be easily gamed by memorizing a small sample from the true distribution or training a model directly to improve the metric. In this work, we critically evaluate the gameability of these metrics by designing and deploying a generative modeling competition. Our competition received over 11000 submitted models. The competitiveness between participants allowed us to investigate both intentional and unintentional memorization in generative modeling. To detect intentional memorization, we propose the “Memorization-Informed Fréchet Inception Distance” (MiFID) as a new memorization-aware metric and design benchmark procedures to ensure that winning submissions made genuine improvements in perceptual quality. Furthermore, we manually inspect the code for the 1000 top-performing models to understand and label different forms of memorization. Our analysis reveals that unintentional memorization is a serious and common issue in popular generative models. The generated images and our memorization labels of those models as well as code to compute MiFID are released to facilitate future studies on benchmarking generative models.

READ FULL TEXT
research
06/07/2023

Exposing flaws of generative model evaluation metrics and their unfair treatment of diffusion models

We systematically study a wide variety of image-based generative models ...
research
11/02/2020

Toward a Generalization Metric for Deep Generative Models

Measuring the generalization capacity of Deep Generative Models (DGMs) i...
research
01/31/2022

On the Robustness of Quality Measures for GANs

This work evaluates the robustness of quality measures of generative mod...
research
02/25/2020

I Am Going MAD: Maximum Discrepancy Competition for Comparing Classifiers Adaptively

The learning of hierarchical representations for image classification ha...
research
08/08/2023

Application-Oriented Benchmarking of Quantum Generative Learning Using QUARK

Benchmarking of quantum machine learning (QML) algorithms is challenging...
research
03/17/2021

Pros and Cons of GAN Evaluation Measures: New Developments

This work is an update of a previous paper on the same topic published a...
research
02/15/2022

Predictability and Surprise in Large Generative Models

Large-scale pre-training has recently emerged as a technique for creatin...

Please sign up or login with your details

Forgot password? Click here to reset