Revisiting the Evaluation of Image Synthesis with GANs

04/04/2023
by Mengping Yang, et al.

A good metric, which promises a reliable comparison between solutions, is essential to a well-defined task. Unlike most vision tasks that have per-sample ground-truth, image synthesis targets generating unseen data and hence is usually evaluated with a distributional distance between one set of real samples and another set of generated samples. This work provides an empirical study on the evaluation of synthesis performance by taking the popular generative adversarial networks (GANs) as a representative of generative models. In particular, we make in-depth analyses on how to represent a data point in the feature space, how to calculate a fair distance using selected samples, and how many instances to use from each set. Experiments on multiple datasets and settings suggest that (1) a group of models including both CNN-based and ViT-based architectures serve as reliable and robust feature extractors, (2) Centered Kernel Alignment (CKA) enables better comparison across various extractors and hierarchical layers in one model, and (3) CKA shows satisfactory sample efficiency and complements existing metrics (e.g., FID) in characterizing the similarity between two internal data correlations. These findings help us design a new measurement system, based on which we re-evaluate the state-of-the-art generative models in a consistent and reliable way.
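As a rough illustration of the kind of quantity the abstract refers to, the following is a minimal sketch of linear CKA between two feature matrices, using the commonly cited linear formulation. It is an assumption that this variant matches the one used in the paper; the function name `linear_cka` and the requirement that both sets contain the same number of samples are choices made for this sketch only, and the feature extractor that produces the matrices is left out.

```python
import numpy as np


def linear_cka(x, y):
    """Linear CKA between feature matrices x (n, d1) and y (n, d2).

    Hypothetical helper for illustration; assumes both sets hold n rows.
    """
    x = np.asarray(x, dtype=np.float64)
    y = np.asarray(y, dtype=np.float64)
    if x.shape[0] != y.shape[0]:
        raise ValueError("both feature sets must contain the same number of rows")

    # Center each feature dimension across samples.
    x = x - x.mean(axis=0, keepdims=True)
    y = y - y.mean(axis=0, keepdims=True)

    # Squared Frobenius norm of the cross-covariance, normalized by the
    # Frobenius norms of the two self-covariances.
    cross = np.linalg.norm(y.T @ x, ord="fro") ** 2
    norm_x = np.linalg.norm(x.T @ x, ord="fro")
    norm_y = np.linalg.norm(y.T @ y, ord="fro")
    return float(cross / (norm_x * norm_y))
```

In this form the score lies in [0, 1], equals 1 when the two matrices are identical, and is unchanged by orthogonal transformations and isotropic scaling of either feature space, which is one reason such a measure can be compared across different extractors and layers.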


