The Role of ImageNet Classes in Fréchet Inception Distance

03/11/2022
by Tuomas Kynkäänniemi, et al.

Fréchet Inception Distance (FID) is a metric for quantifying the distance between two distributions of images. Given its status as a standard yardstick for ranking models in data-driven generative modeling research, it seems important that the distance is computed from general, "vision-related" features. But is it? We observe that FID is essentially a distance between sets of ImageNet class probabilities. We trace the reason to the fact that the standard feature space, the penultimate "pre-logit" layer of a particular Inception-V3 classifier network, is only one affine transform away from the logits, i.e., ImageNet classes, and thus, the features are necessarily highly specialized to them. This has unintuitive consequences for the metric's sensitivity. For example, when evaluating a model for human faces, we observe that, on average, FID is actually very insensitive to the facial region, and that the probabilities of classes like "bow tie" or "seat belt" play a much larger role. Further, we show that FID can be significantly reduced – without actually improving the quality of results – by an attack that first generates a slightly larger set of candidates, and then chooses a subset that happens to match the histogram of such "fringe features" in the real data. We then demonstrate that this observation has practical relevance in the case of ImageNet pre-training of GANs, where a part of the observed FID improvement turns out not to be real. Our results suggest caution against over-interpreting FID improvements, and underline the need for distribution metrics that are more perceptually uniform.
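
For readers unfamiliar with how the metric is computed, the following is a minimal sketch of the Fréchet distance between two feature sets, each summarized by a Gaussian (mean and covariance). It is not the authors' code. The function name `frechet_distance` and the random arrays are illustrative assumptions; in actual FID the rows would be the pre-logit features of the Inception-V3 network, which this sketch does not extract.

```python
import numpy as np


def frechet_distance(feats_a, feats_b):
    """Frechet distance between Gaussians fitted to two feature sets.

    feats_a, feats_b: arrays of shape (num_images, feature_dim).
    Hypothetical helper for illustration; real FID uses Inception-V3
    pre-logit features as the rows.
    """
    mu_a = feats_a.mean(axis=0)
    mu_b = feats_b.mean(axis=0)
    cov_a = np.cov(feats_a, rowvar=False)
    cov_b = np.cov(feats_b, rowvar=False)

    # Tr((cov_a cov_b)^(1/2)) equals the sum of square roots of the
    # eigenvalues of cov_a @ cov_b, which are real and non-negative for
    # positive semi-definite inputs (up to numerical noise).
    eigvals = np.linalg.eigvals(cov_a @ cov_b)
    tr_sqrt = np.sqrt(np.clip(eigvals.real, 0.0, None)).sum()

    diff = mu_a - mu_b
    return float(diff @ diff + np.trace(cov_a) + np.trace(cov_b) - 2.0 * tr_sqrt)


# Stand-in data: random vectors instead of real network features.
rng = np.random.default_rng(0)
real_feats = rng.normal(size=(500, 8))
fake_feats = rng.normal(loc=0.1, size=(500, 8))
score = frechet_distance(real_feats, fake_feats)
```

Because the distance depends only on the mean and covariance of the chosen features, any specialization of those features toward ImageNet classes carries over directly into what the metric is sensitive to.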


