F?D: On understanding the role of deep feature spaces on face generation evaluation

05/31/2023
by   Krish Kabra, et al.
0

Perceptual metrics, like the Fréchet Inception Distance (FID), are widely used to assess the similarity between synthetically generated and ground truth (real) images. The key idea behind these metrics is to compute errors in a deep feature space that captures perceptually and semantically rich image features. Despite their popularity, the effect that different deep features and their design choices have on a perceptual metric has not been well studied. In this work, we perform a causal analysis linking differences in semantic attributes and distortions between face image distributions to Fréchet distances (FD) using several popular deep feature spaces. A key component of our analysis is the creation of synthetic counterfactual faces using deep face generators. Our experiments show that the FD is heavily influenced by its feature space's training dataset and objective function. For example, FD using features extracted from ImageNet-trained models heavily emphasize hats over regions like the eyes and mouth. Moreover, FD using features from a face gender classifier emphasize hair length more than distances in an identity (recognition) feature space. Finally, we evaluate several popular face generation models across feature spaces and find that StyleGAN2 consistently ranks higher than other face generators, except with respect to identity (recognition) features. This suggests the need for considering multiple feature spaces when evaluating generative models and using feature spaces that are tuned to nuances of the domain of interest.

READ FULL TEXT
research
08/10/2023

Benchmarking Algorithmic Bias in Face Recognition: An Experimental Approach Using Synthetic Faces and Human Evaluation

We propose an experimental method for measuring bias in face recognition...
research
06/17/2022

Rarity Score : A New Metric to Evaluate the Uncommonness of Synthesized Images

Evaluation metrics in image synthesis play a key role to measure perform...
research
08/19/2022

Demystifying Randomly Initialized Networks for Evaluating Generative Models

Evaluation of generative models is mostly based on the comparison betwee...
research
07/20/2023

BlendFace: Re-designing Identity Encoders for Face-Swapping

The great advancements of generative adversarial networks and face recog...
research
03/11/2022

The Role of ImageNet Classes in Fréchet Inception Distance

Fréchet Inception Distance (FID) is a metric for quantifying the distanc...
research
09/06/2020

The role of feature space in atomistic learning

Efficient, physically-inspired descriptors of the structure and composit...
research
11/29/2022

Approximating Intersections and Differences Between Statistical Shape Models

To date, the comparison of Statistical Shape Models (SSMs) is often sole...

Please sign up or login with your details

Forgot password? Click here to reset