On Buggy Resizing Libraries and Surprising Subtleties in FID Calculation

04/22/2021
by Gaurav Parmar, et al.

We investigate the sensitivity of the Fréchet Inception Distance (FID) score to inconsistent and often incorrect implementations across different image processing libraries. The FID score is widely used to evaluate generative models, but each FID implementation uses a different low-level image processing pipeline. Image resizing functions in commonly used deep learning libraries often introduce aliasing artifacts. We observe that numerous subtle choices need to be made when calculating FID, and that a lack of consistency in these choices can lead to vastly different FID scores. In particular, we show that the following choices are significant: (1) which image resizing library to use, (2) which interpolation kernel to use, and (3) which encoding to use when representing images. We additionally outline numerous common pitfalls that should be avoided and provide recommendations for computing the FID score accurately. We provide an easy-to-use, optimized implementation of our proposed recommendations in the accompanying code.
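To make the aliasing concern concrete, here is a minimal one-dimensional sketch in plain Python. It is an illustration only, not the paper's implementation or any library's resizing code; the helper names `downsample_naive` and `downsample_box` are hypothetical. It compares subsampling with no prefilter against averaging each block before subsampling, on a stripe pattern that alternates between black and white.

```python
def downsample_naive(signal, factor):
    """Keep every `factor`-th sample with no prefiltering (prone to aliasing)."""
    return signal[::factor]


def downsample_box(signal, factor):
    """Average each block of `factor` samples, i.e. apply a box prefilter."""
    usable = len(signal) - len(signal) % factor  # drop an incomplete final block
    return [sum(signal[i:i + factor]) / factor for i in range(0, usable, factor)]


# A row of alternating black (0) and white (255) pixels: the highest
# frequency a sampled signal can represent.
stripes = [0, 255] * 8

naive = downsample_naive(stripes, 2)  # only the black pixels survive
box = downsample_box(stripes, 2)      # each output is the mean of one black/white pair

print(naive)
print(box)
```

Without a prefilter, the half-size row is solid black, although the original row is mid-gray on average. The box-filtered row preserves that average. Differences of this kind between resizing functions change the images that the feature extractor sees, which is one way the choice of library and kernel can shift the resulting score.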

02/21/2015

MILJS : Brand New JavaScript Libraries for Matrix Calculation and Machine Learning

MILJS is a collection of state-of-the-art, platform-independent, scalabl...
01/16/2021

A Hitchhiker's Guide to Structural Similarity

The Structural Similarity (SSIM) Index is a very widely used image/video...
04/10/2021

MIPROT: A Medical Image Processing Toolbox for MATLAB

This paper presents a Matlab toolbox to perform basic image processing a...
12/10/2018

Functional Design of Computation Graph

Representing the control flow of a computer program as a computation gra...
03/19/2015

Implementation of a Practical Distributed Calculation System with Browsers and JavaScript, and Application to Distributed Deep Learning

Deep learning can achieve outstanding results in various fields. However...
04/29/2021

The Behavioral Diversity of Java JSON Libraries

JSON is a popular file and data format that is precisely specified by th...
09/20/2019

Deep Generative Models for Library Augmentation in Multiple Endmember Spectral Mixture Analysis

Multiple Endmember Spectral Mixture Analysis (MESMA) is one of the leadi...

Code Repositories

stargan-v2

StarGAN v2 - Official PyTorch Implementation (CVPR 2020)



clean-fid

PyTorch - FID calculation with proper image resizing and quantization steps

