On Buggy Resizing Libraries and Surprising Subtleties in FID Calculation

04/22/2021
by   Gaurav Parmar, et al.
9

We investigate the sensitivity of the Fréchet Inception Distance (FID) score to inconsistent and often incorrect implementations across different image processing libraries. FID score is widely used to evaluate generative models, but each FID implementation uses a different low-level image processing process. Image resizing functions in commonly-used deep learning libraries often introduce aliasing artifacts. We observe that numerous subtle choices need to be made for FID calculation and a lack of consistencies in these choices can lead to vastly different FID scores. In particular, we show that the following choices are significant: (1) selecting what image resizing library to use, (2) choosing what interpolation kernel to use, (3) what encoding to use when representing images. We additionally outline numerous common pitfalls that should be avoided and provide recommendations for computing the FID score accurately. We provide an easy-to-use optimized implementation of our proposed recommendations in the accompanying code.

READ FULL TEXT

page 3

page 4

page 5

research
02/21/2015

MILJS : Brand New JavaScript Libraries for Matrix Calculation and Machine Learning

MILJS is a collection of state-of-the-art, platform-independent, scalabl...
research
01/16/2021

A Hitchhiker's Guide to Structural Similarity

The Structural Similarity (SSIM) Index is a very widely used image/video...
research
04/10/2021

MIPROT: A Medical Image Processing Toolbox for MATLAB

This paper presents a Matlab toolbox to perform basic image processing a...
research
12/10/2018

Functional Design of Computation Graph

Representing the control flow of a computer program as a computation gra...
research
03/19/2015

Implementation of a Practical Distributed Calculation System with Browsers and JavaScript, and Application to Distributed Deep Learning

Deep learning can achieve outstanding results in various fields. However...
research
04/29/2021

The Behavioral Diversity of Java JSON Libraries

JSON is a popular file and data format that is precisely specified by th...
research
09/20/2019

Deep Generative Models for Library Augmentation in Multiple Endmember Spectral Mixture Analysis

Multiple Endmember Spectral Mixture Analysis (MESMA) is one of the leadi...

Please sign up or login with your details

Forgot password? Click here to reset