Feature Likelihood Score: Evaluating Generalization of Generative Models Using Samples

02/09/2023
by Marco Jiralerspong, et al.

Deep generative models have demonstrated the ability to generate complex, high-dimensional, and photo-realistic data. However, a unified framework for evaluating different generative modeling families remains an open challenge. Likelihood-based metrics do not apply in many cases, while purely sample-based metrics such as FID fail to capture known failure modes such as overfitting on training data. In this work, we introduce the Feature Likelihood Score (FLS), a parametric sample-based score that uses density estimation to quantitatively measure the quality and diversity of generated samples while taking overfitting into account. We empirically demonstrate the ability of FLS to identify specific overfitting problem cases, even when previously proposed metrics fail. We further perform an extensive experimental evaluation on various image datasets and model classes. Our results indicate that FLS matches the intuitions of previous metrics, such as FID, while providing a more holistic evaluation of generative models that highlights models whose generalization abilities are under- or overappreciated. Code for computing FLS is provided at https://github.com/marcojira/fls
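
To make the idea of a density-estimation score that penalizes overfitting more concrete, here is a minimal sketch. It is not the authors' implementation (see the repository above for that), and every name in it (`mean_log_likelihood`, `density_score`, the bandwidth grid) is a hypothetical choice made for illustration. It assumes features have already been extracted into NumPy arrays, places an equal-weight isotropic Gaussian on each generated sample, picks a single bandwidth by maximizing the likelihood of the training features, and then reports the mean log-likelihood of held-out test features.

```python
import numpy as np


def mean_log_likelihood(points, centers, sigma):
    """Mean log-density of `points` under an equal-weight isotropic
    Gaussian mixture with one component per row of `centers`."""
    d = points.shape[1]
    # Pairwise squared distances, shape (n_points, n_centers).
    sq = ((points[:, None, :] - centers[None, :, :]) ** 2).sum(axis=-1)
    log_comp = -sq / (2.0 * sigma**2) - 0.5 * d * np.log(2.0 * np.pi * sigma**2)
    # Numerically stable log-mean-exp over the mixture components.
    m = log_comp.max(axis=1, keepdims=True)
    log_mix = m[:, 0] + np.log(np.exp(log_comp - m).mean(axis=1))
    return float(log_mix.mean())


def density_score(train, test, generated, sigmas=np.logspace(-3, 1, 30)):
    """Hypothetical score: fit the bandwidth on train features, evaluate on test.

    Returns (mean test log-likelihood, selected bandwidth)."""
    best_sigma = max(sigmas, key=lambda s: mean_log_likelihood(train, generated, s))
    return mean_log_likelihood(test, generated, best_sigma), float(best_sigma)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    train = rng.normal(size=(200, 8))   # stand-in for training-set features
    test = rng.normal(size=(200, 8))    # stand-in for held-out features
    fresh = rng.normal(size=(200, 8))   # a "model" that samples the true distribution
    copied = train + 1e-4 * rng.normal(size=train.shape)  # a "model" that memorizes

    print("fresh :", density_score(train, test, fresh))
    print("copied:", density_score(train, test, copied))
```

In this toy setup, near-copies of the training set drive the selected bandwidth toward the smallest value in the grid, because the training points sit almost exactly on the mixture centers. The resulting density is sharply peaked around memorized points and assigns very low likelihood to held-out data, which is one way a density-based score can register overfitting where a purely distributional distance might not.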


