Towards good validation metrics for generative models in offline model-based optimisation

11/19/2022
by   Christopher Beckham, et al.
0

In this work we propose a principled evaluation framework for model-based optimisation to measure how well a generative model can extrapolate. We achieve this by interpreting the training and validation splits as draws from their respective `truncated' ground truth distributions, where examples in the validation set contain scores much larger than those in the training set. Model selection is performed on the validation set for some prescribed validation metric. A major research question however is in determining what validation metric correlates best with the expected value of generated candidates with respect to the ground truth oracle; work towards answering this question can translate to large economic gains since it is expensive to evaluate the ground truth oracle in the real world. We compare various validation metrics for generative adversarial networks using our framework. We also discuss limitations with our framework with respect to existing datasets and how progress can be made to mitigate them.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/28/2019

PT-MMD: A Novel Statistical Framework for the Evaluation of Generative Systems

Stochastic-sampling-based Generative Neural Networks, such as Restricted...
research
01/17/2022

Evaluation of HTR models without Ground Truth Material

The evaluation of Handwritten Text Recognition (HTR) models during their...
research
05/01/2020

KPQA: A Metric for Generative Question Answering Using Word Weights

For the automatic evaluation of Generative Question Answering (genQA) sy...
research
02/09/2021

Benchmarks, Algorithms, and Metrics for Hierarchical Disentanglement

In representation learning, there has been recent interest in developing...
research
07/06/2020

Generative Model-Based Loss to the Rescue: A Method to Overcome Annotation Errors for Depth-Based Hand Pose Estimation

We propose to use a model-based generative loss for training hand pose e...
research
04/04/2023

Revisiting the Evaluation of Image Synthesis with GANs

A good metric, which promises a reliable comparison between solutions, i...
research
06/29/2023

TemperatureGAN: Generative Modeling of Regional Atmospheric Temperatures

Stochastic generators are useful for estimating climate impacts on vario...

Please sign up or login with your details

Forgot password? Click here to reset