A General Model Validation and Testing Tool

08/29/2019
by   Kevin Vanslette, et al.
0

We construct and propose the "Bayesian Validation Metric" (BVM) as a general model validation and testing tool. We find the BVM to be capable of representing all of the standard validation metrics (square error, reliability, probability of agreement, frequentist, area, statistical hypothesis testing, and Bayesian model testing) as special cases and find that it can be used to improve, generalize, or further quantify their uncertainties. Thus, the BVM allows us to assess the similarities and differences between existing validation metrics in a new light. The BVM may be used to select models according to novel model validation comparison measures. We constructed the BVM ratio for the purpose of quantifying model selection under arbitrary definitions of agreement. This construction generalizes the Bayesian model testing framework. As an example of the versatility and effectiveness of our method, we formulated a quantitative comparison function to represent the visual inspection an engineer might use to validate a model. The BVM ratio leads to the correct selection of the preferable model in both the completely certain and uncertain cases.

READ FULL TEXT

page 13

page 14

research
11/26/2019

Generalized Bayesian Regression and Model Learning

We propose a generalized Bayesian regression and model learning tool bas...
research
10/26/2017

On the Ubiquity of Information Inconsistency for Conjugate Priors

Informally, "Information Inconsistency" is the property that has been ob...
research
11/26/2020

Development and Realization of Validation Benchmarks

In the field of modeling, the word validation refers to simple compariso...
research
05/17/2020

Marginal likelihood computation for model selection and hypothesis testing: an extensive review

This is an up-to-date introduction to, and overview of, marginal likelih...
research
06/16/2019

Designing Test Information and Test Information in Design

DeGroot (1962) developed a general framework for constructing Bayesian m...
research
04/30/2018

Automatic Metric Validation for Grammatical Error Correction

Metric validation in Grammatical Error Correction (GEC) is currently don...
research
06/07/2023

Personality testing of GPT-3: Limited temporal reliability, but highlighted social desirability of GPT-3's personality instruments results

To assess the potential applications and limitations of chatbot GPT-3 Da...

Please sign up or login with your details

Forgot password? Click here to reset