A Widely Applicable Bayesian Information Criterion

08/31/2012
by   Sumio Watanabe, et al.
0

A statistical model or a learning machine is called regular if the map taking a parameter to a probability distribution is one-to-one and if its Fisher information matrix is always positive definite. If otherwise, it is called singular. In regular statistical models, the Bayes free energy, which is defined by the minus logarithm of Bayes marginal likelihood, can be asymptotically approximated by the Schwarz Bayes information criterion (BIC), whereas in singular models such approximation does not hold. Recently, it was proved that the Bayes free energy of a singular model is asymptotically given by a generalized formula using a birational invariant, the real log canonical threshold (RLCT), instead of half the number of parameters in BIC. Theoretical values of RLCTs in several statistical models are now being discovered based on algebraic geometrical methodology. However, it has been difficult to estimate the Bayes free energy using only training samples, because an RLCT depends on an unknown true distribution. In the present paper, we define a widely applicable Bayesian information criterion (WBIC) by the average log likelihood function over the posterior distribution with the inverse temperature 1/ n, where n is the number of training samples. We mathematically prove that WBIC has the same asymptotic expansion as the Bayes free energy, even if a statistical model is singular for and unrealizable by a statistical model. Since WBIC can be numerically calculated without any information about a true distribution, it is a generalized version of BIC onto singular statistical models.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/28/2019

On the overestimation of widely applicable Bayesian information criterion

A widely applicable Bayesian information criterion (Watanabe, 2013) is a...
research
03/14/2022

Asymptotic Behavior of Bayesian Generalization Error in Multinomial Mixtures

Multinomial mixtures are widely used in the information engineering fiel...
research
06/02/2015

An objective prior that unifies objective Bayes and information-based inference

There are three principle paradigms of statistical inference: (i) Bayesi...
research
12/07/2022

Criterion for the resemblance between the mother and the model distribution

If the probability distribution model aims to approximate the hidden mot...
research
12/31/2019

Bayesian Generalization Error of Poisson Mixture and Simplex Vandermonde Matrix Type Singularity

A Poisson mixture is one of the practically important models in computer...
research
06/04/2019

Estimating Real Log Canonical Thresholds

Evaluation of the marginal likelihood plays an important role in model s...
research
11/18/2022

Recent Advances in Algebraic Geometry and Bayesian Statistics

This article is a review of theoretical advances in the research field o...

Please sign up or login with your details

Forgot password? Click here to reset