Moments and Root-Mean-Square Error of the Bayesian MMSE Estimator of Classification Error in the Gaussian Model

10/05/2013
by   Amin Zollanvari, et al.
0

The most important aspect of any classifier is its error rate, because this quantifies its predictive capacity. Thus, the accuracy of error estimation is critical. Error estimation is problematic in small-sample classifier design because the error must be estimated using the same data from which the classifier has been designed. Use of prior knowledge, in the form of a prior distribution on an uncertainty class of feature-label distributions to which the true, but unknown, feature-distribution belongs, can facilitate accurate error estimation (in the mean-square sense) in circumstances where accurate completely model-free error estimation is impossible. This paper provides analytic asymptotically exact finite-sample approximations for various performance metrics of the resulting Bayesian Minimum Mean-Square-Error (MMSE) error estimator in the case of linear discriminant analysis (LDA) in the multivariate Gaussian model. These performance metrics include the first, second, and cross moments of the Bayesian MMSE error estimator with the true error of LDA, and therefore, the Root-Mean-Square (RMS) error of the estimator. We lay down the theoretical groundwork for Kolmogorov double-asymptotics in a Bayesian setting, which enables us to derive asymptotic expressions of the desired performance metrics. From these we produce analytic finite-sample approximations and demonstrate their accuracy via numerical examples. Various examples illustrate the behavior of these approximations and their use in determining the necessary sample size to achieve a desired RMS. The Supplementary Material contains derivations for some equations and added figures.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/28/2017

Estimation and prediction of Gaussian processes using generalized Cauchy covariance model under fixed domain asymptotics

We study estimation and prediction of Gaussian processes with covariance...
research
08/11/2022

Statistical parameters for assessing environmental model performance related to sample size: Case study in ocean color remote sensing

Environmental model performances need to be assessed using some statisti...
research
09/05/2021

Robust Importance Sampling for Error Estimation in the Context of Optimal Bayesian Transfer Learning

Classification has been a major task for building intelligent systems as...
research
01/18/2011

Minimum mean square distance estimation of a subspace

We consider the problem of subspace estimation in a Bayesian setting. Si...
research
08/06/2019

On cylindrical regression in three-dimensional Euclidean space

The three-dimensional cylindrical regression problem is a problem of fin...
research
01/31/2020

Semi-Exact Control Functionals From Sard's Method

This paper focuses on the numerical computation of posterior expected qu...
research
09/01/2021

Measuring Information from Moments

We investigate the problem of representing information measures in terms...

Please sign up or login with your details

Forgot password? Click here to reset