Model Selection as a Multiple Testing Procedure: Improving Akaike's Information Criterion

03/06/2018
by   Adrien Saumard, et al.
0

By interpreting the model selection problem as a multiple hypothesis testing task, we propose a modification of Akaike's Information Criterion that avoids overfitting, even when the sample size is small. We call this correction an over-penalization procedure. As a proof of concept, we show nonasymptotic optimality of our procedure for histogram selection in density estimation, by establishing sharp oracle inequalities for the Kullback-Leibler divergence. A strong feature of our theoretical results is that they include the estimation of unbounded log-densities. To do so, we prove several analytical and probabilistic lemmas that are of independent interest. In an experimental study, we also demonstrate state-of-the-art performance of our over-penalization procedure for bin size selection.

READ FULL TEXT
research
11/21/2018

Sharp non asymptotic oracle inequalities for non parametric computerized tomography model

We consider non parametric estimation problem for stochastic tomography ...
research
04/30/2021

Bayesian Information Criterion for Linear Mixed-effects Models

The use of Bayesian information criterion (BIC) in the model selection p...
research
03/31/2023

Bootstrapping multiple systems estimates to account for model selection

Multiple systems estimation is a standard approach to quantifying hidden...
research
09/06/2018

Sequential Model Selection Method for Nonparametric Autoregression

In this paper for the first time the nonparametric autoregression estima...
research
12/27/2022

Fast and fully-automated histograms for large-scale data sets

G-Enum histograms are a new fast and fully automated method for irregula...
research
07/14/2022

Have we been Naive to Select Machine Learning Models? Noisy Data are here to Stay!

The model selection procedure is usually a single-criterion decision mak...
research
05/03/2022

Complementary Goodness of Fit Procedure for Crash Frequency Models

This paper presents a new procedure for evaluating the goodness of fit o...

Please sign up or login with your details

Forgot password? Click here to reset