PanIC: consistent information criteria for general model selection problems

03/07/2023
by Hien Duy Nguyen, et al.

Model selection is a ubiquitous problem that arises in the application of many statistical and machine learning methods. In the likelihood and related settings, it is typical to use the method of information criteria (IC) to choose the most parsimonious among competing models by penalizing the likelihood-based objective function. Theorems guaranteeing the consistency of IC are often difficult to verify and tend to be bespoke to a specific problem. We present a set of results that guarantee consistency for a class of IC, which we call PanIC (from the Greek root 'pan', meaning 'of everything'), with easily verifiable regularity conditions. The PanIC are applicable in any loss-based learning problem and are not exclusive to likelihood problems. We illustrate the verification of regularity conditions for model selection problems regarding finite mixture models, least absolute deviation and support vector regression, and principal component analysis, and we demonstrate the effectiveness of the PanIC for such problems via numerical simulations. Furthermore, we present new sufficient conditions for the consistency of BIC-like estimators and provide comparisons of the BIC to the PanIC.
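
To make the idea of penalizing a likelihood-based objective concrete, the following is a minimal sketch of IC-based model selection using the standard Gaussian BIC, not the PanIC penalty itself, whose form is not given in this abstract. The polynomial regression setting, the simulated data, and the helper names `gaussian_bic` and `select_degree` are assumptions made purely for illustration.

```python
import numpy as np


def gaussian_bic(y, y_hat, n_params):
    """Standard BIC for a Gaussian regression model, up to an additive constant.

    Goodness of fit enters through n * log(RSS / n); the penalty is
    n_params * log(n). This is the usual BIC, not the PanIC penalty.
    """
    n = y.size
    rss = float(np.sum((y - y_hat) ** 2))
    return n * np.log(rss / n) + n_params * np.log(n)


def select_degree(x, y, max_degree):
    """Fit polynomials of degree 0..max_degree and return the BIC minimizer."""
    scores = {}
    for degree in range(max_degree + 1):
        coefs = np.polyfit(x, y, degree)      # least-squares fit
        y_hat = np.polyval(coefs, x)          # fitted values
        scores[degree] = gaussian_bic(y, y_hat, degree + 1)
    best = min(scores, key=scores.get)
    return best, scores


# Hypothetical simulated data: a quadratic signal with Gaussian noise.
rng = np.random.default_rng(0)
x = rng.uniform(-2.0, 2.0, 500)
y = 1.0 + 2.0 * x - 1.5 * x**2 + rng.normal(0.0, 0.5, 500)

best_degree, scores = select_degree(x, y, max_degree=5)
print("selected degree:", best_degree)
```

The candidate with the smallest criterion value is selected. Swapping in a different penalty term only requires changing the second summand in `gaussian_bic`, which is the sense in which a family of criteria can share one selection procedure.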

research
07/23/2019

Consistent model selection criteria and goodness-of-fit test for affine causal processes

This paper studies the model selection problem in a large class of causa...
research
08/10/2019

Law of the Iterated Logarithm and Model Selection Consistency for Independent and Dependent GLMs

We study the law of the iterated logarithm (LIL) for the maximum likelih...
research
11/28/2018

Asymptotic Analysis of Model Selection Criteria for General Hidden Markov Models

The paper obtains analytical results for the asymptotic properties of Mo...
research
08/23/2013

Likelihood Adaptively Modified Penalties

A new family of penalty functions, adaptive to likelihood, is introduced...
research
09/04/2023

Generalized Information Criteria for Structured Sparse Models

Regularized m-estimators are widely used due to their ability of recover...
research
08/20/2020

Strong consistent model selection for general causal time series

We consider the strongly consistent question for model selection in a la...
research
10/25/2018

Model Selection using Multi-Objective Optimization

Choices in scientific research and management require balancing multiple...