Model Selection in High-Dimensional Misspecified Models

12/23/2014
by   Pallavi Basu, et al.
0

Model selection is indispensable to high-dimensional sparse modeling in selecting the best set of covariates among a sequence of candidate models. Most existing work assumes implicitly that the model is correctly specified or of fixed dimensions. Yet model misspecification and high dimensionality are common in real applications. In this paper, we investigate two classical Kullback-Leibler divergence and Bayesian principles of model selection in the setting of high-dimensional misspecified models. Asymptotic expansions of these principles reveal that the effect of model misspecification is crucial and should be taken into account, leading to the generalized AIC and generalized BIC in high dimensions. With a natural choice of prior probabilities, we suggest the generalized BIC with prior probability which involves a logarithmic factor of the dimensionality in penalizing model complexity. We further establish the consistency of the covariance contrast matrix estimator in a general setting. Our results and new method are supported by numerical studies.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/17/2018

Large-Scale Model Selection with Misspecification

Model selection is crucial to high-dimensional learning and inference fo...
research
03/14/2022

Bayesian inference on hierarchical nonlocal priors in generalized linear models

Variable selection methods with nonlocal priors have been widely studied...
research
09/04/2023

Generalized Information Criteria for Structured Sparse Models

Regularized m-estimators are widely used due to their ability of recover...
research
05/15/2019

Revisiting High Dimensional Bayesian Model Selection for Gaussian Regression

Model selection for regression problems with an increasing number of cov...
research
05/24/2023

Post-model-selection prediction for GLM's

We give two prediction intervals (PI) for Generalized Linear Models that...
research
09/06/2021

Bayesian data selection

Insights into complex, high-dimensional data can be obtained by discover...
research
03/23/2019

Bayesian Factor-adjusted Sparse Regression

This paper investigates the high-dimensional linear regression with high...

Please sign up or login with your details

Forgot password? Click here to reset