Multifold Cross-Validation Model Averaging for Generalized Additive Partial Linear Models

12/05/2022
by Ze Chen, et al.

Generalized additive partial linear models (GAPLMs) are appealing for both model interpretation and prediction. In practice, however, it is often difficult to determine which covariates to include and how much smoothing to apply in the nonparametric parts. To address this model selection uncertainty, we develop a computationally feasible model averaging (MA) procedure. The model weights are data-driven and are selected by multifold cross-validation (CV) rather than leave-one-out CV, which reduces the computational cost. When all the candidate models are misspecified, we show that the proposed MA estimator for GAPLMs is asymptotically optimal in the sense of achieving the lowest possible Kullback-Leibler loss. When the candidate model set contains at least one correct model, the weights chosen by multifold CV are asymptotically concentrated on the correct models. As a by-product, we propose a variable importance measure, based on the MA weights, that quantifies the importance of the predictors in GAPLMs. It is shown to asymptotically identify the variables in the true model. Moreover, a model screening method is provided for the case where the number of candidate models is very large. Numerical experiments show that the proposed MA method outperforms some existing model averaging and selection methods.
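The weight-selection step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes a binary response, that each candidate model has already produced out-of-fold linear predictors from a multifold split, and that averaging is done on the linear-predictor scale. The names `cv_weights` and `variable_importance`, the exponentiated-gradient solver, and the "sum of weights of models containing a variable" importance rule are all illustrative assumptions, not details taken from the paper.

```python
import math

def sigmoid(t):
    # Numerically stable logistic function.
    if t >= 0:
        return 1.0 / (1.0 + math.exp(-t))
    e = math.exp(t)
    return e / (1.0 + e)

def cv_weights(eta_cv, y, steps=500, lr=0.5):
    """Choose simplex weights minimizing the Bernoulli CV negative log-likelihood.

    eta_cv[m][i]: out-of-fold linear predictor of candidate model m for
                  observation i (assumed computed beforehand by multifold CV).
    y[i]:         binary response in {0, 1}.
    """
    M, n = len(eta_cv), len(y)
    w = [1.0 / M] * M  # start from equal weights
    for _ in range(steps):
        # Averaged linear predictor under the current weights.
        eta = [sum(w[m] * eta_cv[m][i] for m in range(M)) for i in range(n)]
        resid = [sigmoid(e) - yi for e, yi in zip(eta, y)]
        # Gradient of the mean negative log-likelihood with respect to each weight.
        grad = [sum(r * eta_cv[m][i] for i, r in enumerate(resid)) / n
                for m in range(M)]
        # Exponentiated-gradient update keeps the weights on the simplex.
        w = [wm * math.exp(-lr * g) for wm, g in zip(w, grad)]
        total = sum(w)
        w = [wm / total for wm in w]
    return w

def variable_importance(w, model_vars):
    # Hypothetical rule: importance of a variable is the total weight
    # of the candidate models that include it.
    names = set().union(*model_vars)
    return {v: sum(wm for wm, vs in zip(w, model_vars) if v in vs)
            for v in names}

# Toy usage: model 0 is informative, model 1 is unrelated to the response.
y = [0, 1, 0, 1] * 5
eta_model0 = [2.0 if yi == 1 else -2.0 for yi in y]
eta_model1 = [1.0, 1.0, -1.0, -1.0] * 5
w = cv_weights([eta_model0, eta_model1], y)
vi = variable_importance(w, [{"x1", "x2"}, {"x2"}])
```

In this toy setting the weight should concentrate on the informative model, which mirrors, in a very simplified way, the concentration property stated in the abstract. A real application would replace the hand-made linear predictors with fits of the candidate GAPLMs on each training fold.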

research
12/30/2021

Optimal model averaging for single-index models with divergent dimensions

This paper offers a new approach to address the model uncertainty in (po...
research
05/03/2021

Model Averaging by Cross-validation for Partially Linear Functional Additive Models

We consider averaging a number of candidate models to produce a predicti...
research
08/14/2020

Bayesian model selection in additive partial linear models via locally adaptive splines

We consider a model selection problem for additive partial linear models...
research
12/24/2020

On Statistical Efficiency in Learning

A central issue of many statistical learning problems is to select an ap...
research
12/29/2021

Model Averaging for Support Vector Machine by J-fold Cross-Validation

Support vector machine (SVM) is a classical tool to deal with classifica...
research
10/25/2017

Model Averaging for Generalized Linear Model with Covariates that are Missing completely at Random

In this paper, we consider the estimation of generalized linear models w...
research
09/04/2023

Frequentist Model Averaging for Global Fréchet Regression

To consider model uncertainty in global Fréchet regression and improve d...
