Model Averaging for Support Vector Machine by J-fold Cross-Validation

12/29/2021
by Jiahui Zou, et al.

Support vector machine (SVM) is a classical classification tool that is widely used in biology, statistics, and machine learning, and that performs well in small-sample, high-dimensional settings. This paper proposes a model averaging method, called SVMMA, to address the uncertainty about which covariates to include in an SVM and to improve its predictive ability. We provide a criterion for choosing the weights that combine the candidate models, each of which is built from a different subset of the covariates. To construct the candidate model set, we suggest a screening-averaging procedure in practice. In particular, the model averaging estimator is proved to be asymptotically optimal in the sense of achieving the lowest hinge risk among all possible combinations. Finally, we conduct simulations to compare the proposed method with several other model selection, model averaging, and ensemble learning methods, and apply it to four real datasets.
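
The abstract does not spell out the weight-selection criterion, so the following is only a minimal sketch of one plausible reading: weights are restricted to the unit simplex (an assumption) and chosen to minimize the average hinge loss of the combined out-of-fold decision values from a J-fold split. The function names, the coarse grid search, and the toy scores are all hypothetical illustrations, not the authors' implementation; in practice the out-of-fold scores would come from SVMs fitted on the candidate covariate subsets.

```python
from itertools import product


def hinge_loss(y, score):
    """Hinge loss for a label y in {-1, +1} and a real-valued decision score."""
    return max(0.0, 1.0 - y * score)


def cv_hinge_criterion(weights, oof_scores, labels):
    """Mean hinge loss of the weighted combination of out-of-fold scores.

    oof_scores[i][m] is the decision value of candidate model m for sample i,
    assumed to come from a fit in which the fold containing sample i was held out.
    """
    total = 0.0
    for row, y in zip(oof_scores, labels):
        combined = sum(w * s for w, s in zip(weights, row))
        total += hinge_loss(y, combined)
    return total / len(labels)


def simplex_grid(n_models, steps):
    """Yield every weight vector with entries k/steps that sums to 1."""
    for ks in product(range(steps + 1), repeat=n_models - 1):
        if sum(ks) <= steps:
            last = steps - sum(ks)
            yield tuple(k / steps for k in ks) + (last / steps,)


def search_weights(oof_scores, labels, steps=10):
    """Grid search over the simplex (an assumed constraint set) for the
    weights with the smallest cross-validated hinge criterion."""
    n_models = len(oof_scores[0])
    return min(
        simplex_grid(n_models, steps),
        key=lambda w: cv_hinge_criterion(w, oof_scores, labels),
    )


# Made-up out-of-fold decision values for two candidate models (toy data).
labels = [1, 1, -1, -1]
oof_scores = [
    [2.0, -0.5],
    [0.2, 1.5],
    [-1.5, 0.3],
    [0.4, -2.0],
]

best = search_weights(oof_scores, labels, steps=10)
print("weights:", best)
print("criterion:", cv_hinge_criterion(best, oof_scores, labels))
```

A grid search is used here only because it is easy to verify by hand for a few candidate models; with many candidates, a convex solver would be the more natural choice, since the criterion is a convex function of the weights.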

research
12/24/2021

Optimal Model Averaging of Support Vector Machines in Diverging Model Spaces

Support vector machine (SVM) is a powerful classification method that ha...
research
12/30/2021

Optimal model averaging for single-index models with divergent dimensions

This paper offers a new approach to address the model uncertainty in (po...
research
12/05/2022

Multifold Cross-Validation Model Averaging for Generalized Additive Partial Linear Models

Generalized additive partial linear models (GAPLMs) are appealing for mo...
research
10/27/2019

Jackknife Model Averaging for Composite Quantile Regression

Model averaging considers the model uncertainty and is an alternative to...
research
05/05/2013

Efficient Estimation of the number of neighbours in Probabilistic K Nearest Neighbour Classification

Probabilistic k-nearest neighbour (PKNN) classification has been introdu...
research
04/24/2023

A Semi-parametric Promotion Time Cure Model with Support Vector Machine

The promotion time cure rate model (PCM) is an extensively studied model...
research
05/11/2020

Ensembled sparse-input hierarchical networks for high-dimensional datasets

Neural networks have seen limited use in prediction for high-dimensional...
