Structured variable selection in support vector machines

10/02/2007
by Seongho Wu, et al.

When applying the support vector machine (SVM) to high-dimensional classification problems, we often impose a sparse structure on the SVM to eliminate the influence of irrelevant predictors. The lasso and other variable selection techniques have been used successfully in the SVM to perform automatic variable selection. In some problems, there is a natural hierarchical structure among the variables. To obtain an interpretable SVM classifier in such problems, it is important to respect the heredity principle when enforcing sparsity. Many variable selection methods, however, do not respect the heredity principle. In this paper we enforce both sparsity and the heredity principle in the SVM by using the structured variable selection (SVS) framework originally proposed in Yuan, Joseph and Zou (2007). We minimize the empirical hinge loss under a set of linear inequality constraints and a lasso-type penalty. The solution always obeys the desired heredity principle and is sparse. The new SVM classifier can be fitted efficiently because the optimization problem is a linear program. Another contribution of this work is a nonparametric extension of the SVS framework, which yields nonparametric heredity SVMs. Simulated and real data are used to illustrate the merits of the proposed method.
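The abstract describes the fitting problem only in words, so the following is a minimal sketch of how a hinge loss with a lasso-type penalty and linear heredity constraints can be posed as a linear program. The specific formulation is an assumption, not taken from the paper: each coefficient is split into nonnegative parts, and each child term's magnitude is bounded by the corresponding parent's parts. The function name `fit_heredity_svm`, the `parents` mapping, and the tuning value `lam` are hypothetical.

```python
import numpy as np
from scipy.optimize import linprog


def fit_heredity_svm(X, y, parents, lam=0.1):
    """Sketch of an L1-penalized hinge-loss LP with heredity-style constraints.

    X       : (n, p) design matrix (main effects and, e.g., interaction columns)
    y       : (n,) labels in {-1, +1}
    parents : dict mapping a child column index to a list of parent column indices
              (hypothetical encoding of the hierarchy, assumed for illustration)
    lam     : lasso-type penalty weight
    """
    y = np.asarray(y, dtype=float)
    n, p = X.shape
    # Variable layout: [b_pos (p), b_neg (p), b0_pos, b0_neg, xi (n)], all >= 0.
    nvar = 2 * p + 2 + n
    cost = np.concatenate([lam * np.ones(2 * p), [0.0, 0.0], np.ones(n) / n])

    # Margin constraints y_i (b0 + x_i' b) >= 1 - xi_i, rewritten as A z <= -1.
    yX = y[:, None] * X
    A_margin = np.hstack([-yX, yX, -y[:, None], y[:, None], -np.eye(n)])
    b_margin = -np.ones(n)

    # Assumed heredity constraints:
    # b_pos[child] + b_neg[child] <= b_pos[parent] + b_neg[parent].
    rows = []
    for child, pars in parents.items():
        for par in pars:
            r = np.zeros(nvar)
            r[child] = r[p + child] = 1.0
            r[par] = r[p + par] = -1.0
            rows.append(r)

    A_ub = np.vstack([A_margin] + rows) if rows else A_margin
    b_ub = np.concatenate([b_margin, np.zeros(len(rows))])

    res = linprog(cost, A_ub=A_ub, b_ub=b_ub, bounds=(0, None), method="highs")
    z = res.x
    beta = z[:p] - z[p:2 * p]          # signed coefficients
    mags = z[:p] + z[p:2 * p]          # sum of the split parts, bounds |beta|
    b0 = z[2 * p] - z[2 * p + 1]       # unpenalized intercept
    return beta, b0, mags
```

For a model with two main effects in columns 0 and 1 and their interaction in column 2, a call might look like `fit_heredity_svm(X, y, {2: [0, 1]}, lam=0.05)`. In this sketch the constraints bound the child's magnitude by the sum of the parent's split parts, so it is worth checking on the fitted solution that at most one part of each pair is nonzero; the paper's own formulation and its guarantee may differ in detail.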


