Algorithms for Sparse Support Vector Machines

10/14/2021
by   Alfonso Landeros, et al.
0

Many problems in classification involve huge numbers of irrelevant features. Model selection reveals the crucial features, reduces the dimensionality of feature space, and improves model interpretation. In the support vector machine literature, model selection is achieved by ℓ_1 penalties. These convex relaxations seriously bias parameter estimates toward 0 and tend to admit too many irrelevant features. The current paper presents an alternative that replaces penalties by sparse-set constraints. Penalties still appear, but serve a different purpose. The proximal distance principle takes a loss function L(β) and adds the penalty ρ/2dist(β, S_k)^2 capturing the squared Euclidean distance of the parameter vector β to the sparsity set S_k where at most k components of β are nonzero. If β_ρ represents the minimum of the objective f_ρ(β)=L(β)+ρ/2dist(β, S_k)^2, then β_ρ tends to the constrained minimum of L(β) over S_k as ρ tends to ∞. We derive two closely related algorithms to carry out this strategy. Our simulated and real examples vividly demonstrate how the algorithms achieve much better sparsity without loss of classification power.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/15/2011

Support vector machines/relevance vector machine for remote sensing classification: A review

Kernel-based machine learning algorithms are based on mapping data from ...
research
10/02/2007

Structured variable selection in support vector machines

When applying the support vector machine (SVM) to high-dimensional class...
research
12/24/2021

Optimal Model Averaging of Support Vector Machines in Diverging Model Spaces

Support vector machine (SVM) is a powerful classification method that ha...
research
05/13/2013

Mean field variational Bayesian inference for support vector machine classification

A mean field variational Bayes approach to support vector machines (SVMs...
research
07/24/2016

Scaling Up Sparse Support Vector Machines by Simultaneous Feature and Sample Reduction

Sparse support vector machine (SVM) is a popular classification techniqu...
research
05/22/2015

Greedy Biomarker Discovery in the Genome with Applications to Antimicrobial Resistance

The Set Covering Machine (SCM) is a greedy learning algorithm that produ...
research
06/27/2012

Sparse Support Vector Infinite Push

In this paper, we address the problem of embedded feature selection for ...

Please sign up or login with your details

Forgot password? Click here to reset