Feature subset selection for kernel SVM classification via mixed-integer optimization

05/28/2022
by   Ryuta Tamura, et al.
0

We study the mixed-integer optimization (MIO) approach to feature subset selection in nonlinear kernel support vector machines (SVMs) for binary classification. First proposed for linear regression in the 1970s, this approach has recently moved into the spotlight with advances in optimization algorithms and computer hardware. The goal of this paper is to establish an MIO approach for selecting the best subset of features for kernel SVM classification. To measure the performance of subset selection, we use the kernel-target alignment, which is the distance between the centroids of two response classes in a high-dimensional feature space. We propose a mixed-integer linear optimization (MILO) formulation based on the kernel-target alignment for feature subset selection, and this MILO problem can be solved to optimality using optimization software. We also derive a reduced version of the MILO problem to accelerate our MILO computations. Experimental results show good computational efficiency for our MILO formulation with the reduced problem. Moreover, our method can often outperform the linear-SVM-based MILO formulation and recursive feature elimination in prediction performance, especially when there are relatively few data instances.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/19/2015

Piecewise-Linear Approximation for Feature Subset Selection in a Sequential Logit Model

This paper concerns a method of selecting a subset of features for a seq...
research
02/15/2023

Variable Selection for Kernel Two-Sample Tests

We consider the variable selection problem for two-sample tests, aiming ...
research
08/07/2018

Efficient and Effective L_0 Feature Selection

Because of continuous advances in mathematical programing, Mix Integer O...
research
09/12/2022

Bilevel Optimization for Feature Selection in the Data-Driven Newsvendor Problem

We study the feature-based newsvendor problem, in which a decision-maker...
research
04/18/2013

Feature Elimination in Kernel Machines in moderately high dimensions

We develop an approach for feature elimination in statistical learning w...
research
11/16/2021

Multiclass Optimal Classification Trees with SVM-splits

In this paper we present a novel mathematical optimization-based methodo...
research
12/22/2017

Linear centralization classifier

A classification algorithm, called the Linear Centralization Classifier ...

Please sign up or login with your details

Forgot password? Click here to reset