Safe Sample Screening for Support Vector Machines

01/27/2014
by Kohei Ogawa, et al.

Sparse classifiers such as the support vector machine (SVM) are efficient in the test phase because the classifier is characterized only by a subset of the samples, called support vectors (SVs); the remaining samples (non-SVs) have no influence on the classification result. However, this sparsity has not been fully exploited in the training phase, because it is generally difficult to know beforehand which samples will turn out to be SVs. In this paper, we introduce a new approach, called safe sample screening, that identifies a subset of the non-SVs and screens them out prior to the training phase. Our approach differs from existing heuristic approaches in that the screened samples are guaranteed to be non-SVs at the optimal solution. We investigate the advantage of safe sample screening through intensive numerical experiments and demonstrate that it can substantially decrease the computational cost of state-of-the-art SVM solvers such as LIBSVM. In the current big data era, we believe that safe sample screening would be of great practical importance, since the data size can be reduced without sacrificing the optimality of the final solution.
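
The sparsity property that the abstract relies on can be illustrated with a minimal Python sketch. It is not the screening rule from the paper, which the abstract does not spell out. It only shows that, once the dual coefficients are known, samples with a zero coefficient (non-SVs) contribute nothing to the decision value. The toy data, the coefficients in `alphas`, and the helper names `linear_kernel` and `decision_value` are all assumptions made for this illustration.

```python
# Illustration only: non-SVs (alpha == 0) do not affect the decision value.
# The data and alphas below are hand-picked toy values, not solver output.

def linear_kernel(u, v):
    """Plain dot product between two feature vectors."""
    return sum(a * b for a, b in zip(u, v))

def decision_value(x, samples, labels, alphas, bias, kernel=linear_kernel):
    """Kernel SVM decision value: sum_i alpha_i * y_i * K(x_i, x) + bias."""
    total = sum(a * y * kernel(s, x)
                for s, y, a in zip(samples, labels, alphas))
    return total + bias

# Four training samples; only the first two have non-zero coefficients.
samples = [(1.0, 1.0), (-1.0, -1.0), (3.0, 3.0), (-4.0, -2.0)]
labels = [1, -1, 1, -1]
alphas = [0.25, 0.25, 0.0, 0.0]   # assumed dual coefficients
bias = 0.0

# Keep only the support vectors, i.e. samples with alpha > 0.
keep = [i for i, a in enumerate(alphas) if a > 0.0]
sv_samples = [samples[i] for i in keep]
sv_labels = [labels[i] for i in keep]
sv_alphas = [alphas[i] for i in keep]

query = (2.0, -0.5)
full = decision_value(query, samples, labels, alphas, bias)
reduced = decision_value(query, sv_samples, sv_labels, sv_alphas, bias)
print(full, reduced)
```

Both calls return the same decision value, which is why discarding non-SVs is harmless once they are known. The harder problem addressed by the paper is certifying, before training, that a sample will end up with a zero coefficient.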

research
02/08/2016

Simultaneous Safe Screening of Features and Samples in Doubly Sparse Modeling

The problem of learning a sparse model is conceptually interpreted as th...
research
07/24/2016

Scaling Up Sparse Support Vector Machines by Simultaneous Feature and Sample Reduction

Sparse support vector machine (SVM) is a popular classification techniqu...
research
06/27/2012

Poisoning Attacks against Support Vector Machines

We investigate a family of poisoning attacks against Support Vector Mach...
research
06/13/2016

Specialized Support Vector Machines for open-set recognition

Often, when dealing with real-world recognition problems, we do not need...
research
01/19/2021

Utilizing Import Vector Machines to Identify Dangerous Pro-active Traffic Conditions

Traffic accidents have been a severe issue in metropolises with the deve...
research
12/17/2017

Super-sparse Learning in Similarity Spaces

In several applications, input samples are more naturally represented in...
research
06/08/2015

Distributed Training of Structured SVM

Training structured prediction models is time-consuming. However, most e...
