Leverage classifier: Another look at support vector machine

08/23/2023
by Yixin Han, et al.

The support vector machine (SVM) is a popular classifier known for its accuracy, flexibility, and robustness. However, its computational cost has hindered its application to large-scale datasets. In this paper, we propose a new optimal leverage classifier based on the linear SVM in a nonseparable setting. Our classifier selects an informative subset of the training sample to reduce the data size, enabling efficient computation while maintaining high accuracy. We take a novel view of the SVM under the general subsampling framework and rigorously investigate its statistical properties. We propose a two-step subsampling procedure: a pilot step that estimates the optimal subsampling probabilities, followed by a subsampling step that constructs the classifier. We develop a new Bahadur representation of the SVM coefficients and derive the unconditional asymptotic distribution and the optimal subsampling probabilities without conditioning on the full sample. Numerical results demonstrate that our classifiers outperform existing methods in terms of estimation, computation, and prediction.
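
The two-step procedure described above can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not state the form of the optimal subsampling probabilities, so the sketch substitutes a placeholder score (the norm of the hinge-loss subgradient at the pilot estimate), mixed with a uniform component so that every point keeps a positive probability. The function names `fit_linear_svm` and `two_step_subsample_svm`, the mixing weight `mix`, and the subgradient-descent solver are all assumptions made for illustration.

```python
import numpy as np


def fit_linear_svm(X, y, weights=None, lam=1e-2, n_iter=500, lr=0.1):
    """Weighted linear SVM fitted by subgradient descent (illustrative solver).

    Minimizes a weighted mean hinge loss plus an L2 penalty on the slopes.
    Labels y must be in {-1, +1}. Returns [intercept, slopes...].
    """
    n, d = X.shape
    w = np.ones(n) if weights is None else np.asarray(weights, dtype=float)
    w = w / w.sum()                          # normalize to a weighted mean
    Xb = np.hstack([np.ones((n, 1)), X])     # prepend an intercept column
    beta = np.zeros(d + 1)
    for t in range(1, n_iter + 1):
        active = y * (Xb @ beta) < 1.0       # points with nonzero subgradient
        grad = -(w[active] * y[active]) @ Xb[active]
        grad[1:] += lam * beta[1:]           # intercept is not penalized
        beta -= (lr / np.sqrt(t)) * grad     # decaying step size
    return beta


def two_step_subsample_svm(X, y, n_pilot, n_sub, rng, mix=0.1):
    """Two-step subsampling sketch: pilot estimate, then weighted refit."""
    n = X.shape[0]
    Xb = np.hstack([np.ones((n, 1)), X])

    # Step 1: pilot fit on a uniform subsample.
    pilot_idx = rng.choice(n, size=n_pilot, replace=True)
    beta_pilot = fit_linear_svm(X[pilot_idx], y[pilot_idx])

    # Placeholder probabilities (ASSUMPTION, not the paper's optimal ones):
    # proportional to the hinge subgradient norm at the pilot estimate,
    # mixed with a uniform distribution to keep all probabilities positive.
    active = y * (Xb @ beta_pilot) < 1.0
    score = active * np.linalg.norm(Xb, axis=1)
    total = score.sum()
    informative = score / total if total > 0 else np.full(n, 1.0 / n)
    probs = (1.0 - mix) * informative + mix / n

    # Step 2: draw the subsample and refit with inverse-probability weights.
    sub_idx = rng.choice(n, size=n_sub, replace=True, p=probs)
    beta = fit_linear_svm(X[sub_idx], y[sub_idx], weights=1.0 / probs[sub_idx])
    return beta, probs
```

Calling `two_step_subsample_svm(X, y, n_pilot=200, n_sub=400, rng=np.random.default_rng(0))` on a feature matrix `X` and labels `y` in {-1, +1} returns the fitted coefficients and the sampling probabilities. The inverse-probability weights in the second step are what keep the subsample objective an unbiased estimate of the full-sample objective under nonuniform sampling.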

