Distributed estimation of principal support vector machines for sufficient dimension reduction

11/28/2019
by   Jun Jin, et al.
0

The principal support vector machines method (Li et al., 2011) is a powerful tool for sufficient dimension reduction that replaces original predictors with their low-dimensional linear combinations without loss of information. However, the computational burden of the principal support vector machines method constrains its use for massive data. To address this issue, we in this paper propose two distributed estimation algorithms for fast implementation when the sample size is large. Both the two distributed sufficient dimension reduction estimators enjoy the same statistical efficiency as merging all the data together, which provides rigorous statistical guarantees for their application to large scale datasets. The two distributed algorithms are further adapt to principal weighted support vector machines (Shin et al., 2016) for sufficient dimension reduction in binary classification. The statistical accuracy and computational complexity of our proposed methods are examined through comprehensive simulation studies and a real data application with more than 600000 samples.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/12/2020

On sufficient dimension reduction via principal asymmetric least squares

In this paper, we introduce principal asymmetric least squares (PALS) as...
research
05/25/2022

Linear Algorithms for Nonparametric Multiclass Probability Estimation

Multiclass probability estimation is the problem of estimating condition...
research
07/11/2012

Applying Discrete PCA in Data Analysis

Methods for analysis of principal components in discrete data have exist...
research
05/19/2023

A Foray into Parallel Optimisation Algorithms for High Dimension Low Sample Space Generalized Distance Weighted Discrimination problems

In many modern data sets, High dimension low sample size (HDLSS) data is...
research
03/13/2019

Distributed and Streaming Linear Programming in Low Dimensions

We study linear programming and general LP-type problems in several big ...
research
04/19/2012

Speech Recognition: Increasing Efficiency of Support Vector Machines

With the advancement of communication and security technologies, it has ...

Please sign up or login with your details

Forgot password? Click here to reset