Another Look at DWD: Thrifty Algorithm and Bayes Risk Consistency in RKHS

08/24/2015
by   Boxiang Wang, et al.
0

Distance weighted discrimination (DWD) is a margin-based classifier with an interesting geometric motivation. DWD was originally proposed as a superior alternative to the support vector machine (SVM), however DWD is yet to be popular compared with the SVM. The main reasons are twofold. First, the state-of-the-art algorithm for solving DWD is based on the second-order-cone programming (SOCP), while the SVM is a quadratic programming problem which is much more efficient to solve. Second, the current statistical theory of DWD mainly focuses on the linear DWD for the high-dimension-low-sample-size setting and data-piling, while the learning theory for the SVM mainly focuses on the Bayes risk consistency of the kernel SVM. In fact, the Bayes risk consistency of DWD is presented as an open problem in the original DWD paper. In this work, we advance the current understanding of DWD from both computational and theoretical perspectives. We propose a novel efficient algorithm for solving DWD, and our algorithm can be several hundred times faster than the existing state-of-the-art algorithm based on the SOCP. In addition, our algorithm can handle the generalized DWD, while the SOCP algorithm only works well for a special DWD but not the generalized DWD. Furthermore, we consider a natural kernel DWD in a reproducing kernel Hilbert space and then establish the Bayes risk consistency of the kernel DWD. We compare DWD and the SVM on several benchmark data sets and show that the two have comparable classification accuracy, but DWD equipped with our new algorithm can be much faster to compute than the SVM.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/11/2013

Distance-weighted Support Vector Machine

A novel linear classification method that possesses the merits of both t...
research
02/26/2017

Support vector machine and its bias correction in high-dimension, low-sample-size settings

In this paper, we consider asymptotic properties of the support vector m...
research
01/24/2015

Sparse Distance Weighted Discrimination

Distance weighted discrimination (DWD) was originally proposed to handle...
research
05/19/2023

A Foray into Parallel Optimisation Algorithms for High Dimension Low Sample Space Generalized Distance Weighted Discrimination problems

In many modern data sets, High dimension low sample size (HDLSS) data is...
research
11/07/2002

Maximing the Margin in the Input Space

We propose a novel criterion for support vector machine learning: maximi...
research
12/05/2012

Cost-Sensitive Support Vector Machines

A new procedure for learning cost-sensitive SVM(CS-SVM) classifiers is p...
research
06/07/2018

Kernel Machines With Missing Responses

Missing responses is a missing data format in which outcomes are not alw...

Please sign up or login with your details

Forgot password? Click here to reset