Multi-category Angle-based Classifier Refit

07/19/2016
by   Guo Xian Yau, et al.
0

Classification is an important statistical learning tool. In real application, besides high prediction accuracy, it is often desirable to estimate class conditional probabilities for new observations. For traditional problems where the number of observations is large, there exist many well developed approaches. Recently, high dimensional low sample size problems are becoming increasingly popular. Margin-based classifiers, such as logistic regression, are well established methods in the literature. On the other hand, in terms of probability estimation, it is known that for binary classifiers, the commonly used methods tend to under-estimate the norm of the classification function. This can lead to biased probability estimation. Remedy approaches have been proposed in the literature. However, for the simultaneous multicategory classification framework, much less work has been done. We fill the gap in this paper. In particular, we give theoretical insights on why heavy regularization terms are often needed in high dimensional applications, and how this can lead to bias in probability estimation. To overcome this difficulty, we propose a new refit strategy for multicategory angle-based classifiers. Our new method only adds a small computation cost to the problem, and is able to attain prediction accuracy that is as good as the regular margin-based classifiers. On the other hand, the improvement of probability estimation can be very significant. Numerical results suggest that the new refit approach is highly competitive.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/28/2018

A New Angle on L2 Regularization

Imagine two high-dimensional clusters and a hyperplane separating them. ...
research
09/12/2019

A Note on Posterior Probability Estimation for Classifiers

One of the central themes in the classification task is the estimation o...
research
06/16/2023

Multi-Classification using One-versus-One Deep Learning Strategy with Joint Probability Estimates

The One-versus-One (OvO) strategy is an approach of multi-classification...
research
07/16/2020

Large scale analysis of generalization error in learning using margin based classification methods

Large-margin classifiers are popular methods for classification. We deri...
research
02/17/2021

Split Modeling for High-Dimensional Logistic Regression

A novel method is proposed to learn an ensemble of logistic classificati...
research
07/28/2018

Logistic regression and Ising networks: prediction and estimation when violating lasso assumptions

The Ising model was originally developed to model magnetisation of solid...
research
12/03/2018

Rademacher Complexity and Generalization Performance of Multi-category Margin Classifiers

One of the main open problems in the theory of multi-category margin cla...

Please sign up or login with your details

Forgot password? Click here to reset