A ROAD to Classification in High Dimensional Space

11/28/2010
by Jianqing Fan, et al.

For high-dimensional classification, it is well known that naively applying the Fisher discriminant rule leads to poor results because of diverging spectra and noise accumulation. Researchers have therefore proposed independence rules to circumvent the diverging spectra, and sparse independence rules to mitigate noise accumulation. However, in biological applications there is often a group of correlated genes responsible for clinical outcomes, and using the covariance information can significantly reduce misclassification rates. The extent of this error-rate reduction is revealed by comparing the misclassification rates of the Fisher discriminant rule and the independence rule. To realize the gain in finite samples, a Regularized Optimal Affine Discriminant (ROAD) is proposed based on a covariance penalty. ROAD selects an increasing number of features as the penalization relaxes. Further benefits can be achieved when a screening method is employed to narrow the feature pool before hitting the ROAD. An efficient Constrained Coordinate Descent (CCD) algorithm is also developed to solve the associated optimization problems. Sampling properties of oracle type are established. Simulation studies and real data analysis support the theoretical results and demonstrate the advantages of the new classification procedure under a variety of correlation structures. A delicate result on the continuous piecewise linear solution path of the ROAD optimization problem at the population level justifies the linear interpolation used in the CCD algorithm.
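The comparison between the Fisher rule and the independence rule mentioned in the abstract can be illustrated at the population level. The following is a minimal sketch, not the paper's code, and it rests on assumptions the abstract does not state: two Gaussian classes with equal priors and a common covariance matrix `sigma`, and a linear rule with direction `w`. Under those assumptions the misclassification rate of the rule is 1 - Φ(wᵀμ_d / sqrt(wᵀΣw)), where `mu_d` denotes half the difference of the class means. All function names and the toy numbers are hypothetical.

```python
import math


def normal_cdf(z):
    """Standard normal CDF via the error function."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def solve(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        s = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - s) / m[i][i]
    return x


def linear_rule_error(w, mu_d, sigma):
    """Population error of the linear rule with direction w (assumed model)."""
    n = len(w)
    signal = sum(w[i] * mu_d[i] for i in range(n))
    variance = sum(w[i] * sigma[i][j] * w[j] for i in range(n) for j in range(n))
    return 1.0 - normal_cdf(signal / math.sqrt(variance))


def fisher_direction(mu_d, sigma):
    """Fisher rule: w = Sigma^{-1} mu_d (uses the full covariance)."""
    return solve(sigma, mu_d)


def independence_direction(mu_d, sigma):
    """Independence rule: w = diag(Sigma)^{-1} mu_d (ignores correlations)."""
    return [mu_d[i] / sigma[i][i] for i in range(len(mu_d))]


# Toy example (hypothetical numbers): two strongly correlated features,
# only the first of which carries a mean difference.
rho = 0.9
sigma = [[1.0, rho], [rho, 1.0]]
mu_d = [1.0, 0.0]

fisher_err = linear_rule_error(fisher_direction(mu_d, sigma), mu_d, sigma)
indep_err = linear_rule_error(independence_direction(mu_d, sigma), mu_d, sigma)
print(f"Fisher rule error:       {fisher_err:.4f}")
print(f"Independence rule error: {indep_err:.4f}")
```

In this toy setting the second feature has no mean difference, yet the Fisher direction still uses it because it is correlated with the first feature. That is the kind of gain from covariance information that the abstract refers to. With an identity covariance matrix the two directions coincide and so do the error rates.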

research
01/21/2013

Supervised Classification Using Sparse Fisher's LDA

It is well known that in a supervised classification setting when the nu...
research
11/13/2017

Sparse quadratic classification rules via linear dimension reduction

We consider the problem of high-dimensional classification between the t...
research
05/26/2022

Unequal Covariance Awareness for Fisher Discriminant Analysis and Its Variants in Classification

Fisher Discriminant Analysis (FDA) is one of the essential tools for fea...
research
12/05/2019

A Convex Optimization Approach to High-Dimensional Sparse Quadratic Discriminant Analysis

In this paper, we study high-dimensional sparse Quadratic Discriminant A...
research
05/08/2022

Sequential Linear Discriminant Analysis in High Dimensions Using Individual Discriminant Functions

High dimensional classification has been highlighted for last two decade...
research
04/09/2018

High-dimensional Linear Discriminant Analysis: Optimality, Adaptive Algorithm, and Missing Data

This paper aims to develop an optimality theory for linear discriminant ...
research
08/13/2022

A sequential stepwise screening procedure for sparse recovery in high-dimensional multiresponse models with complex group structures

Multiresponse data with complex group structures in both responses and p...
