Supervised Classification Using Sparse Fisher's LDA

01/21/2013
by   Irina Gaynanova, et al.
0

It is well known that in a supervised classification setting when the number of features is smaller than the number of observations, Fisher's linear discriminant rule is asymptotically Bayes. However, there are numerous modern applications where classification is needed in the high-dimensional setting. Naive implementation of Fisher's rule in this case fails to provide good results because the sample covariance matrix is singular. Moreover, by constructing a classifier that relies on all features the interpretation of the results is challenging. Our goal is to provide robust classification that relies only on a small subset of important features and accounts for the underlying correlation structure. We apply a lasso-type penalty to the discriminant vector to ensure sparsity of the solution and use a shrinkage type estimator for the covariance matrix. The resulting optimization problem is solved using an iterative coordinate ascent algorithm. Furthermore, we analyze the effect of nonconvexity on the sparsity level of the solution and highlight the difference between the penalized and the constrained versions of the problem. The simulation results show that the proposed method performs favorably in comparison to alternatives. The method is used to classify leukemia patients based on DNA methylation features.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/05/2021

Classification of high-dimensional data with spiked covariance matrix structure

We study the classification problem for high-dimensional data with n obs...
research
11/28/2010

A ROAD to Classification in High Dimensional Space

For high-dimensional classification, it is well known that naively perfo...
research
12/21/2012

Optimal classification in sparse Gaussian graphic model

Consider a two-class classification problem where the number of features...
research
05/26/2022

Unequal Covariance Awareness for Fisher Discriminant Analysis and Its Variants in Classification

Fisher Discriminant Analysis (FDA) is one of the essential tools for fea...
research
09/17/2015

Sparse Fisher's Linear Discriminant Analysis for Partially Labeled Data

Classification is an important tool with many useful applications. Among...
research
05/03/2020

High Dimensional Classification for Spatially Dependent Data with Application to Neuroimaging

Discriminating patients with Alzheimer's disease (AD) from healthy subje...

Please sign up or login with your details

Forgot password? Click here to reset