Optimal Linear Classification via Eigenvalue Shrinkage: The Case of Additive Noise

03/22/2021
by   Benjamin Robinson, et al.
0

In this paper, we consider the general problem of testing the mean of two high-dimensional distributions with a common, unknown covariance using a linear classifier. Traditionally such a classifier is formed from the sample covariance matrix of some given training data, but, as is well-known, the performance of this classifier is poor when the number of training data n is not much larger than the data dimension p. We thus seek a covariance estimator to replace sample covariance. To account for the fact that n and p may be of comparable size, we adopt the "large-dimensional asymptotic model" in which n and p go to infinity in a fixed ratio. Under this assumption, we identify a covariance estimator that is detection-theoretic optimal within the general shrinkage class of C. Stein, and we give consistent estimates for the corresponding classifier's type-I and type-II errors.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/14/2023

Ledoit-Wolf linear shrinkage with unknown mean

This work addresses large dimensional covariance matrix estimation with ...
research
03/30/2020

Regularization in High-Dimensional Regression and Classification via Random Matrix Theory

We study general singular value shrinkage estimators in high-dimensional...
research
10/10/2022

Optimal Eigenvalue Shrinkage in the Semicircle Limit

Recent studies of high-dimensional covariance estimation often assume th...
research
09/01/2021

Nonasymptotic one-and two-sample tests in high dimension with unknown covariance structure

Let 𝐗 = (X_i)_1≤ i ≤ n be an i.i.d. sample of square-integrable variable...
research
11/28/2022

Double Data Piling for Heterogeneous Covariance Models

In this work, we characterize two data piling phenomenon for a high-dime...
research
09/05/2021

James-Stein estimation of the first principal component

The Stein paradox has played an influential role in the field of high di...
research
07/08/2020

Robust Bayesian Classification Using an Optimistic Score Ratio

We build a Bayesian contextual classification model using an optimistic ...

Please sign up or login with your details

Forgot password? Click here to reset